Scan to PDF in Ubuntu, with Beagle Searchable Meta Data
This bash script lets you scan directly to a PDF and then search your scanned PDF's using beagle, not just by file name, but by the information (meta data), that you can save with your PDF.
I developed this script because I really want to have a paperless desk and I could not find an easy way to scan documents to PDF, (and find them again!).
A picture's worth a thousand words...
I have a "launcher" on my desktop called "scan", I just click it and....
Select colour or Grey (color or Gray) for the Americans!
data:image/s3,"s3://crabby-images/7771a/7771a2ab414371724f9f15889ded8e68c622c87a" alt=""
Then press OK for each page of the PDF you want to make. (Cancel to finish)
data:image/s3,"s3://crabby-images/0a222/0a2222a3a8685ce2e9bbb8a7ff064b193c269ad2" alt=""
Then give the PDF a file name... (jam recipe!)
data:image/s3,"s3://crabby-images/a88ea/a88ea44fc2f21bc3dcdf4569960b790432172053" alt=""
Then some meta data so beagle can find it... jam, recipe, grannys, yum.
data:image/s3,"s3://crabby-images/f30b1/f30b1b9781c102d1fb2f0e355e40546b1848d968" alt=""
Thats all !
Now search for "grannys jam" in Beagle.....
data:image/s3,"s3://crabby-images/72baa/72baacae4e10e1eff9d4bec680e90ef640f8ecd7" alt=""
And there it is !!
data:image/s3,"s3://crabby-images/9ae6d/9ae6ddf3ef21c53343f80e85c30621a5a0a0c472" alt=""
Here is the script, it was made pretty fast so......
Please feel free to mess about with it. And post improvements back here !
Programs you'll need..
zenity
ps2pdf
pdftk
scanimage
Copy the script into its own directory, chmod u+rxw. Thats all!.
-----------------------------------
#!/bin/sh
#scan to pdf with metadata, by Mac Jones, New Zealand
#http://maconstuff.blogspot.com/
#scan a batch
#decide grey or colour (gray or color for the Americans!)
colour=`zenity --list --title "Color or Gray?" --radiolist --column "-" --column "Scan" TRUE Gray FALSE Color`
a=0 #page counter
cont=1 #should we continue?
until [ $cont -eq "0" ] #keep doing it until cont variable is not a zero.
do
echo -n "$a "
let "a+=1"
if zenity --question --text "OK to scan a page, Cancel to finish, Page=$a" --title "Scanning pages"
then
cont=1
scanimage --format pnm --resolution 150 --mode $colour > "$a.pnm"
else
cont=0
fi
done # No surprises, so far.
#convert the raw file to postscript
convert -density 150 *.pnm out.ps | zenity --progress --auto-close --title "Converting to Postscript"
#convert the postscript to pdf
ps2pdf out.ps out.pdf | zenity --progress --auto-close --title "Converting Postscript to PDF"
#remove raw scan files
rm *.pnm
#remove old ps files
rm out.ps
#beep to get attention after processing
echo -e "\a"
#add the metadata and file name.
#this meta data can be searched from Beagle in Ubuntu.
#echo "Please enter a name for the PDF file (** no .pdf on end)"
nm=`zenity --entry --text "Enter file name, (no .pdf on the end)" --title "File Name?"`
#echo "Please enter Metadata for searching"
meta=`zenity --entry --text "Meta data for searching" --entry-text=$nm --title "Meta Data for Searching"`
echo "InfoKey: Producer" > tmp
echo "InfoValue: $meta" >> tmp
echo "InfoKey: Keywords" >> tmp
echo "InfoValue: $meta" >> tmp
echo "InfoKey: Title" >> tmp
echo "InfoValue: $nm" >> tmp
#update the metadata
pdftk out.pdf update_info tmp output "$nm.pdf"
#rm metadata file and pdf
rm tmp
rm out.pdf
zenity --info --text="All done, $nm.pdf is ready!" --title "Thanks!"
--------------------
I developed this script because I really want to have a paperless desk and I could not find an easy way to scan documents to PDF, (and find them again!).
A picture's worth a thousand words...
I have a "launcher" on my desktop called "scan", I just click it and....
Select colour or Grey (color or Gray) for the Americans!
data:image/s3,"s3://crabby-images/7771a/7771a2ab414371724f9f15889ded8e68c622c87a" alt=""
Then press OK for each page of the PDF you want to make. (Cancel to finish)
data:image/s3,"s3://crabby-images/0a222/0a2222a3a8685ce2e9bbb8a7ff064b193c269ad2" alt=""
Then give the PDF a file name... (jam recipe!)
data:image/s3,"s3://crabby-images/a88ea/a88ea44fc2f21bc3dcdf4569960b790432172053" alt=""
Then some meta data so beagle can find it... jam, recipe, grannys, yum.
data:image/s3,"s3://crabby-images/f30b1/f30b1b9781c102d1fb2f0e355e40546b1848d968" alt=""
Thats all !
Now search for "grannys jam" in Beagle.....
data:image/s3,"s3://crabby-images/72baa/72baacae4e10e1eff9d4bec680e90ef640f8ecd7" alt=""
And there it is !!
data:image/s3,"s3://crabby-images/9ae6d/9ae6ddf3ef21c53343f80e85c30621a5a0a0c472" alt=""
Here is the script, it was made pretty fast so......
Please feel free to mess about with it. And post improvements back here !
Programs you'll need..
zenity
ps2pdf
pdftk
scanimage
Copy the script into its own directory, chmod u+rxw. Thats all!.
-----------------------------------
#!/bin/sh
#scan to pdf with metadata, by Mac Jones, New Zealand
#http://maconstuff.blogspot.com/
#scan a batch
#decide grey or colour (gray or color for the Americans!)
colour=`zenity --list --title "Color or Gray?" --radiolist --column "-" --column "Scan" TRUE Gray FALSE Color`
a=0 #page counter
cont=1 #should we continue?
until [ $cont -eq "0" ] #keep doing it until cont variable is not a zero.
do
echo -n "$a "
let "a+=1"
if zenity --question --text "OK to scan a page, Cancel to finish, Page=$a" --title "Scanning pages"
then
cont=1
scanimage --format pnm --resolution 150 --mode $colour > "$a.pnm"
else
cont=0
fi
done # No surprises, so far.
#convert the raw file to postscript
convert -density 150 *.pnm out.ps | zenity --progress --auto-close --title "Converting to Postscript"
#convert the postscript to pdf
ps2pdf out.ps out.pdf | zenity --progress --auto-close --title "Converting Postscript to PDF"
#remove raw scan files
rm *.pnm
#remove old ps files
rm out.ps
#beep to get attention after processing
echo -e "\a"
#add the metadata and file name.
#this meta data can be searched from Beagle in Ubuntu.
#echo "Please enter a name for the PDF file (** no .pdf on end)"
nm=`zenity --entry --text "Enter file name, (no .pdf on the end)" --title "File Name?"`
#echo "Please enter Metadata for searching"
meta=`zenity --entry --text "Meta data for searching" --entry-text=$nm --title "Meta Data for Searching"`
echo "InfoKey: Producer" > tmp
echo "InfoValue: $meta" >> tmp
echo "InfoKey: Keywords" >> tmp
echo "InfoValue: $meta" >> tmp
echo "InfoKey: Title" >> tmp
echo "InfoValue: $nm" >> tmp
#update the metadata
pdftk out.pdf update_info tmp output "$nm.pdf"
#rm metadata file and pdf
rm tmp
rm out.pdf
zenity --info --text="All done, $nm.pdf is ready!" --title "Thanks!"
--------------------