{"id":1597,"date":"2013-08-24T23:27:32","date_gmt":"2013-08-24T21:27:32","guid":{"rendered":"http:\/\/raftaman.net\/?p=1597"},"modified":"2021-05-15T11:46:38","modified_gmt":"2021-05-15T09:46:38","slug":"search-contents-of-multiple-pdf-files","status":"publish","type":"post","link":"https:\/\/possiblelossofprecision.net\/?p=1597","title":{"rendered":"Search contents of multiple pdf files"},"content":{"rendered":"<p>pdftotext is a handy little tool to convert pdf files to plain text. It&#8217;s part of Fedora&#8217;s <code>poppler-utils<\/code> package and can be installed with yum:<\/p>\n<pre class=\"brush: bash; title: ; notranslate\" title=\"\">\r\n# yum install poppler-utils\r\n<\/pre>\n<p>You can use pdftotext and the <a href=\"http:\/\/www.gnu.org\/software\/findutils\/\">GNU findutils<\/a> to search for a specific string in multiple pdf documents:<\/p>\n<pre class=\"brush: bash; title: ; notranslate\" title=\"\">\r\n$ find \/path -name '*.pdf' -ls -exec pdftotext {} - \\; | grep -i &quot;string&quot;\r\n<\/pre>\n<p>See also <a href=\"http:\/\/linux.die.net\/man\/1\/pdftotext\">pdftotext manpage<\/a>, <a href=\"http:\/\/linux.die.net\/man\/1\/find\">find manpage<\/a>, <a href=\"http:\/\/linux.die.net\/man\/1\/grep\">grep manpage<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>pdftotext is a handy little tool to convert pdf files to plain text. It&#8217;s part of Fedora&#8217;s poppler-utils package and can be installed with yum: # yum install poppler-utils You can use pdftotext and the GNU findutils to search for a specific string in multiple pdf documents: $ find \/path -name &#8216;*.pdf&#8217; -ls -exec pdftotext {} &#8211; \\; | grep&#8230; <a href=\"https:\/\/possiblelossofprecision.net\/?p=1597\">Read more &raquo;<\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[28],"class_list":["post-1597","post","type-post","status-publish","format-standard","hentry","category-uncategorized","tag-bash"],"_links":{"self":[{"href":"https:\/\/possiblelossofprecision.net\/index.php?rest_route=\/wp\/v2\/posts\/1597","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/possiblelossofprecision.net\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/possiblelossofprecision.net\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/possiblelossofprecision.net\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/possiblelossofprecision.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=1597"}],"version-history":[{"count":6,"href":"https:\/\/possiblelossofprecision.net\/index.php?rest_route=\/wp\/v2\/posts\/1597\/revisions"}],"predecessor-version":[{"id":2177,"href":"https:\/\/possiblelossofprecision.net\/index.php?rest_route=\/wp\/v2\/posts\/1597\/revisions\/2177"}],"wp:attachment":[{"href":"https:\/\/possiblelossofprecision.net\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=1597"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/possiblelossofprecision.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=1597"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/possiblelossofprecision.net\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=1597"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}