Automate the Download of PDF files from Chrome
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Automate the Download of PDF files from Chrome
Does anyone know of a reliable way to interact with Adobe Reader embedded within Chrome to download PDF files? It appears to be using a 'Flex' control to download. I've tried to set the default download path, but this doesn't always automatically download the generated PDF. There are still dialogs that have to be interacted with.
Thanks!
- Labels:
-
Scripting
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Let's see the code that you tried
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
And here under this privacy and security you need to scroll down. And here we have the content settings click on that. And you need to find here the PDF. Ok. So just find the PDF. Section.
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Scroll to the privacy & security settings and click Site Settings. On the site settings page, click pdf documents. On the page that follows, turn on the download pdf files instead of automatically opening them in chrome option.
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
It looks like there are three scenarios that we are dealing with. First, I needed to switch-off the Adobe Acrobat Chrome Extension:
1. Direct Path to PDF generated on web server
Solution: Use Java IO Buffered Intput Stream.
//var url = JavaClasses.java_net.URL.newInstance("http://pathToServer/ServerName?&reportname=Custom_Bol_##########&type=pdf&fileextn=.pdf,Custom_Bol_W...");
var url = JavaClasses.java_net.URL.newInstance("http://pathToPDF/PDFile.pdf");
var inputStream = url.openStream();
var fileParse = JavaClasses.java_io.BufferedInputStream.newInstance(inputStream);
var docObj = JavaClasses.org_apache_pdfbox_pdmodel.PDDocument.load_3(fileParse);
var textStripperObj = JavaClasses.org_apache_pdfbox_text.PDFTextStripper.newInstance()
outputString = textStripperObj.getText(docObj);
docObj.close();
Log.Message("Output PDF String = " +outputString)
2. PDF loaded using a long URL and displayed in non-preview mode
Solution: In this scenerio, I can type 'CTRL S', and save via the Open File Dialog to the folder I need. This I'm still working on find a reliable method to wait for the PDF file to load in Chrome (Other than a static wait). After downloaded, I can use the following code:
var firstPDF = ProjectSuite.Path + "\pathToPDF\\BOL_Test.pdf";
var outputString = "";
var orderNumber = "60022500"
var browser = Aliases.browser;
browser.pageReportserv.Keys("^s");
var dlgSaveAs = browser.dlgSaveAs;
var comboBox = dlgSaveAs.DUIViewWndClassName.Explorer_Pane.FloatNotifySink.ComboBox;
comboBox.SetText(firstPDF);
dlgSaveAs.btnSave.ClickButton();
var doc = JavaClasses.java_io.File.newInstance(firstPDF)
var docObj = JavaClasses.org_apache_pdfbox_pdmodel.PDDocument.load_2(doc);
var textStripperObj = JavaClasses.org_apache_pdfbox_text.PDFTextStripper.newInstance()
outputString = textStripperObj.getText(docObj);
docObj.close();
Log.Message("Output PDF String = " +outputString)
// Run a basic test to find a string of text
if (aqString.Find(outputString, orderNumber) != -1)
Log.Message("Found Order #: " +orderNumber)
else
Log.Error("Didn't find Order # in search string.");
3. PDF Generated in Preview Mode (Read Only)
Solution: I'm still working on a way to either download using CTRL 'S' or Buffered Input Stream. The URL is also lond a has many special characters (for spaces, etc.).
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
I also want to be able to control where the downloaded file gets put, so that I can locate and run a comparison or test text content. I don't want all files downloading to the same Chrome Download folder. I want to set this programatically, since we run these via Jenkin's pipeline jobs.
