PDF Validation

Question

Hi,&nbsp;My pdf contains details of 4 clients but I have to validate the data of only one client whose data is in 3rd page. I know the account number of the required client. How can I jump to that page and validate thedetails?

alexkaras · Accepted Answer

Hi,
&nbsp;
Basically, the approach remains the same as mentioned by&nbsp;shankar_r: find the needed page by searching for account number (search either page by page or get the whole content of the document and search there - this depends on the structure of the given document and on how it was created) and do more detailed analysis for the found account. Exact implementation steps depend on the target document.
However, the usual question that is asked in the situations like yours: do you really need to bother with PDF parsing? Isn't it acceptable, for example, to split this test into three smaller ones: first will get the raw data used to generate the document and verify that the data are as expected. This is easily automated usually. The second test will check that the generated document has correct layout: headers, logos, text columns, etc. This can be more or less easily automated using either image comparison or visual testing tools. And finally the third test, that&nbsp;may be executed manually and less often, will verify that the document contains correct data and has correct layout from the human point of view.
Thoughts?

shankar_r · Answer

To get started with PDF validation you can use this link&nbsp;https://support.smartbear.com/articles/testcomplete/testing-pdf-files-with-testcomplete/
&nbsp;
In-order to verify the text in particular page, you can do as described in below link
&nbsp;
For Ex:
&nbsp;
If you know where the client details comes, then set the start and end pages as same and find the client details in it.
&nbsp;
// Set the page index as a start page
// Note that the page index here is not zero-based
textStripperObj.setStartPage(3);

// Set the same page index as an end page
textStripperObj.setEndPage(3);

// Get the text of the page
text = textStripperObj.getText_2(docObj);

//Use the text to compare your expected values

tristaanogre · Answer

Right with you,&nbsp;AlexKaras... it again goes along the line of the statement: Everything CAN be automated, true, but not everything SHOULD be automated.&nbsp; And that SHOULD is not a moral question but more on the lines of ROI.&nbsp; You can right a LOT of code to parse out the PDF, get it working, etc...&nbsp; when what is really important is that the back end data is right, that the PDF layout is right, and that plugging that data into the PDF is right.&nbsp; The first two are VERY easily automated, the last one not so easily... but a quick manual test "Hey, the PDF looks right" is really quick.
&nbsp;
Automation supplements manual testing, it should not replace it.

jasmeenkaur27 · Answer

The issue is I do not know the page number which contains the client details.I just know the account number of that client so I search the account number in the PDF and after that I have to verify the remaining details

Forum Discussion

PDF Validation

4 Replies

Related Content

Validate Table content in PDF document

get text from first page of pdf file with 'PDF to Text'

PDF validation without Intelligent Quality add-on & PDFBox in Testcomplete

PDF Testing

Data Validation

Recent Discussions

Access a child in a tree Structure through Keyword Strokes

Turn off all Edge messageboxes

How to verify data from another application, the data which keeps changing

Audit and Reporting purposes

Textbox values disappear after adding Year value from Date Picker