Search Text in PDF#

This feature is designed to search for specific text within a PDF document. It is capable of searching text even in scanned PDFs, making it a versatile tool for document analysis and information retrieval.

Make Step

Input#

Name

Description

Required

Import Options

Choose the input source, either Upload a File or Import a File From URL.

Yes


Upload a File#

Name

Description

Required

Data

Upload a file using raw binary data from another module. Note: This requires additional credits as it first uploads to PDF.co Temporary Files Storage.

Yes

Output File Name

Specify a custom file name for the output file.

No


Import a File From URL#

Name

Description

Required

URL

Provide the URL to the source PDF document, or a filetoken:// link from PDF.co Built-In Files Storage. If you use another cloud service such as Google Drive or Dropbox ensure the link is publicly accessible.

Yes

Output File Name

Specify a custom file name for the output file.

No


Name

Description

Required

Text to Search

Specify the text that you wish to search for in the PDF.

Yes

Use Regular Expressions

Opt to use regular expressions for more complex search patterns. For instance, to find a SSN format, use [0-9]{3}-[0-9]{2}-[0-9]{4}.

No

Execution Mode

Select Sync for small tasks up to 10 seconds. Choose Async for standard jobs, or Async For Large Docs for tasks over 30 seconds. Use Job Check module for retrieving results in large tasks.

No

Pages

Enter a comma-separated list of page indices (or ranges) for processing. Leave empty for all pages. The first page is numbered 0 (zero). Example: 0,1-2,5-.

No

Password

If the PDF is password-protected, enter the password here.

No

Profiles

Add custom options for the process in a JSON string format. See API Profiles for more details.

No

Output Links Expiration

Set the expiration time in minutes for output links. Default is 60 minutes. Increase this limit with a Business Plan or higher, see plans here for details.

No

Export Type

Choose between Download a File or JSON Output. Default is JSON Output.

No

Integrating External File Sources#

Note

Streamline your Make workflows with external file sources like Google Drive and Dropbox using their unique actions. Discover efficient integration strategies in our guide: File Source Integrations in Make.


Output#

Name

Description

url

This is the temporary URL provided by the PDF.co file server.

body

An object array containing search results such as text, left, top, width, height, pageIndex and others.

Data

Represents the output binary data. This data is generated only when the Export Type option is set to Download a File.

Status

Indicates the response status code. A success status is returned if the operation is successful.

outputLinkValidTill

Specifies the timestamp until which the url remains accessible.

error

Provides details about any errors encountered during the process, if applicable.

name

The designated name of the output file.

Job Id

A unique identifier assigned to the job.

credits

The amount of credits utilized for the process.

Remaining Credits

Displays the balance of credits available in your account.

duration

The duration of time the process took to complete.