Portable Document Format (PDF), standardized as ISO 32000, is a file format developed by Adobe in 1992 to present documents, including text formatting and images, in a manner independent of application software, hardware, and operating systems.
The PDF File module decodes the base64-encoded data of the PDF document and then loads its content.
If a textSplitter is provided, the module uses it to split the extracted text content.
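The flow described above (decode the base64 data, load the content, optionally split it) can be sketched roughly as follows. This is a minimal illustration, not the module's actual implementation: the names `load_pdf_documents`, `extract_pages`, `split_text`, and `per_page` are hypothetical, the handling of a data-URI prefix is an assumption, and the PDF parsing itself is left to a caller-supplied function, so a stand-in extractor is used for the demonstration.

```python
import base64
from typing import Callable, List, Optional


def load_pdf_documents(
    encoded: str,
    extract_pages: Callable[[bytes], List[str]],
    split_text: Optional[Callable[[str], List[str]]] = None,
    per_page: bool = True,
) -> List[str]:
    """Decode base64 PDF data, extract its text, and optionally split it."""
    # Assumption: the payload may carry a data-URI prefix; drop it if present.
    if encoded.startswith("data:") and "," in encoded:
        encoded = encoded.split(",", 1)[1]
    pdf_bytes = base64.b64decode(encoded)

    # extract_pages is supplied by the caller (any PDF parser)
    # and returns one string per page.
    pages = extract_pages(pdf_bytes)

    # One document per page, or all pages joined into one document per file.
    documents = pages if per_page else ["\n".join(pages)]

    # Without a splitter the documents are returned unchanged.
    if split_text is None:
        return documents
    return [chunk for doc in documents for chunk in split_text(doc)]


# Stand-in extractor for demonstration only: treats form feeds as page breaks.
def fake_extract_pages(data: bytes) -> List[str]:
    return data.decode("utf-8").split("\f")


payload = base64.b64encode(b"page one\fpage two").decode("ascii")
per_page_docs = load_pdf_documents(payload, fake_extract_pages)
per_file_docs = load_pdf_documents(payload, fake_extract_pages, per_page=False)
split_docs = load_pdf_documents(payload, fake_extract_pages, split_text=str.split)
```

Passing the extractor and splitter in as plain callables keeps the sketch independent of any particular PDF library or text-splitter class; in practice both would be replaced by real implementations.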
Text Splitter (optional)
PDF File
Usage
One Document per Page or One Document per File
loads PDF content