Turn your PDF documents into a Knowledge Base that your AI Agents can search. This method extracts the text from your PDFs, then organizes and indexes it for retrieval.

What you get

  • Text extraction from PDF documents
  • Content organization and indexing for optimal search
  • Support for multiple PDF files in a single Knowledge Base
  • Efficient processing for large document collections

Perfect for

  • Technical manuals and product documentation
  • Research papers and academic publications
  • Legal documents and contracts
  • Training materials and educational content

Step-by-step creation process

Click PDF card

Click on the Documents (PDF) card to start creating a Knowledge Base from PDF files.
Knowledge Base page showing PDF card option
This opens the “Add Knowledge Base” modal where you can configure your PDF Knowledge Base.

Configure Knowledge Base

In the modal, set up your Knowledge Base details:
Add Knowledge Base modal with PDF configuration
Required information:
  • Name: Enter a name for your Knowledge Base (e.g., “pdf knowledge”).
  • Upload resources: Add your PDF files to the Knowledge Base.
File upload options:
  • Drag and drop PDF files into the upload area.
  • Click “browse files” to select files from your computer.
  • Multiple PDF files can be uploaded simultaneously.
File requirements: Only PDF files are supported. Ensure your PDFs contain readable text for optimal processing.
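If you want to pre-check files on your own machine before uploading, the sketch below shows one way to do it. It is not part of the platform: `looks_like_pdf` and `split_uploads` are hypothetical helper names, and the check relies only on the fact that conforming PDF files begin with the `%PDF-` header. It cannot tell whether a PDF contains readable text, so a scanned PDF without a text layer passes this check too.

```python
from pathlib import Path

PDF_MAGIC = b"%PDF-"  # conforming PDF files start with this header


def looks_like_pdf(path: Path) -> bool:
    """Return True if the file has a .pdf extension and a PDF header."""
    if path.suffix.lower() != ".pdf":
        return False
    with path.open("rb") as handle:
        return handle.read(len(PDF_MAGIC)) == PDF_MAGIC


def split_uploads(paths: list[Path]) -> tuple[list[Path], list[Path]]:
    """Separate candidate files into (accepted, rejected) lists."""
    accepted = [p for p in paths if looks_like_pdf(p)]
    rejected = [p for p in paths if p not in accepted]
    return accepted, rejected
```

Calling `split_uploads` on the files you plan to upload gives you a list of rejected files to fix or convert before you drag the rest into the upload area.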

Upload files

After adding files, click the green Upload files button to upload them to the platform.
Files selected and ready for upload
Upload process:
  • Files are uploaded to cloud storage (AWS S3).
  • Upload progress is displayed with percentage and time remaining.
  • Individual files can be paused or canceled during upload.
  • All files must be uploaded before creating the Knowledge Base.
Upload Status: Wait for all files to complete uploading before proceeding to the next step.

Monitor Upload Progress

Track the upload progress of your PDF files.
Upload progress showing percentage and time remaining
Upload controls:
  • Progress bar: Displays the overall upload progress.
  • File status: Shows the status of individual file uploads with options to pause or cancel.
  • Time remaining: Provides an estimate of the time left to complete the upload.
  • Cancel options: Allows you to pause or cancel individual files or the entire upload.
Upload Interruption: Canceling an upload means you will need to restart the file upload process.
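How the platform computes the percentage and the time remaining is not documented here. As an illustration only, a common approach is to divide the bytes still to send by the average throughput so far. The function name `estimate_upload` and its parameters are assumptions made for this sketch.

```python
from typing import Optional


def estimate_upload(
    bytes_sent: int, total_bytes: int, elapsed_seconds: float
) -> tuple[float, Optional[float]]:
    """Return (percent_complete, seconds_remaining).

    seconds_remaining is None until some throughput has been measured.
    """
    if total_bytes <= 0:
        raise ValueError("total_bytes must be positive")
    percent = 100.0 * bytes_sent / total_bytes
    if bytes_sent <= 0 or elapsed_seconds <= 0:
        return percent, None  # nothing sent yet, so no rate to extrapolate
    rate = bytes_sent / elapsed_seconds  # average bytes per second so far
    remaining = (total_bytes - bytes_sent) / rate
    return percent, remaining
```

For example, `estimate_upload(25_000_000, 100_000_000, 10.0)` returns `(25.0, 30.0)`: a quarter of the file took 10 seconds, so the remaining three quarters should take about 30 seconds at the same rate.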

Upload Complete

Once all files are uploaded successfully, you can proceed to create the Knowledge Base.
Upload complete with all files ready
Upload confirmation:
  • All files display green checkmarks, indicating successful upload.
  • “Upload complete” status is shown.
  • Files can still be removed using the ‘X’ icon if needed.
  • Click Add Knowledge Base to create the Knowledge Base.
Ready to create: All files are now uploaded and ready for Knowledge Base creation.

Knowledge Base Created

After clicking Add Knowledge Base, you’re redirected to the Knowledge Base configuration page.
Knowledge base configuration page with training options
Configuration page features:
  • Data resources: Displays your uploaded PDF files.
  • Training status: Shows “Not Trained” status for new files.
  • Add resource: Button to add more PDF files.
  • Start training: Button to initiate training of the Knowledge Base.
  • Delete option: Three dots menu to delete individual resources.
Training Required: Your Knowledge Base must be trained before it can be used by AI Agents.

Start Training

Click the Start Training button to begin processing your PDF files.
Training progress with percentage and status
Training phase:
  • Initiate training: Click the Start Training button to start the AI processing.
  • Data reading: The system reads and imports the stored data resources for processing.
  • Text chunking: The text is divided into smaller, manageable chunks for efficient processing.
  • Embedding generation: These text chunks are converted into vector embeddings using AI models.
  • Vector storage: The generated embeddings are stored in a vector database such as Qdrant or Weaviate for efficient retrieval.
  • Index optimization: The system optimizes the index to enhance search and retrieval performance.
Training in Progress: Wait for training to complete before using the Knowledge Base.
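To make the chunking, embedding, and storage steps concrete, here is a minimal sketch of the general technique. It is not the platform's implementation. It assumes the PDF text has already been extracted, uses a hashed bag-of-words vector (`toy_embed`) as a stand-in for a real embedding model, and uses a plain Python list (`InMemoryVectorStore`) as a stand-in for a vector database such as Qdrant or Weaviate. All names, chunk sizes, and dimensions are hypothetical.

```python
import hashlib
import math


def chunk_text(text: str, chunk_size: int = 50, overlap: int = 10) -> list[str]:
    """Split text into windows of chunk_size words that overlap by `overlap` words."""
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be >= 0 and smaller than chunk_size")
    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break  # the last window already reached the end of the text
    return chunks


def toy_embed(text: str, dims: int = 64) -> list[float]:
    """Stand-in for an embedding model: hashed bag-of-words, L2-normalized."""
    vector = [0.0] * dims
    for word in text.lower().split():
        digest = hashlib.sha256(word.encode("utf-8")).digest()
        vector[int.from_bytes(digest[:4], "big") % dims] += 1.0
    norm = math.sqrt(sum(v * v for v in vector))
    return [v / norm for v in vector] if norm else vector


class InMemoryVectorStore:
    """Stand-in for a vector database: keeps (vector, chunk) pairs in a list."""

    def __init__(self) -> None:
        self._items: list[tuple[list[float], str]] = []

    def add(self, chunk: str) -> None:
        self._items.append((toy_embed(chunk), chunk))

    def search(self, query: str, top_k: int = 3) -> list[tuple[float, str]]:
        """Return the top_k (cosine similarity, chunk) pairs for the query."""
        q = toy_embed(query)
        scored = [
            (sum(a * b for a, b in zip(q, vec)), chunk)
            for vec, chunk in self._items
        ]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return scored[:top_k]


def train(documents: dict[str, str], chunk_size: int = 50, overlap: int = 10) -> InMemoryVectorStore:
    """Chunk every document's extracted text and store one vector per chunk."""
    store = InMemoryVectorStore()
    for text in documents.values():
        for chunk in chunk_text(text, chunk_size, overlap):
            store.add(chunk)
    return store
```

Because the vectors are normalized, the dot product in `search` is the cosine similarity, so a query that shares more words with a chunk scores closer to 1. A real embedding model captures meaning rather than exact word overlap, which is why the platform uses AI models for this step.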

Training Complete

Once training is finished, your Knowledge Base is ready for use.
Training complete with file details and word counts
Training results:
  • Status: “Trained” with a green indicator
  • File details: Displays all processed files with their URLs
  • Word count: Shows the extracted word count from each file
  • Success message: “All resources have been successfully trained”
  • Progress bar: Indicates 100% completion
Knowledge Base Ready: Your PDF Knowledge Base is now trained and ready to be used by AI Agents.

Add More Resources (Optional)

You can add additional PDF files to your existing Knowledge Base.
Add more data resources to existing Knowledge Base
Adding resources:
  • Click + Add Resource to add more PDF files.
  • Follow the same upload process as the initial file upload.
  • New files will display a “Not Trained” status.
  • You can add multiple resources to the same Knowledge Base.
Resource Management: Each resource can be managed individually with its own training status.

Retrain Knowledge Base

After adding new resources, retrain the Knowledge Base to include the new files.
Retraining process with new resources
Retraining process:
  • Start training: The button appears after adding new resources.
  • Training progress: Displays progress for all resources.
  • Status updates: Provides individual resource status updates during training.
  • Completion: All resources display a “Trained” status when complete.
Refresh Page: If the “Start Training” button doesn’t appear, refresh the page to see the updated interface.
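The relationship between adding resources and retraining can be pictured with a small data model. This is an illustrative sketch, not the platform's API: the `Resource` and `KnowledgeBase` classes and their methods are hypothetical, and the `train` method simply marks every resource as trained without doing any real processing.

```python
from dataclasses import dataclass, field

NOT_TRAINED = "Not Trained"
TRAINED = "Trained"


@dataclass
class Resource:
    filename: str
    status: str = NOT_TRAINED  # new resources start untrained


@dataclass
class KnowledgeBase:
    name: str
    resources: list[Resource] = field(default_factory=list)

    def add_resource(self, filename: str) -> Resource:
        """Attach a new file; it stays "Not Trained" until the next training run."""
        resource = Resource(filename)
        self.resources.append(resource)
        return resource

    def needs_training(self) -> bool:
        """True while at least one resource has not been trained."""
        return any(r.status == NOT_TRAINED for r in self.resources)

    def train(self) -> None:
        """Placeholder for a training run: mark every resource as trained."""
        for resource in self.resources:
            resource.status = TRAINED
```

In this model, `needs_training()` becomes true again as soon as a file is added to a trained Knowledge Base, which mirrors the point at which the training button reappears in the interface.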

Retraining Complete

When retraining finishes, all resources, including the newly added files, have been processed successfully.
All resources trained successfully
Final status:
  • All resources trained: Green indicators for all files
  • Training complete: 100% progress bar
  • Success message: “All resources have been successfully trained”
  • Ready for use: Knowledge Base is fully operational
Knowledge Base Complete: Your PDF Knowledge Base is now fully trained and ready for AI Agent integration.

View in Dashboard

Your Knowledge Base is now available in the main Knowledge Base dashboard.
Knowledge base visible in dashboard with configure option
Dashboard view:
  • Knowledge Base card: Shows your created Knowledge Base
  • Resource count: Displays number of data sources
  • Last updated: Shows creation/update timestamp
  • Training status: “Trained” indicator
  • Configure button: Click to access the Knowledge Base configuration page
Access configuration: Click the “Configure” button to return to the training and management page for your Knowledge Base.