Usage
To use this project, you will primarily interact with its __main__ file from your terminal.
Running the Extraction Process
Ensure your Ollama server is running and the required models (phi4:14b and gemma3:27b) are pulled.
Activate your Poetry shell (if you haven’t already):
poetry shell
Execute the main script:
poetry run python main.py
Provide the PDF directory path: The script will prompt you to enter the path to the directory containing the PDF documents.
Please enter the path to the directory containing the PDF documents:
You should enter the full path to your PDF folder, for example:
On Windows:
C:\path\to\your\pdf_documents
On Linux/macOS:
/home/user/path/to/your/pdf_documents
Warning
Ensure the input folder contains the call for proposal PDFs. PDF filenames must include the word ‘call’ (e.g., AMIF-2024-TF2-AG-INFO-01_separator_call-…).