Skip to content

[Feature Request] Any plans to integrate opendataloader-pdf? #7577

@alaap001

Description

@alaap001

Problem Description

https://github.com/opendataloader-project/opendataloader-pdf

This is the new SOTA pdf parser, we should look into adding support for these type of parsers, is anyone in team working on this?
or should I open a pull request for this?

Proposed Solution

We can likely improve accuracy of RAG based pipelines if we use opendataloader-pdf type of repos.

Alternatives Considered

No response

Additional Context

No response

Would you like to work on this?

  • Yes, I’d love to work on it!
  • I’m open to collaborating but need guidance.
  • No, I’m just sharing the idea.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions