Extract articles from PDF page -- 3

In Progress Posted 1 year ago Paid on delivery

I need to extract articles from any PDF file like the sample attached.

You can find a sample of how the texts and regions are extracted here:

[login to view URL]

Here's a tool that promised to do the same but it's offline:

[login to view URL]

You are expected to develop an article extractor that generates a JSON or XML file from any newspaper or magazine PDF file. The image "[login to view URL]" shows how the articles should be extracted.

Technologies accepted: Java, Linux, Kotlin. The solution must be open source and cannot depend on cloud services or any other paid services.
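
The posting does not specify the output schema, so the following is only a minimal sketch of what the JSON side of the deliverable could look like in plain Java with no external or paid dependencies. The class `ArticleJson`, the `Article` fields (`page`, `title`, `text`, `region`), and the use of PDF points for the region are all assumptions made for illustration. The PDF parsing and article-segmentation steps are not shown.

```java
import java.util.Arrays;
import java.util.List;

/** Hypothetical sketch: serializes extracted articles to JSON using only the JDK. */
class ArticleJson {

    /** Hypothetical article model; the field names are assumptions, not taken from the posting. */
    public static final class Article {
        final int page;          // 1-based page number in the PDF
        final String title;      // headline text
        final String text;       // body text of the article
        final double x, y, width, height; // bounding region, assumed to be in PDF points

        public Article(int page, String title, String text,
                       double x, double y, double width, double height) {
            this.page = page;
            this.title = title;
            this.text = text;
            this.x = x;
            this.y = y;
            this.width = width;
            this.height = height;
        }
    }

    /** Escapes quotes, backslashes and control characters for use inside a JSON string. */
    static String escape(String s) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            switch (c) {
                case '"':  sb.append("\\\""); break;
                case '\\': sb.append("\\\\"); break;
                case '\n': sb.append("\\n"); break;
                case '\r': sb.append("\\r"); break;
                case '\t': sb.append("\\t"); break;
                default:
                    if (c < 0x20) {
                        // Remaining control characters become four-digit hex escapes.
                        sb.append(String.format("\\u%04x", (int) c));
                    } else {
                        sb.append(c);
                    }
            }
        }
        return sb.toString();
    }

    /** Builds a single JSON object with an "articles" array, one entry per article. */
    static String toJson(List<Article> articles) {
        StringBuilder sb = new StringBuilder("{\"articles\":[");
        for (int i = 0; i < articles.size(); i++) {
            Article a = articles.get(i);
            if (i > 0) {
                sb.append(',');
            }
            sb.append("{\"page\":").append(a.page)
              .append(",\"title\":\"").append(escape(a.title)).append('"')
              .append(",\"text\":\"").append(escape(a.text)).append('"')
              .append(",\"region\":[")
              .append(a.x).append(',').append(a.y).append(',')
              .append(a.width).append(',').append(a.height)
              .append("]}");
        }
        return sb.append("]}").toString();
    }

    public static void main(String[] args) {
        // Made-up sample data, standing in for the output of a real extraction step.
        List<Article> articles = Arrays.asList(
            new Article(1, "Sample headline", "First paragraph.\nSecond paragraph.",
                        36.0, 72.0, 250.0, 400.0));
        System.out.println(toJson(articles));
    }
}
```

Hand-rolled serialization keeps the sketch dependency-free. An open-source JSON or XML library would be an equally valid choice under the stated constraints.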

Step 1 - Development - You generate a JSON/XML file from a PDF that follows these rules and you win the project.

Step 2 - Tests - You send us the executable JAR file so we can test it with other PDF files.

Step 3 - Payment - If it works, we release 50% of the payment and you send the sources. If everything is OK with the source code, the other half is released.

Python PDF Artificial Intelligence Image Processing OpenCV

Project ID: #34290794

About the project

5 proposals Remote project Active 1 year ago

5 freelancers are bidding on average $19 for this job

jahoyz

Hi, I've read your description carefully. I have extensive experience with Python and PDF2XML, and I've also worked on several similar projects, so I can complete your project with high quality and on time. Looking forward to hear more...

$20 USD in 5 days
(10 Reviews)
3.9

stevst

Hi. How are you? As a highly skilled developer, I can help you perfectly. I am very confident in my skills and I'd like to help your business by doing my best. I always believe to make long business relation to clie...

$20 USD in 7 days
(2 Reviews)
2.3

Digitalexpertuae

Hey there, I am a professional writer with experience in Python, Image Processing, PDF, OpenCV and Artificial Intelligence. Do you need an article or any piece of content written or rewritten and you do not want to...

$10 USD in 2 days
(0 Reviews)
0.0

mahikakker16

Hi, I am Himanshi from India. My skills are data entry, Word operator, copy typing, article writing, health education tips, etc. Digital marketing is fun; it's not hard because I do work very easily and clearly in this pla...

$20 USD in 4 days
(0 Reviews)
0.0