Data Processing/Scraping from Standard Format txt Files

In Progress Posted Oct 8, 2013 Paid on delivery

Hi, we are looking to hire someone to convert existing data files (a web link will be provided), which are in a standard .txt format with numeric and text entries, into a format used for computing.

1) We would like you to start by taking 100 of the entries (randomly selected with a random number generator) from one of the 30 files we will give you.

2) We would like you to transform these 100 entries into a matrix in .csv form based on pre-specified categories given by us. Two of the columns are word and word count. Another is entry ID.

3) We would also like a sparse representation of the word and word count columns: a new matrix whose rows are entry numbers and whose columns are word labels, filled with the counts. The form this takes depends on the size of the file. We can talk about this.

4) The deliverable should be in manageable csv file sizes, which won't be a problem for this data...

However, we will definitely have more work if this is done successfully (covering all files, with more entries needed), so scalable routines are highly encouraged. We are thinking about a million entries, with a higher budget, if this goes well.
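As a rough illustration of steps 1 to 3, here is a minimal Python sketch using only the standard library. The file layout is not specified in this posting, so the sketch ASSUMES entries are separated by blank lines, that the entry ID is the entry's position in the file, and that words are runs of letters; the function names (`split_entries`, `sample_entries`, `long_rows`, `wide_matrix`, `to_csv`) are hypothetical and would need adapting to the real format and categories.

```python
import csv
import io
import random
import re
from collections import Counter


def split_entries(text):
    """Split raw text into entries. ASSUMPTION: entries are blank-line separated."""
    return [block.strip() for block in re.split(r"\n\s*\n", text) if block.strip()]


def sample_entries(entries, k, seed=0):
    """Pick k entries at random without replacement, keeping 1-based entry IDs."""
    rng = random.Random(seed)  # fixed seed so the sample can be reproduced
    picked = sorted(rng.sample(range(len(entries)), k))
    return [(i + 1, entries[i]) for i in picked]


def long_rows(sampled):
    """Yield (entry_id, word, count) rows, one per distinct word per entry."""
    for entry_id, text in sampled:
        # ASSUMPTION: a "word" is a lowercase run of letters or apostrophes.
        counts = Counter(re.findall(r"[a-z']+", text.lower()))
        for word, count in sorted(counts.items()):
            yield entry_id, word, count


def wide_matrix(rows):
    """Pivot long rows into an entry-by-word count matrix (header, data rows)."""
    rows = list(rows)
    vocab = sorted({word for _, word, _ in rows})
    col = {word: j for j, word in enumerate(vocab)}
    table = {}
    for entry_id, word, count in rows:
        table.setdefault(entry_id, [0] * len(vocab))[col[word]] = count
    return ["entry_id"] + vocab, [[eid] + table[eid] for eid in sorted(table)]


def to_csv(header, rows):
    """Render a header and rows as CSV text."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(header)
    writer.writerows(rows)
    return buf.getvalue()


# Tiny made-up input; the real pilot would sample k=100 from one of the files.
raw = "the cat sat\n\nthe dog and the cat\n\na bird"
sampled = sample_entries(split_entries(raw), k=2, seed=1)
rows = list(long_rows(sampled))
print(to_csv(["entry_id", "word", "count"], rows))
header, matrix = wide_matrix(rows)
print(to_csv(header, matrix))
```

The long table stores one row per (entry, word) pair, so it grows with the text actually present, while the entry-by-word matrix grows with entries times vocabulary size and is mostly zeros. That difference is why the matrix form depends on file size.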

Thank you very much.

Please note that we will only hire someone who can do this automatically, since we are primarily looking for FUTURE work. This is just a pilot.
Once we go from 100 entries to 1 million, manual typing will not work. We realize that file size will be an issue depending on the matrix, so if things eventually need to be broken apart into, let's say, 1000 files of 1000 entries, we will use these with parallel computing routines for our computations. Thank you so much, and we look forward to working with you.
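The splitting into files of 1000 entries could be sketched as follows. This is an illustrative Python sketch, not a prescribed method: it ASSUMES one CSV row per entry, and the helper names (`chunked`, `write_chunks`) and the `part_00001.csv` naming scheme are hypothetical. Rows are consumed lazily, so the full data set never has to fit in memory.

```python
import csv
import itertools
import tempfile
from pathlib import Path


def chunked(rows, size):
    """Yield successive lists of at most `size` items from any iterable."""
    it = iter(rows)
    while True:
        chunk = list(itertools.islice(it, size))
        if not chunk:
            return
        yield chunk


def write_chunks(rows, header, out_dir, size=1000):
    """Write rows to out_dir/part_00001.csv, part_00002.csv, ...; return the paths."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    paths = []
    for n, chunk in enumerate(chunked(rows, size), start=1):
        path = out_dir / f"part_{n:05d}.csv"
        with path.open("w", newline="", encoding="utf-8") as f:
            writer = csv.writer(f)
            writer.writerow(header)  # repeat the header so each file stands alone
            writer.writerows(chunk)
        paths.append(path)
    return paths


# Made-up data: 2,500 one-row entries written 1,000 per file.
with tempfile.TemporaryDirectory() as tmp:
    entries = ((i, f"text {i}") for i in range(1, 2501))
    paths = write_chunks(entries, ["entry_id", "text"], tmp, size=1000)
    print([p.name for p in paths])
```

Because every file carries its own header and a fixed number of entries, each one can be handed to a separate worker without any shared state.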

Big Data Sales, Data Entry, Data Mining, Data Processing, Web Scraping

Project ID: #5006785

About the project

40 proposals Remote project Active Oct 9, 2013

40 freelancers are bidding on average $141 for this job

jaylancer43

Hello - I am an expert techno-functional analyst with vast experience in many areas of the IT industry, including Excel Macros. I am an Engineering Graduate with an MBA degree. If you see, I am among the niche bid...

$111 USD in 3 days
(414 Reviews)
8.0
Toperfection

Dear "statsphd", hope you are doing well. I have reviewed the project details and would like to offer our services. We have completed many Research/Data collection/Product add/Data mining assignments on [login to view URL]...

$151 USD in 3 days
(168 Reviews)
7.8
uumairkhalid

Hi.. Expert web scraper/Data Miner here. Interested in your project. I assure you 100% accurate and good quality work. Regards

$105 USD in 3 days
(189 Reviews)
7.1
tjawad17

Hello Sir, We are a professional company specialized in Data Mining and Web Scraping. We have our own server, team and tools for data mining and scraping efficiently and accurately. We can parse your given text...

$155 USD in 4 days
(165 Reviews)
6.9
happy2helpp

Respected sir, We read the project description and have a complete idea of the project. We are experts in Big Data, Data Entry, Data Mining, Data Processing and Web Scraping!!! We have worked on many similar tasks before and...

$231 USD in 4 days
(84 Reviews)
6.9
diamond247

Hello Sir, We are a large, established company with highly skilled operators who have a lot of experience in this segment. Our employees have completed more than 300 similar jobs. I have gone through your project specification, I...

$144 USD in 3 days
(243 Reviews)
7.1
ashok7925

Hi, I am very interested in this work. Please share more details along with a sample text file, and describe what you would like done. I can automate the whole process once I understand your requirements. Please sha...

$100 USD in 3 days
(33 Reviews)
5.3
elMancha

Hello there. I have strong Excel and Visual Basic skills with great professionalism. I study electronics and computer engineering at Oporto university and I'm looking for work to fill the blanks in my schedule. I'...

$60 USD in 3 days
(40 Reviews)
5.0
arvt

Hi, I'm interested and I'd like to know more details about your project so I can bid accordingly. I have experience writing programs and scripts in some projects here and on other freelancer sites. I have Skype, Gtalk, MS...

$35 USD in 3 days
(12 Reviews)
4.9
mohanlg

Hi, I am interested in doing this project. I am an expert in data conversion work. Please send me more details so I can start. Thanks, sunny

$35 USD in 2 days
(25 Reviews)
4.3
RajakScripts

Hi, Please attach the .txt file AND a matrix in .csv form based on your given pre-specified categories for review, so I can adjust my bid & delivery time precisely. Yes, I am aware that you want this to be perform...

$88 USD in 3 days
(7 Reviews)
4.3
gokhanonal

Dear Sir / Madam, I'm a computer engineer (with a BS degree), working freelance in Istanbul, Turkey. I can complete your project quickly and accurately. Please let me know. Looking forward to hearing from you soon,...

$35 USD in 1 day
(13 Reviews)
3.6
signo

Hello, I am experienced in working with large files and back-end processing in general. I will definitely finish this project in the next 24 hours. I still need some clarifications before getting started, regardi...

$133 USD in 1 day
(32 Reviews)
4.2
thanhhungqb

Dear sir, I have read your requirements carefully and am interested. I am an expert in data entry, data scraping and data processing. I usually do this automatically. For your project, I think I can automate it with a prog...

$126 USD in 3 days
(15 Reviews)
3.6
sunil440

Good day! I would like to submit my application as Data Collector. I would be pleased if you would consider me a qualified applicant. I believe my qualifications would make me an outstanding asset to your organization. I woul...

$100 USD in 3 days
(16 Reviews)
3.4
GurpreetSngh220

Hi, I am very much interested in your project. I would like to discuss the project with you further. You can rely on me because I am serious about my work and not here to waste time (for either of us). You...

$188 USD in 5 days
(7 Reviews)
3.2
FernandoCanizo

Hello, I'm interested, I'd like to give it a try. Can you provide a sample file so I can send you my attempt? No compromises. Also send me any other information I would need to build a proper processing script, I'm t...

$30 USD in 2 days
(2 Reviews)
3.4
inoussakabore

Hi, I have already done this kind of job. You can see that in my profile. I am ready to start. I can do it in about one week.

$250 USD in 7 days
(3 Reviews)
3.3
igors233

Greetings, I'm a professional software developer with 15+ years of experience in similar tasks. I will produce a standalone exe (no dependencies) that will take as input a given txt file (it could be downloaded automatical...

$147 USD in 10 days
(4 Reviews)
3.5
szymszteinsl

Hi! I am a professional C/C++/C#/Java programmer. I can do this project to the highest quality. Best regards, Szymszteinsl

$144 USD in 3 days
(2 Reviews)
3.3