I'm working on a personal database project and need to be able to retrieve the IMDb ID, the unique seven-digit number associated with a single film title, for several thousand titles. Ideally, I would be able to pass a CSV or other text file containing many movie title and year pairs, and have a CSV with corresponding IMDb IDs returned. The IMDb ID for a film can be found in its URL (for example, [login to view URL]). I would need to perform this task with a new set of titles periodically, so the outcome of this project would be some sort of script that could be used and re-used when needed, though titles whose ID has been retrieved once would not need to be re-done, since IMDb IDs are stable.
i have written script for linkedin instagram... bots... i will give you python script... and you will run it whenever you want... believe me i am the perfect guy for this.. thanks
I can do this using SSIS by pulling the data from your CSV file and convert it to SQL/Database and write a query get all the information need (names,titles, ids) and produce an output either csv,text files, or script which output will be reusable.
What you are requesting can be implemented using C#, which allows for easy scalability if new features are required. It also has the capability to be run in multipleenvironments such as Windows, Mac and Linux
I'm an Informatics Engineer. C# (.NET), C++, Java Tutor at College and Unity3D (C#) Game Developer
I have worked for many software development firms under interdisciplinary environments
I did many similar projects before, I am sure I can help you with this one!
I can think of many ways to do this, we can discuss which one you prefer in chat!
Two possible ways - retrieve IMDb ID from URL:
1. I can use preg_match_all()
Just Obtain the HTML source
Parse all <a> href attributes
Test with a regular expression if their value matches.
If it matches, extract the id from the link and store it in a way that you don't get any duplicates.
Done.
2. A simple function of "View selection source" in Firefox lets me have a look that each link has href property in format:
href="/title/tt0075148/"
By the way, have you any planning to retrieve the ID from Url by using any API? The concept will be quite similar with Facebook Graph API. If you wish this planning, I need to develop the custom API.
So using any of the plan, I will successfully retrieve IMDb Id from URL. You will upload CSV files or text file contained with URL. And the IMDb Id be retrieved from there.
But I have a query- After retrieve the IDs , where you wish to store them? Will you wish to store into database or wish to use the data in some activity or functional purpose?- Please share the details.
There are lot's of other queries there- Please ping me on your convenience.