I am looking for an experienced web scraper to handle web scraping projects from three specific websites: 1. TheNumbers.com 2. IMDb.com 3. Boxofficemojo.com The project will potentially include up to approximately 60 variable categories and approximately 85,000 observations. Details: The web scraping will be conducted on a public website that houses data on motion pictures. Variables that need to be retrieved include, but are not limited to the following: -Movie Title -Movie Release Date -Movie Release Country -Movie Box Office -Movie Rating -Movie Genre -Movie language -Movie Director -Movie Producer -Movie Writer -Movie Release Date (Theaters) -Release Date (Streaming) -Movie Box Office (all available data, including Gross USA Box Office) -Movie Runtime (length) -Movie Distributor - Movie Studio -Movie CAST & CREW (I need the first four listed Actors on the film and I need additional data on these Actors from individual pages linked for each of the actors) -Total Number of User or Audience Reviews -Average score of the User or Audience Reviews -Total Number of Critic Reviews -Average score for critic review -Metascore - Number of critic reviews for Metascore -Award Nominations (categories of nomination, individuals nominated, type of award, etc.) -Awards Wins (category of win, individual(s) who won, type of award, etc.) Individual Data Needed for Each Actor: - Actor Name - Actor Birthday - Actor Birthplace - Actor Gender (male or female) - Actor Nationality - Actor Race or Ethnicity I need all of these observations to be placed into a single file with these listed variables and headers. My goal is to be able to take this single file and use Stata to run regressions and similar processes on the observations. I need this Motion Picture data for all films with a Movie Release Date from 2000 through 2024. For each of these films, I need the data for the four first-billed (credited) actors associated with the film. Note: Please see the following page for example of data required: https://www.imdb.com/title/tt1560747/?ref_=fn_al_tt_1. From this page, there are links to actors data and review/awards data that will also need to be captured. Also, please view the attached file for a list of data requested. If there is applicable data on these three sites that should also be included, I would also want that. I will be able to provide further details and explanations to the right candidate. I expect this to be a short-term project that will not require longer than a few hours over a week or so. I want to begin this project immediately and have it completed within the next two days. Compensation will be provided accordingly. Thank you.
Price: $30.0
Seeking a freelancer to manage content, scheduling, and analytics across LinkedIn, X, and Google My Business. Must have B2B social experience and be fluent in GA4, HubSpot, and native platform tools. QFlow Systems is a B2G-focused SaaS company that delivers enterprise d...
View JobWe are seeking a creative and detail-oriented individual to design an informative 'how to' flyer/document for our new hires. This document should provide clear instructions on how to open a checking account with our organization. The ideal candidate will have experience...
View JobWe are an AI Agency looking to bring on board a Lead AI Developer for our start-up. We are seeking a skilled developer to create both complex and simpler automated workflows. The ideal candidate will have experience in designing and implementing efficient automation sol...
View Job