Lazzy Heist is a Web Scraper built in Ruby. It scraps data for Movies and TV Shows listed under different categories from the IMDb website.
Data Scraping is one of the way to extract desired data from a website. The information is collected and then exported into a format that is more useful for the users.
You can select to extract data for either Movies or TV Shows:
- Movies
- TV Shows
Further sub-categories are available for both the categories:
- Top-Rated Movies/TV Shows
- Most Popular Movies/TV Shows
Save your desired data in a CSV file
- Ruby
- Ruby Gems
- HTTParty
- Nokogiri
- csv
- RSpec
- Clone the Repository to get a local copy
git clone https://github.com/KumarAmitt/Lazzy-Heist.git
- Open Terminal or CLI
- Navigate into the Repository
cd/path_to_folder
- Install Required Gems
bundle install
- Run the program
ruby ./bin/main.rb
To test the code make insure RSpec in installed
Run the following command to install RSpec
gem install rspec
Run the following command to install all dependencies
bundle install
Finally to test the code Run:
rspec
Note: Run all the commands under this section from the root directory
If all the test pass successfully you will get the following message
Ruby should by installed on your local machine.
Refer the official installation guide to install Ruby.
-
It creates a specific CSV file and writes into it if the file is not available
-
If the file with the same name already available then it will overwrite it with the new data
-
Please check the csv directory at the root level to see the generated .csv file
-
The scraped data will be stored is this file and you cab save it.
👤 Amit Kumar
- GitHub: @githubhandle
- Twitter: @twitterhandle
- LinkedIn: LinkedIn
Contributions, issues, and feature requests are welcome!
Feel free to check the issues page.
Give a ⭐️ if you like this project!
- Hat tip to anyone whose code was used
- Thanks to IMDb for allowing to scrap data
© 2020 Amit Kumar
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE