Note This project is currently under development. It is not yet ready for production.
Please install first the required packages with the following command:
pip install --upgrade -r requirements.txtThen you should setup a Twitter developer account and create a new app to get your API keys. You can find more information here.
Then you should create a new file named .env in the root directory of the project and add the following lines (based on .env.example):
API_KEY =
API_KEY_SECRET =
BEARER_TOKEN =Oh god! Please don't... Still, make sure you have pytest installed and run the following command:
pytest .\twitter_scraper\You can also use the vscode UI to run the tests.
If you ever want to contribute, please begin by reading our Contributing Guidelines.
The standard procedure is :
fork -> git branch -> push -> pull requestNote that we won't accept any PR :
- that does not follow our Contributing Guidelines
- that is not sufficiently commented or isn't well formated
- without any proper test suite
- with a failing or incomplete test suite
Happy coding ! 🙂
This project is licensed under the CeCILL-C FREE SOFTWARE LICENSE AGREEMENT. For more information, please refer to the official website.
See changelog.md for more information.
gantt
title Main Versions
dateFormat YYYY-MM-DD
section source Code (v0)
v0.1 : 2023-01-16, 1d
v0.2 : 2d
v0.3 : 2d
section stable Versions
v1 : 2023-01-19, 9d
Stable Version 1 (click here to expand)
v1.0 first stable release
collection.abcinstead oftyping(deprecated)- lowered the requirements
- min supported python version is now 3.10.6
v1.1 more queries and less storage
- encoded
tweet.contentintobytesfor storage - added retweet and reply selectors to
SearchQuery
known bugs (final correction patch version) see Issues
tweet.dateis alwaysNonewhen scraping (stored as0)
todo (first implementation version)
- encode
tweet.contentintobytesfor storage - should add
tweet.dateback in when scraping - add large search queries
- a posteriori tweet inspection