Commit Graph

196 Commits

Author SHA1 Message Date
jeancf
5e0fb1a9c3 Corrected typo 2022-09-14 16:35:10 +02:00
jeancf
bfbe9704f7 Cosmetic changes 2022-09-14 16:28:48 +02:00
jeancf
4ccce6aac1 asctime() instead 2022-09-08 10:19:23 +02:00
jeancf
392b0bafd0 more str conversion 2022-09-08 10:17:14 +02:00
jeancf
357e45844d convert int to str 2022-09-08 10:15:14 +02:00
jeancf
2b21a626d4 Less stupid 2022-09-08 10:11:37 +02:00
jeancf
ffdce1ad12 updated url 2022-09-08 10:05:19 +02:00
jeancf
63a7a578a4 epoch to local time 2022-09-08 09:37:30 +02:00
jeancf
a7b63f569f Changed logging to info 2022-09-08 09:35:02 +02:00
jeancf
4704890ddf check rate limit 2022-09-08 09:28:28 +02:00
jeancf
7ffa81ffbd No longer try creating unique index 2022-08-22 14:50:03 +02:00
jeancf
65b880f5be Bug removed 2022-08-22 14:27:18 +02:00
jeancf
29cf330699 Improved error message and removed nitter mirror 2022-08-22 14:09:43 +02:00
jeancf
fe145525ab Added index on sqlite database 2022-08-22 14:00:28 +02:00
jeancf
98ed69e232 Correct mirror URL 2022-08-22 13:34:56 +02:00
jeancf
94d1fc4e22 Fixed the fix of the fix 2022-08-22 09:33:27 +02:00
jeancf
82a9430160 Fixed the fix 2022-08-22 09:30:52 +02:00
jeancf
3c847e4f06 Fixed false positive on search for "replying-to" 2022-08-22 08:54:17 +02:00
jeancf
c4abee2835 Updated Nitter URLs 2022-08-19 11:15:49 +02:00
jeancf
e6854106eb Updated user agents 2022-08-19 10:48:33 +02:00
jeancf
00f374896d Fliexibility in timestamp 2022-01-03 18:11:40 +01:00
jeancf
65d91bf025 Clarified info and updated nitter sites 2022-01-03 18:03:56 +01:00
BuildTools
2a63371336 Adjusted nitter sites 2022-01-03 17:44:37 +01:00
BuildTools
735503c1b1 Merge branch 'master' of https://gitlab.com/jeancf/twoot; Merging master 2021-10-16 19:29:28 +02:00
BuildTools
204f1e5c9f Updated nitter site list 2021-10-16 19:27:49 +02:00
jeancf
a463ce335b Catching connection exception to nitter site 2021-10-16 19:26:02 +02:00
jeancf
200837c336 Improved logging message of cap limit 2021-06-03 09:35:34 +02:00
jeancf
0637c8ccda Corrected basicConfig parzmeter 2021-06-01 16:12:05 +02:00
jeancf
c688035fd0 Implemented timestamps in logs 2021-06-01 15:49:11 +02:00
BuildTools
29629e2785 Logging improvementµ 2021-06-01 14:57:43 +02:00
jeancf
71acd65ba0 Implemented cap 2021-06-01 11:54:08 +02:00
jeancf
3148180e9a Some cleanup; Rebased 2021-06-01 11:27:22 +02:00
BuildTools
3963b102b9 Modified active nitter hosts 2021-06-01 11:05:33 +02:00
jeancf
588e6003ca Set logging to WARNING 2021-03-07 21:29:20 +01:00
jeancf
56b87e4756 Merge branch 'master' of https://gitlab.com/jeancf/twoot 2021-03-07 21:26:58 +01:00
jeancf
cf856bee08 Login only when there is something to upload 2021-03-07 21:26:52 +01:00
BuildTools
b9842db677 Added 300s timeout to twitter video download 2021-03-05 17:13:59 +01:00
jeancf
807dad3480 Random selection of nitter mirror to use 2021-03-02 22:08:52 +01:00
jeancf
8e4f13c26a placed nitter url in const 2021-02-11 19:03:12 +01:00
jeancf
a9109884a4 More debug messages 2020-12-19 10:59:23 +01:00
jeancf
1d40071b27 Added log of twitter:image download 2020-12-19 10:53:11 +01:00
jeancf
40185ef817 Improved last logging syntax 2020-12-19 10:48:46 +01:00
jeancf
5df11dbe4b Fixed last logging syntax 2020-12-19 10:36:59 +01:00
jeancf
3c7693fe66 Updated README; Improved decimal format in log 2020-12-19 10:30:19 +01:00
jeancf
dc6c16ae16 Keep logs for now 2020-12-19 10:09:03 +01:00
jeancf
43d63b1e5a Added logging run time 2020-12-19 09:21:39 +01:00
jeancf
bb52e54c0d Logging set to debug 2020-12-18 22:43:50 +01:00
jeancf
066f737a61 quote is an 'a' tag 2020-12-18 22:41:57 +01:00
jeancf
60f7054fac Separate logging for exceptions 2020-12-18 22:16:27 +01:00
jeancf
1525955c52 Added info log messages 2020-12-18 22:09:34 +01:00
jeancf
33342cdfb7 Cards can have no pic 2020-12-18 21:32:26 +01:00
jeancf
986d902ccd Fixed video download url 2020-12-18 21:06:05 +01:00
jeancf
62ba2f505e Issues with video download 2020-12-18 17:55:12 +01:00
jeancf
a0ce29f4c5 Fine tuning 2020-12-18 17:35:50 +01:00
jeancf
67bf87213d Correct url in image downloads 2020-12-18 17:21:41 +01:00
jeancf
822215fefe download more images. Improved logging 2020-12-18 17:06:09 +01:00
jeancf
3a88438ec2 Some easy bugs squashed 2020-12-18 14:57:22 +01:00
jeancf
f229976861 Improved logging. "OMG, it's full of bugs!" 2020-12-18 14:39:13 +01:00
jeancf
551c47d576 Implemented process attachment 2020-12-18 14:28:17 +01:00
jeancf
efa84f85d3 Download nitter video 2020-12-18 13:26:26 +01:00
jeancf
b4a596eff2 Downloaded pics attachments 2020-12-18 11:45:43 +01:00
jeancf
14c24fe847 started process_attachments() 2020-12-17 22:59:21 +01:00
jeancf
8079914282 Reworked process_media_body 2020-12-17 22:08:43 +01:00
jeancf
711ec9677a Added a bunch of TODO 2020-12-17 21:44:32 +01:00
jeancf
992f91537f TODO done 2020-12-17 18:59:02 +01:00
jeancf
fbec4004f9 Handled reply-to 2020-12-17 17:56:12 +01:00
jeancf
557ef6deb9 Handling reply-to 2020-12-17 17:50:10 +01:00
jeancf
0787669a3a Moved time check to beginning of process 2020-12-17 17:31:43 +01:00
jeancf
d92bcea2a7 Added cookie to preserve twitter and youtube addresses 2020-12-17 10:44:30 +01:00
jeancf
3a2c8093a3 Improved logging in cleanup_tweet_text 2020-12-17 10:15:46 +01:00
jeancf
857a7f9b9e Extracted full_status_url 2020-12-16 22:46:01 +01:00
jeancf
e6e24cbfd5 Extracted author, author_account, time_string, timestamp 2020-12-16 22:15:27 +01:00
jeancf
19d988dfcb Removed extracting avatar 2020-12-16 22:03:09 +01:00
jeancf
4e6a97d765 Removed downloading of status page with uncensored pics 2020-12-16 21:58:24 +01:00
jeancf
e87599d40b Removed downloading of full status page of the tweet 2020-12-16 21:57:03 +01:00
jeancf
7cc076053f Extracted tweet_id and status_id 2020-12-16 21:55:13 +01:00
jeancf
c25e36b498 Extracted timeline 2020-12-16 20:55:26 +01:00
jeancf
910b7a8b13 Safer implementation 2020-12-16 20:48:00 +01:00
jeancf
e2841535f6 Extracted twit_account 2020-12-16 20:42:44 +01:00
jeancf
894c13d551 Download page from nitter.net 2020-12-16 19:43:17 +01:00
jeancf
9fc76b9981 Updated user agents 2020-12-16 18:47:27 +01:00
BuildTools
c4bf95c1a7 Commented out printing of extracted tweets 2020-12-13 21:04:33 +01:00
jeancf
010f5fdeec Merge remote-tracking branch 'gitlab/logging' into logging 2020-12-13 18:30:57 +01:00
jeancf
b7175067c0 Added timeout to execution of twitterdl.py 2020-12-13 18:25:27 +01:00
jeancf
267d4cb551 TODO is done 2020-12-13 10:44:07 +01:00
jeancf
4f326ee3cd Added more debug messages 2020-11-09 15:55:42 +01:00
jeancf
1781eb5653 Basic logging setup 2020-10-14 21:51:00 +02:00
jeancf
67fdbba510 Stop trying to regex a string into linked picture file 2020-09-10 13:09:51 +02:00
JC Francois
a95006fae6 Added tolerance for malformed URL in picture download 2020-04-09 18:17:13 +02:00
JC Francois
092f2ab371 Cleanup and README.md update for release 2020-04-05 10:37:54 +02:00
JC Francois
e32620d79b Implemented proper naming of downloaded videos 2020-03-29 17:16:54 +02:00
JC Francois
965317f5b2 Added details on optional dependencies to README.md 2020-03-29 13:57:18 +02:00
JC Francois
6fa2019618 Calling twitterdl.py as subprocess 2020-03-29 13:41:49 +02:00
jeancf
2090d214b6 Trying to stop debug messages 2020-03-28 14:11:06 +01:00
jeancf
9c56ad57c8 Added TODOs to improve management of locations of video download 2020-03-28 14:07:00 +01:00
jeancf
df4eaa0dd7 Set debug=0 on call to download to avoid mail spam 2020-03-28 13:55:43 +01:00
jeancf
ba3da6ab7c Handled exception of video download directory absent when trying to delete it 2020-03-28 11:21:28 +01:00
jeancf
dd1d54d2a4 Check if tweet in db before ingest to speed up processing of feed 2020-03-28 11:08:09 +01:00
jeancf
2fe06c0bbc Use correct capitalization of twitter account name for deleting video directory 2020-03-27 17:45:40 +01:00
jeancf
0231f224a3 Improved naming of downloaded videos and implemented cleanup 2020-03-27 17:26:04 +01:00