Commit Graph

218 Commits

Author SHA1 Message Date
jeancf
d336711514 Struggling with config 2022-11-17 23:23:11 +01:00
jeancf
dc8c89243c Updated user agents 2022-11-17 20:56:21 +01:00
jeancf
a2c9deb250 Removed duplicate tracker tags and added 'xtor' 2022-11-17 20:53:03 +01:00
jeancf
6a20c257e5 Merged contribution from
mathdatech
2022-11-17 20:18:42 +01:00
jeancf
b04b7dc195 Removed temp debug 2022-11-14 12:40:56 +01:00
jeancf
f96d8fa93c Added missing logging 2022-11-14 12:36:06 +01:00
jeancf
514a1b3304 Added some temp debug code 2022-11-14 12:26:55 +01:00
jeancf
608bc7519f Corrected condition on retweet tag 2022-11-13 22:35:46 +01:00
jeancf
84b94a38b9 Implemented retweet suppression 2022-11-13 22:17:43 +01:00
jeancf
506c4a05b7 Merge branch timeout into vid_dl 2022-11-06 12:05:23 +01:00
BuildTools
bd7860bb43 Keep log file history 2022-11-06 11:56:29 +01:00
jeancf
11b88e729a Added timeout to all downloads 2022-11-06 11:50:08 +01:00
jeancf
e8c03ab50b youtube-dl set in quiet mode 2022-11-06 11:24:57 +01:00
jeancf
4d1fec306f using youtube-dl as a class 2022-11-03 22:10:23 +01:00
jeancf
10a329fdb1 Replaced twitterdl.py by youtube-dl 2022-11-03 16:53:17 +01:00
jeancf
9c2438382e Added timeout to get request 2022-11-02 18:38:23 +01:00
jeancf
ebf32cebc9 Initialized variable referenced later 2022-10-08 10:25:04 +02:00
BuildTools
216da5519f Removed n.actionsack.com 2022-09-24 13:26:08 +02:00
jeancf
cfd1232f35 Merge remote-tracking branch 'gitlab/master' into cleandb
# Conflicts:
#	twoot.py
2022-09-15 20:35:27 +02:00
jeancf
3273b21608 Fixed bug in query 2022-09-15 20:12:20 +02:00
jeancf
dada20d0b9 Added database cleanup (untested) 2022-09-15 19:58:17 +02:00
jeancf
7f462a5a6e Minor improvement to logging 2022-09-14 16:54:47 +02:00
jeancf
5e0fb1a9c3 Corrected typo 2022-09-14 16:35:10 +02:00
jeancf
bfbe9704f7 Cosmetic changes 2022-09-14 16:28:48 +02:00
jeancf
4ccce6aac1 asctime() instead 2022-09-08 10:19:23 +02:00
jeancf
392b0bafd0 more str conversion 2022-09-08 10:17:14 +02:00
jeancf
357e45844d convert int to str 2022-09-08 10:15:14 +02:00
jeancf
2b21a626d4 Less stupid 2022-09-08 10:11:37 +02:00
jeancf
ffdce1ad12 updated url 2022-09-08 10:05:19 +02:00
jeancf
63a7a578a4 epoch to local time 2022-09-08 09:37:30 +02:00
jeancf
a7b63f569f Changed logging to info 2022-09-08 09:35:02 +02:00
jeancf
4704890ddf check rate limit 2022-09-08 09:28:28 +02:00
jeancf
7ffa81ffbd No longer try creating unique index 2022-08-22 14:50:03 +02:00
jeancf
65b880f5be Bug removed 2022-08-22 14:27:18 +02:00
jeancf
29cf330699 Improved error message and removed nitter mirror 2022-08-22 14:09:43 +02:00
jeancf
fe145525ab Added index on sqlite database 2022-08-22 14:00:28 +02:00
jeancf
98ed69e232 Correct mirror URL 2022-08-22 13:34:56 +02:00
jeancf
94d1fc4e22 Fixed the fix of the fix 2022-08-22 09:33:27 +02:00
jeancf
82a9430160 Fixed the fix 2022-08-22 09:30:52 +02:00
jeancf
3c847e4f06 Fixed false positive on search for "replying-to" 2022-08-22 08:54:17 +02:00
jeancf
c4abee2835 Updated Nitter URLs 2022-08-19 11:15:49 +02:00
jeancf
e6854106eb Updated user agents 2022-08-19 10:48:33 +02:00
jeancf
00f374896d Fliexibility in timestamp 2022-01-03 18:11:40 +01:00
jeancf
65d91bf025 Clarified info and updated nitter sites 2022-01-03 18:03:56 +01:00
BuildTools
2a63371336 Adjusted nitter sites 2022-01-03 17:44:37 +01:00
BuildTools
735503c1b1 Merge branch 'master' of https://gitlab.com/jeancf/twoot
Merging master
2021-10-16 19:29:28 +02:00
BuildTools
204f1e5c9f Updated nitter site list 2021-10-16 19:27:49 +02:00
jeancf
a463ce335b Catching connection exception to nitter site 2021-10-16 19:26:02 +02:00
jeancf
200837c336 Improved logging message of cap limit 2021-06-03 09:35:34 +02:00
jeancf
0637c8ccda Corrected basicConfig parzmeter 2021-06-01 16:12:05 +02:00
jeancf
c688035fd0 Implemented timestamps in logs 2021-06-01 15:49:11 +02:00
BuildTools
29629e2785 Logging improvementµ 2021-06-01 14:57:43 +02:00
jeancf
71acd65ba0 Implemented cap 2021-06-01 11:54:08 +02:00
jeancf
3148180e9a Some cleanup
Rebased
2021-06-01 11:27:22 +02:00
BuildTools
3963b102b9 Modified active nitter hosts 2021-06-01 11:05:33 +02:00
jeancf
588e6003ca Set logging to WARNING 2021-03-07 21:29:20 +01:00
jeancf
56b87e4756 Merge branch 'master' of https://gitlab.com/jeancf/twoot 2021-03-07 21:26:58 +01:00
jeancf
cf856bee08 Login only when there is something to upload 2021-03-07 21:26:52 +01:00
BuildTools
b9842db677 Added 300s timeout to twitter video download 2021-03-05 17:13:59 +01:00
jeancf
807dad3480 Random selection of nitter mirror to use 2021-03-02 22:08:52 +01:00
jeancf
8e4f13c26a placed nitter url in const 2021-02-11 19:03:12 +01:00
jeancf
a9109884a4 More debug messages 2020-12-19 10:59:23 +01:00
jeancf
1d40071b27 Added log of twitter:image download 2020-12-19 10:53:11 +01:00
jeancf
40185ef817 Improved last logging syntax 2020-12-19 10:48:46 +01:00
jeancf
5df11dbe4b Fixed last logging syntax 2020-12-19 10:36:59 +01:00
jeancf
3c7693fe66 Updated README
Improved decimal format in log
2020-12-19 10:30:19 +01:00
jeancf
dc6c16ae16 Keep logs for now 2020-12-19 10:09:03 +01:00
jeancf
43d63b1e5a Added logging run time 2020-12-19 09:21:39 +01:00
jeancf
bb52e54c0d Logging set to debug 2020-12-18 22:43:50 +01:00
jeancf
066f737a61 quote is an 'a' tag 2020-12-18 22:41:57 +01:00
jeancf
60f7054fac Separate logging for exceptions 2020-12-18 22:16:27 +01:00
jeancf
1525955c52 Added info log messages 2020-12-18 22:09:34 +01:00
jeancf
33342cdfb7 Cards can have no pic 2020-12-18 21:32:26 +01:00
jeancf
986d902ccd Fixed video download url 2020-12-18 21:06:05 +01:00
jeancf
62ba2f505e Issues with video download 2020-12-18 17:55:12 +01:00
jeancf
a0ce29f4c5 Fine tuning 2020-12-18 17:35:50 +01:00
jeancf
67bf87213d Correct url in image downloads 2020-12-18 17:21:41 +01:00
jeancf
822215fefe download more images. Improved logging 2020-12-18 17:06:09 +01:00
jeancf
3a88438ec2 Some easy bugs squashed 2020-12-18 14:57:22 +01:00
jeancf
f229976861 Improved logging. "OMG, it's full of bugs!" 2020-12-18 14:39:13 +01:00
jeancf
551c47d576 Implemented process attachment 2020-12-18 14:28:17 +01:00
jeancf
efa84f85d3 Download nitter video 2020-12-18 13:26:26 +01:00
jeancf
b4a596eff2 Downloaded pics attachments 2020-12-18 11:45:43 +01:00
jeancf
14c24fe847 started process_attachments() 2020-12-17 22:59:21 +01:00
jeancf
8079914282 Reworked process_media_body 2020-12-17 22:08:43 +01:00
jeancf
711ec9677a Added a bunch of TODO 2020-12-17 21:44:32 +01:00
jeancf
992f91537f TODO done 2020-12-17 18:59:02 +01:00
jeancf
fbec4004f9 Handled reply-to 2020-12-17 17:56:12 +01:00
jeancf
557ef6deb9 Handling reply-to 2020-12-17 17:50:10 +01:00
jeancf
0787669a3a Moved time check to beginning of process 2020-12-17 17:31:43 +01:00
jeancf
d92bcea2a7 Added cookie to preserve twitter and youtube addresses 2020-12-17 10:44:30 +01:00
jeancf
3a2c8093a3 Improved logging in cleanup_tweet_text 2020-12-17 10:15:46 +01:00
jeancf
857a7f9b9e Extracted full_status_url 2020-12-16 22:46:01 +01:00
jeancf
e6e24cbfd5 Extracted author, author_account, time_string, timestamp 2020-12-16 22:15:27 +01:00
jeancf
19d988dfcb Removed extracting avatar 2020-12-16 22:03:09 +01:00
jeancf
4e6a97d765 Removed downloading of status page with uncensored pics 2020-12-16 21:58:24 +01:00
jeancf
e87599d40b Removed downloading of full status page of the tweet 2020-12-16 21:57:03 +01:00
jeancf
7cc076053f Extracted tweet_id and status_id 2020-12-16 21:55:13 +01:00
jeancf
c25e36b498 Extracted timeline 2020-12-16 20:55:26 +01:00
jeancf
910b7a8b13 Safer implementation 2020-12-16 20:48:00 +01:00