Commit Graph

107 Commits

Author SHA1 Message Date
Tanner 38cfc4bda4 Remove extra logging 2020-07-08 02:36:40 +00:00
Tanner 162142083b Fix crash when HN feed fails 2020-07-08 02:36:40 +00:00
Tanner 02c8bbad20 Remove document img and ignore r/technology 2020-07-08 02:36:40 +00:00
Tanner 5ef0fd120b Tune search rankings and attributes 2020-07-08 02:36:40 +00:00
Tanner 5634cc812c Add more logging 2020-07-08 02:36:40 +00:00
Tanner 21c221925d Remove get first image 2020-07-08 02:36:40 +00:00
Tanner 437b1e313b Add requests timeouts and temporary logging 2020-07-08 02:36:40 +00:00
Tanner 64ef3a2a18 Integrate with external MeiliSearch server 2020-07-08 02:36:40 +00:00
Tanner d69e054311 Integrate sqlite database with server 2020-07-08 02:36:40 +00:00
Tanner 5913f894ca Update whoosh migration script 2020-07-08 02:36:40 +00:00
Tanner 894d3654c0 Store ref list in database too 2020-07-08 02:36:40 +00:00
Tanner e97bc4b2c7 Begin initial sqlite conversion 2020-07-08 02:36:40 +00:00
Tanner 8c1ddd4a43 Check if cache is broken 2020-07-08 02:36:40 +00:00
Tanner 490bcd5235 Fall back to ref on manual submission title 2020-07-08 02:36:40 +00:00
Tanner f956656647 Check content-type 2020-07-08 02:36:40 +00:00
Tanner c3c5fa0c0a Remove technology subreddit 2020-07-08 02:36:40 +00:00
Tanner 54f30e20f5 Update tildes parser group tag 2020-07-08 02:36:40 +00:00
Tanner ee3df25c63 Remove keys of uncached stories 2020-01-28 04:20:05 +00:00
Tanner de1bcd9abc Fix tildes deleted comment parser error 2020-01-28 04:19:26 +00:00
Tanner 9cc73da33c Add del tag and sort tags 2020-01-04 23:37:41 +00:00
Tanner 957beea2a7 Stop using archive.is on articles (hits CAPTCHAs) 2019-12-15 22:47:33 +00:00
Tanner 114be7a559 Whitelist more html tags 2019-12-14 07:39:10 +00:00
Tanner ad5da72578 Grab comments on manually submitted links 2019-12-02 23:15:51 +00:00
Tanner 393b676791 Sanitize html 2019-12-01 22:18:41 +00:00
Tanner a9dbfa0a6f Decrease feed cache length to 150 2019-12-01 22:18:14 +00:00
Tanner f11d4ff20c Drop articles more than two days old 2019-11-08 21:50:33 +00:00
Tanner 6ca4a32030 Allow manual submission of articles 2019-11-08 05:55:30 +00:00
Tanner 5482af40e5 Move to gevent production http server 2019-11-08 02:37:57 +00:00
Tanner d6619f188c Handle hostnames better 2019-11-07 22:10:08 +00:00
Tanner dc87026f99 Add subreddit 2019-11-07 22:09:45 +00:00
Tanner 9283f8439c Fix Tildes down for maintenance edge case 2019-10-22 05:01:30 +00:00
Tanner 0742432541 Prefetch first images 2019-10-19 07:33:06 +00:00
Tanner 109ba0eb23 Fix crash from domain and ext check bug 2019-10-16 08:56:31 +00:00
Tanner ca9bed855f Fix copy/paste error, switch to info logging 2019-10-16 05:26:47 +00:00
Tanner 9f60ee7864 Begin README and add license 2019-10-15 16:40:55 -06:00
Tanner 8f8a11954a Archive WSJ articles first, catch KeyboardInterrupt 2019-10-15 21:03:47 +00:00
Tanner c4281ca215 Stop using python keyword id for id 2019-10-15 20:36:20 +00:00
Tanner 6bd3bf1090 Cache all articles in IndexedDB 2019-10-12 23:41:31 +00:00
Tanner f798c06a9b Move archive to Whoosh and add search 2019-10-12 05:32:17 +00:00
Tanner 5f8884a5ca Gitkeep archive directory 2019-10-10 21:55:21 +00:00
Tanner 055439c6db Serve client through apiserver, adding meta info 2019-10-10 21:54:29 +00:00
Tanner 536214be1f Fix Tildes comments with unknown authors 2019-10-08 08:01:17 +00:00
Tanner c396a432d8 Archive Bloomberg articles first 2019-10-08 08:00:50 +00:00
Tanner 37283d09dc Gitkeep apiserver data directory 2019-10-08 07:59:30 +00:00
Tanner 8a01d533da Ignore certain files and domains, remove refs 2019-09-24 08:22:06 +00:00
Tanner 1acdd92cbf Ignore new Tildes posts and handle deleted ones 2019-09-24 08:21:26 +00:00
Tanner 37904a467b Handle Reddit PRAW exceptions 2019-09-24 08:20:46 +00:00
Tanner 1682cb8247 Filter out False comments 2019-08-30 06:23:14 +00:00
Tanner f1c89fcf8b Render reddit markdown, poll tildes better, add utils 2019-08-28 04:13:02 +00:00
Tanner b76cbcd046 Try outline.com for reader mode first 2019-08-25 23:49:08 +00:00