f67fbefb5e
Preserve newlines in toot corpus
...
The original code was already trying to do this, but in a way that
Beautiful Soup ended up stripping out. This way preserves the newlines
properly, which will prevent the bot from smooshing together words
accidentally.
2021-08-21 15:09:59 +10:00
73f2cff1d5
Remove unnecessary +x mode from Python files
2021-08-21 13:37:11 +10:00
Agatha Rose
a904587b32
Clean up formatting and help linter calm down
2021-06-05 00:38:36 +03:00
Agatha Rose
dd78364f2d
Expose overlap ratio and length limit to config
2021-06-05 00:14:56 +03:00
Agatha Rose
27f61c4374
Make bs4 only replace the tag name instead of name and contents
2021-02-18 18:01:43 +02:00
Lynne
8274409bf4
update extract code to match fedibooks
2020-05-27 22:31:16 +10:00
Lynnesbian
7d718bbe3a
minor code cleanup
2020-03-08 19:46:07 +10:00
Lynne
1fbaf17e3e
allow users to specify custom config.json location
2019-08-07 13:46:57 +10:00
Lynne
32fa2302aa
handle empty database
2019-07-10 21:25:07 +10:00
Lynne
068ea7242f
fixed typo
2019-07-04 18:08:00 +10:00
Lynne
cd659d86a1
removed unused media code
2019-07-02 20:43:34 +10:00
Lynne
354ea46dcb
code cleanup, fixes #23
2019-05-19 23:06:31 +10:00
Lynne
1d8f7b26f5
fixed a very silly mistake
2019-04-29 14:38:44 +10:00
Lynne
02343668b8
add an option to avoid creating fake mentions
2019-04-29 14:21:46 +10:00
Lukas Fülling
c0cf42a41a
add config option to filter for language
2019-02-25 19:44:58 +01:00
Lynne
2cecfd42a5
added support for learning from CW'd posts
2019-02-25 11:17:06 +10:00
Lynne
213c40351d
only use 10k toots rather than all of them to avoid having Too Many Toots
2019-01-14 08:38:54 +10:00
Lynne
3e3f905839
removed unused import of random
2019-01-11 22:57:18 +10:00
Lynne
c2997ae0d1
fixed a silly mistake, fixed " and ' stuff
2019-01-11 22:56:35 +10:00
Lynne
de3449ae56
added extract_toot function to functions.py
2019-01-11 22:55:31 +10:00
Lynne
85fec32c83
renamed create.py to functions.py
2019-01-11 22:47:42 +10:00