Alexandru Fikl
|
1be02fc29c
|
remove usage of deprecared sre_constants
|
2023-04-30 14:12:34 +03:00 |
|
lilydjwg
|
addc7d0b52
|
regex source: cache by url and post_data
so we can search for different regexes for the same web page.
fixes #221.
|
2023-03-26 17:30:08 +08:00 |
|
lilydjwg
|
f3be1c585f
|
regex: make sure we don't have more than one group in the regex
|
2021-11-06 10:44:10 +08:00 |
|
Yichao Yu
|
c43d4e900f
|
Support HTTP POST request in the htmlparser and regex source
Example:
source = "regex"
regex = "spcm_linux_libs_v[0-9a-zA-Z]*"
url = "https://spectrum-instrumentation.com/spcm_downloads_downloads_ajax"
post_data = "series%5B%5D=273&families%5B%5D=475"
|
2021-09-07 00:40:23 -04:00 |
|
Chih-Hsuan Yen
|
9acae25c0f
|
regex: make sure regex fails if nothing is found
This is a regression since 2019 [1].
Before that commit, max() raises ValuError if re.findall returns an
empty list. After that commit, regex fails silently if nothing is found.
[1] 7897317294 (diff-fd40f1295f9ddece86df8d2c385ddb02bd4e2a10cd20bd9a95306d3d35a9b601)
|
2021-08-31 20:07:06 +08:00 |
|
lilydjwg
|
305c329e02
|
add a httpheader source
see #175 for use cases.
|
2021-01-28 18:05:58 +08:00 |
|
lilydjwg
|
4887ec66d9
|
port more tests
|
2020-08-17 16:21:02 +08:00 |
|
lilydjwg
|
19553c3564
|
create an api for modules
|
2020-08-14 20:04:05 +08:00 |
|
lilydjwg
|
72d1d27f89
|
port more sources, add cache.get_json
|
2020-08-14 18:25:48 +08:00 |
|