summaryrefslogtreecommitdiff
path: root/sfeed.c
AgeCommit message (Collapse)Author
2019-03-03skip spaces in parsetime() itselfHiltjo Posthuma
2019-03-03sfeed: style, break in switch instead of returnHiltjo Posthuma
this style change is useful for my local coverage profile.
2019-02-27atomlinktype make enum TagId instead of intHiltjo Posthuma
2019-02-27improve RSS2 permalink supportHiltjo Posthuma
In RSS2 (but not RSS0.9), a <link> is optional and it can also be specified by <guid isPermaLink="true"> (permalink is "true" by default). When a <link> is also present this will be used instead of the GUID permalink.
2019-02-27sfeed.c: improve commentHiltjo Posthuma
2019-02-24stricter Atom link parsingHiltjo Posthuma
the Atom link parsing is more strict now and checks the rel attribute. When the rel attribute is empty it is handled as a normal link ("alternate"). This makes sure when an link with an other type is specified (such as "enclosure", "related", "self" or "via") before a link it is not used. sfeed does not handle enclosures, but the code is reworked so it is very simple to add this. Enclosures are often used for example to attach some image to a newspost or an audio file to a podcast.
2019-02-24fix RFC822 ANSI and military zones parsingHiltjo Posthuma
2019-02-08don't read XML data inside tag for Atom <link href/>Hiltjo Posthuma
Noticed in the webcomic "amphibian": http://amphibian.com/feeds/atom
2019-02-08trim whitespace around uri field valueHiltjo Posthuma
... and abstract printing timetamp and uri to string_print_{timestamp,uri} similar to string_print_trimmed (normal string) and string_print_encoded (content). Noticed with whitespace around the field in the webcomic "amphibian": http://amphibian.com/feeds/atom
2019-02-08short some callback variable names, change "name" to "t" (tag)Hiltjo Posthuma
2019-01-29sfeed: use the same handler names as the XMLParserHiltjo Posthuma
2018-12-14sfeed: rename buffer to buf, change entitytostr check, it can never happenHiltjo Posthuma
2018-10-08improve code-style for parsing timeHiltjo Posthuma
- reorder and remove a goto. - no need for a separate variable "end". - don't use s[0] style because the pointer was changed.
2018-10-05sfeed: parsetime: weekday part in RFC822 time is optionalHiltjo Posthuma
noticed in "RMS notes" RSS.
2018-09-07fix many undefined behaviour in usage of ctype functionsHiltjo Posthuma
- cast all ctype(3) function argument to (unsigned char) to avoid UB POSIX says: "The c argument is an int, the value of which the application shall ensure is a character representable as an unsigned char or equal to the value of the macro EOF. If the argument has any other value, the behavior is undefined." Many libc cast implicitly the value, but NetBSD does not, which is probably the correct thing to interpret it. - no need to cast for putchar + rename some fputc(..., stdout) to putchar POSIX says: "The fputc() function shall write the byte specified by c (converted to an unsigned char) to the output stream pointed to by stream [...]" Major thanks to Leonardo Taccari <iamleot@gmail.com> for reporting and testing it on NetBSD!
2018-08-22remove stdint.h includeHiltjo Posthuma
the uint* types in XML are not exposed anymore.
2018-03-11include <sys/types.h> for types size_t, ssize_t etcHiltjo Posthuma
This makes sure xml.c in particular can be compiled without further feature macros.
2017-04-27improve gettag()Hiltjo Posthuma
2017-04-27simplify pledge stubHiltjo Posthuma
2016-08-06add USE_PLEDGE, remove pledge dummy functionHiltjo Posthuma
2016-08-06gettzoffset: simplify, default is 0 so remove UTC zones to checkHiltjo Posthuma
2016-04-18simplify sfeed(5) format, remove feedtype alsoHiltjo Posthuma
Remove type of feed per item, it is not that interesting. sfeed(1) can parse both RSS and Atom feeds.
2016-04-10absuri, encodeuri: make encodeuri static, change argument orderHiltjo Posthuma
2016-04-10strtotime: improveHiltjo Posthuma
use long long: atleast 32-bit, but now time_t (real) to 32-bit or 64-bit is supported. Long long is C99 though, but that is fine. check errno, it can have ERANGE.
2016-04-10small style fixHiltjo Posthuma
2016-04-03fix warning: return early, fixes bogus end NULL dereference warningHiltjo Posthuma
2016-04-03just initialize to zero, no need for memsetHiltjo Posthuma
2016-03-29add time parsing to sfeed itself, remove time fieldHiltjo Posthuma
- less overhead (we only need GMT time) so no setenv("TZ", ...) tzset() crap. - timezone format (for example %z in strptime) is non-standard, this will add some lines of code and some complexity to our code though, but the trade-off is worth it imho.
2016-03-27time experimentHiltjo Posthuma
2016-03-21check for tagid == TagUnknown, bit more clearHiltjo Posthuma
2016-03-20code cleanup: put table of parsed tagid at top (near reverse table)Hiltjo Posthuma
2016-03-20fix crash with repeated <link href=""> tags in Atom feedsHiltjo Posthuma
add some detail to the comments
2016-03-19don't use temporary pointer for realloc, it will exit on errorHiltjo Posthuma
2016-03-10remove cast of unused variablesHiltjo Posthuma
2016-03-04remove optimization with no effect, makes the code cleanerHiltjo Posthuma
2016-03-02sfeed: no need to call strlen, we know the length alreadyHiltjo Posthuma
2016-02-27various improvementsHiltjo Posthuma
- pledge tools and add define to enable it on platforms that support it, currently only OpenBSD 5.9+ - separate getline and parseline functionality. - use murmur3 hash instead of jenkins1: faster and less collisions. - make some error messages a bit more clear, for example with path truncation. - some small cleanups, move printutf8pad to util.
2016-02-27check <author><name> tag case-insensitively, just like the restHiltjo Posthuma
2016-02-27consistency: check snprintf result == -1Hiltjo Posthuma
2016-02-27remove some unneeded checksHiltjo Posthuma
2016-01-31Revert "sfeed: realloc, faster near pow 2 bufsiz"Hiltjo Posthuma
This reverts commit 5e43bd658e578ced54f6065e95f6efb4892e114c. It is a neat bit trick, but it doesn't matter much in thiscase and it's less readable and possibly less portable.
2016-01-31sfeed: realloc, faster near pow 2 bufsizHiltjo Posthuma
2016-01-31code-styleHiltjo Posthuma
2015-10-04sfeed: fix ctype functions againHiltjo Posthuma
2015-10-02sfeed: fix escape of backslash, simplifyHiltjo Posthuma
2015-10-02dont used deprecated timegm, use mktime with TZ set to UTCHiltjo Posthuma
this also makes it compile on NetBSD
2015-10-02update README, fix some typos and wordingHiltjo Posthuma
2015-08-23sfeed: use separate enum field for content fieldsHiltjo Posthuma
2015-08-23simplify unknown or less priority tag case, fix gettag bug (wrong sort order)Hiltjo Posthuma
2015-08-23simplify field map code (add list tagid -> field.Hiltjo Posthuma
also: - parse tag media:description for RSS. - be more strict about using the order of fields, this is more consistent now. - remove buffer_init: don't allocate buffers on start. - realloc, be slightly more aggresive with memory allocating: initial buffer size 16 to 64 bytes.