Age | Commit message (Collapse) | Author | |
---|---|---|---|
2018-12-02 | XML tag parse improvements for PI and end tags | Hiltjo Posthuma | |
- Stricter parsing of tags, no whitespace stripping after <. - For end tags the "internal" context x->tag would be "/sometag". Make sure this matches exactly with the parameter tag. - Reset tagname after parsing an end tag. - Make end tag handling more consistent. - Remove temporary variable taglen. | |||
2018-08-26 | xml: use ANSI types and struct initialization | Hiltjo Posthuma | |
long is atleast 32-bits, codepointtoutf8() works with >= 32-bit types. Valid codepoint ranges are not larger than this. unsigned char is not needed because converted unicode bytes don't use this range. tested all valid codepoints and output on amd64, i386 and SPARC64. | |||
2018-08-23 | xml: remove TODO comments and add a note | Hiltjo Posthuma | |
2018-08-22 | xml: improve parsing of invalid attribute values separated by whitespace | Hiltjo Posthuma | |
It is invalid XML, but this allows parsing old HTML pages aswell. For example: <input id=cb checked type="checkbox" title='checkbox' /> or <FONT FACE=wingdings SIZE=12><BLINK>oh hai</BLINK></FONT> | |||
2018-08-22 | xml: improve handling of invalid long data entities | Hiltjo Posthuma | |
this also fixes an issue with truncating and missing data on invalid input. | |||
2018-08-21 | xml: rewrite codepointtoutf8 function | Hiltjo Posthuma | |
No more converting to a uint32_t type. Just convert to a byte buffer. Tested on little- and big-endian. The code should be more clear too hopefully. | |||
2018-08-21 | xml: don't reset internal tagname when parsing non-tag types like CDATA | Hiltjo Posthuma | |
... this affects "tags" starting with < such as CDATA and processing instructions. | |||
2018-08-21 | xml: fix missing first byte when parsing a long incorrect attribute entity | Hiltjo Posthuma | |
... the entity had to be invalid (start with &) and longer than the buffer size. + tiny style fix. | |||
2018-08-21 | xml: interface change: make some functions private | Hiltjo Posthuma | |
... this does not expose the uint* types either. | |||
2018-08-21 | xml: increase allowed size of attribute names | Hiltjo Posthuma | |
2018-08-16 | XML parser: numeric entity: check unicode codepoint range | Hiltjo Posthuma | |
2018-03-11 | include <sys/types.h> for types size_t, ssize_t etc | Hiltjo Posthuma | |
This makes sure xml.c in particular can be compiled without further feature macros. | |||
2018-03-11 | xml: improve comment parsing | Hiltjo Posthuma | |
note that ---> is officially invalid XML, but we allow it anyway. | |||
2018-03-11 | xml: fix parsing of cdata when a handler is unset | Hiltjo Posthuma | |
2018-03-11 | xml: improve CDATA parsing | Hiltjo Posthuma | |
thanks Svyatoslav Mishyn for the feedback! | |||
2017-12-24 | xml: make name entities static, minor clarifications | Hiltjo Posthuma | |
2016-04-10 | xml: stricter check of entity: must end with ';', ... | Hiltjo Posthuma | |
... zero output buffer if codepoint length is 0 | |||
2015-08-22 | xml: fix includes | Hiltjo Posthuma | |
2015-08-22 | xml: simplify XML reader | Hiltjo Posthuma | |
2015-08-16 | xml: change xml_parse_string to xml_parse_buf | Hiltjo Posthuma | |
In the parser itself allow reading '\0' in the XML itself. Add a length parameter to specify the buffer size. | |||
2015-08-14 | minor code-style improvements | Hiltjo Posthuma | |
2015-08-14 | xml: whoops, remove leftover xml_getnext_stdin | Hiltjo Posthuma | |
2015-08-14 | xml: separate reader context from parser | Hiltjo Posthuma | |
also: - rename xmlparser_ prefix to xml_. - make xml_parse public, this allows a custom reader like a direct mmap, see: XMLParser.getnext and (optionall) XMLParser.getnext_data. - improve the README text. | |||
2015-08-08 | xml: move entity to namedentitystr() | Hiltjo Posthuma | |
2015-08-06 | xml: remove forced __inline__ attribute | Hiltjo Posthuma | |
2015-08-06 | general cleanups | Hiltjo Posthuma | |
2015-08-01 | xml: only allow full uppercase or full lowercase for entities | Hiltjo Posthuma | |
2015-07-31 | xml: fix xml_namedentitytostr loop | Hiltjo Posthuma | |
2015-07-31 | xml: fix missing include strings.h, for strncasecmp | Hiltjo Posthuma | |
2015-07-29 | improve includes (dont include headers in .h), fix build on Linux | Hiltjo Posthuma | |
2015-07-28 | improve code-style and consistency | Hiltjo Posthuma | |
2015-06-23 | xml: fix comment issue, improve cdata and comment while encountering separator | Hiltjo Posthuma | |
2015-06-22 | xml: fix cdata issue | Hiltjo Posthuma | |
2015-06-21 | separate xml specific code into xml.c | Hiltjo Posthuma | |
2015-06-21 | xml.c: fix empty cdata callback | Hiltjo Posthuma | |
2015-05-16 | xml: only call data handler if set | Hiltjo Posthuma | |
2015-05-16 | xml: call parse | Hiltjo Posthuma | |
2015-05-16 | xml: attrentity handler will be called if set | Hiltjo Posthuma | |
it used to be if attrentity is NULL it would call attrdata. | |||
2015-05-16 | xml: allow to read from fd or string buffer | Hiltjo Posthuma | |
+ minor code style. | |||
2014-11-17 | code-style, ugly test-code (remove later) | Hiltjo Posthuma | |
2014-11-11 | comment style | Hiltjo Posthuma | |
2014-11-11 | fix typo in man page | Hiltjo Posthuma | |
2014-06-28 | xml: fix attribute without data: | Hiltjo Posthuma | |
<input checked /> Signed-off-by: Hiltjo Posthuma <hiltjo@codemadness.org> | |||
2014-06-27 | small fixes | Hiltjo Posthuma | |
reorder static -> public xml functions. Signed-off-by: Hiltjo Posthuma <hiltjo@codemadness.org> | |||
2014-05-08 | style: linewrap, etc | Hiltjo Posthuma | |
Signed-off-by: Hiltjo Posthuma <hiltjo@codemadness.org> | |||
2014-04-02 | xml: fix cdata parsing, disable markup declaration parsing for now | Hiltjo Posthuma | |
Signed-off-by: Hiltjo Posthuma <hiltjo@codemadness.org> | |||
2014-04-02 | Makefile: add sfeed_web, use c99 for build | Hiltjo Posthuma | |
Signed-off-by: Hiltjo Posthuma <hiltjo@codemadness.org> | |||
2014-04-01 | fix crlf newlines, add fp arg to xmlparser_init | Hiltjo Posthuma | |
Signed-off-by: Hiltjo Posthuma <hiltjo@codemadness.org> | |||
2014-03-31 | new version | Hiltjo Posthuma | |
lots of things changed, but cleanup todo. changelog and consistent stream of small updates will come in the future. Signed-off-by: Hiltjo Posthuma <hiltjo@codemadness.org> | |||
2013-05-20 | update xml parser, many optimizations and dos to unix newlines, much cleanup ↵ | Hiltjo Posthuma | |
todo though Signed-off-by: Hiltjo Posthuma <hiltjo@codemadness.org> |