htmlparser
Simple HTML Tag Parser (SAX-style events)
Functions
| Function | Description |
|---|---|
| _html_byte_at | Internal byte reader. |
| _html_copy | Internal: copy bytes from src+soff to dst+doff, length n. |
| _html_add_evt | Internal: add an event to the handle. |
| _html_find_gt | Internal: find ‘>’ starting from pos. Returns index of ‘>’ or len if not found. |
| _html_find_lt | Internal: find ‘<’ starting from pos. Returns index of ‘<’ or len if not found. |
| _html_tag_end | Internal: find end of tag name (space or end). Returns index. |
| html_parse | Parse HTML buffer into events. |
| html_event_count | Return event count. |
| html_event_type | Return event type at index (1=open, 2=close, 3=text). |
| html_event_tag | Copy tag name at event idx to out. Return length. |
| html_event_text | Copy text content at event idx to out. Return length. |
| html_strip_tags | Strip all HTML tags, return plain text length. |
| _html_poke_amp | Internal: poke a string literal byte sequence for HTML escape. |
| _html_poke_lt | |
| _html_poke_gt | |
| _html_poke_quot | |
| html_escape | Escape HTML special characters: & < > " |
Details
_html_byte_at
fn _html_byte_at(buf: &i8, idx: i64) -> i64
Internal byte reader.
_html_copy
fn _html_copy(dst: &i8, doff: i64, src: &i8, soff: i64, n: i64) -> i64
Internal: copy n bytes from src+soff to dst+doff.
_html_add_evt
fn _html_add_evt(h: &i64, typ: i64, off: i64, ln: i64) -> i64
Internal: add an event to the handle.
_html_find_gt
fn _html_find_gt(data: &i8, pos: i64, len: i64) -> i64
Internal: find ‘>’ starting from pos. Returns index of ‘>’ or len if not found.
_html_find_lt
fn _html_find_lt(data: &i8, pos: i64, len: i64) -> i64
Internal: find ‘<’ starting from pos. Returns index of ‘<’ or len if not found.
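The "index or len if not found" convention shared by _html_find_gt and _html_find_lt can be modelled in a few lines. The sketch below is Python written only to illustrate that contract; `find_byte`, `find_lt` and `find_gt` are hypothetical names, not the library's code.

```python
def find_byte(data: bytes, pos: int, length: int, target: int) -> int:
    """Return the index of the first `target` byte in data[pos:length],
    or `length` when no such byte exists (the documented sentinel)."""
    i = pos
    while i < length:
        if data[i] == target:
            return i
        i += 1
    return length


def find_lt(data: bytes, pos: int, length: int) -> int:
    # Hypothetical stand-in for _html_find_lt.
    return find_byte(data, pos, length, ord("<"))


def find_gt(data: bytes, pos: int, length: int) -> int:
    # Hypothetical stand-in for _html_find_gt.
    return find_byte(data, pos, length, ord(">"))
```

Returning len rather than a negative value lets a caller use the result directly as the end of a slice: when no further ‘<’ exists, everything from pos to the result is trailing text.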
_html_tag_end
fn _html_tag_end(data: &i8, start: i64, limit: i64) -> i64
Internal: find end of tag name (space or end). Returns index.
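One plausible reading of "space or end" is a scan that stops at the first space byte or at limit, whichever comes first. The Python sketch below models that reading. It assumes limit is the index of the closing ‘>’ and that only the space byte (0x20) terminates a name; neither detail is confirmed by the reference above, and `tag_end` is a hypothetical name.

```python
def tag_end(data: bytes, start: int, limit: int) -> int:
    """Return the index just past the tag name that begins at `start`.

    Assumption: the name ends at the first space byte or at `limit`.
    """
    i = start
    while i < limit and data[i] != 0x20:
        i += 1
    return i


# Usage: isolate the name of a tag that carries an attribute.
tag = b'<a href="x">'
gt = tag.index(b">")          # 11, used here as `limit`
name = tag[1:tag_end(tag, 1, gt)]
```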
html_parse
fn html_parse(buf: &i8, len: i64) -> &i64
Parse HTML buffer into events.
html_event_count
fn html_event_count(h: &i64) -> i64
Return event count.
html_event_type
fn html_event_type(h: &i64, idx: i64) -> i64
Return event type at index (1=open, 2=close, 3=text).
html_event_tag
fn html_event_tag(h: &i64, idx: i64, out: &i8) -> i64
Copy tag name at event idx to out. Return length.
html_event_text
fn html_event_text(h: &i64, idx: i64, out: &i8) -> i64
Copy text content at event idx to out. Return length.
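To make the event model concrete, here is a small Python sketch of a SAX-style pass that yields the three documented event types (1=open, 2=close, 3=text). It is an illustration under assumptions, not the library's implementation. It assumes a tag whose first character is ‘/’ is a close event, that empty text runs produce no event, and that an unterminated tag is dropped. `parse_events` and the tuple representation are hypothetical, and comments, self-closing tags and attribute values containing ‘>’ are not handled.

```python
OPEN, CLOSE, TEXT = 1, 2, 3  # type codes documented for html_event_type


def parse_events(buf: bytes):
    """Return a list of (type, payload) tuples for a flat HTML buffer."""
    events = []
    n = len(buf)
    pos = 0
    while pos < n:
        lt = buf.find(b"<", pos)
        if lt == -1:
            lt = n  # same "len if not found" convention as the finders
        if lt > pos:
            events.append((TEXT, buf[pos:lt]))
        if lt >= n:
            break
        gt = buf.find(b">", lt + 1)
        if gt == -1:
            break  # assumption: an unterminated tag is dropped
        is_close = buf[lt + 1:lt + 2] == b"/"
        name_start = lt + 2 if is_close else lt + 1
        name_end = name_start
        # The tag name runs up to the first space or the closing '>'.
        while name_end < gt and buf[name_end] != 0x20:
            name_end += 1
        events.append((CLOSE if is_close else OPEN, buf[name_start:name_end]))
        pos = gt + 1
    return events


events = parse_events(b"<p>Hi <b>you</b></p>")
```

In terms of the documented API, `len(events)` plays the role of html_event_count, the first tuple element corresponds to html_event_type, and the payload is what html_event_tag or html_event_text would copy into out.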
html_strip_tags
fn html_strip_tags(buf: &i8, len: i64, out: &i8) -> i64
Strip all HTML tags, return plain text length.
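Tag stripping with a caller-supplied output buffer can be sketched as a single pass with an "inside a tag" flag. The Python below is a model of the described behaviour only. `strip_tags` is a hypothetical name, and treating a stray ‘>’ outside a tag as ordinary text is an assumption.

```python
def strip_tags(buf: bytes, out: bytearray) -> int:
    """Copy every byte that lies outside <...> into `out`.

    Returns the number of bytes written. `out` must be at least
    len(buf) bytes long, since the output never exceeds the input.
    """
    oi = 0
    in_tag = False
    for b in buf:
        if b == ord("<"):
            in_tag = True
        elif b == ord(">") and in_tag:
            in_tag = False
        elif not in_tag:
            out[oi] = b
            oi += 1
    return oi


out = bytearray(32)
n = strip_tags(b"<p>Hi <b>you</b></p>", out)
text = bytes(out[:n])
```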
_html_poke_amp
fn _html_poke_amp(out: &i8, oi: i64) -> i64
Internal: poke a string literal byte sequence for HTML escape.
_html_poke_lt
fn _html_poke_lt(out: &i8, oi: i64) -> i64
_html_poke_gt
fn _html_poke_gt(out: &i8, oi: i64) -> i64
_html_poke_quot
fn _html_poke_quot(out: &i8, oi: i64) -> i64
html_escape
fn html_escape(buf: &i8, len: i64, out: &i8) -> i64
Escape HTML special characters: & < > "
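The escaping step can be sketched as a table lookup that writes either the original byte or its entity into the output buffer. This Python model covers only the four characters listed above. It assumes the standard entities &amp;amp;, &amp;lt;, &amp;gt; and &amp;quot;, and that the return value is the number of bytes written, which the reference does not state. `escape` and `ESCAPES` are hypothetical names.

```python
# Assumed entity table for the four documented characters.
ESCAPES = {
    ord("&"): b"&amp;",
    ord("<"): b"&lt;",
    ord(">"): b"&gt;",
    ord('"'): b"&quot;",
}


def escape(buf: bytes, out: bytearray) -> int:
    """Write the escaped form of `buf` into `out`; return bytes written.

    `out` should hold up to 6 * len(buf) bytes, because the longest
    entity (&quot;) expands one input byte to six.
    """
    oi = 0
    for b in buf:
        rep = ESCAPES.get(b)
        if rep is None:
            out[oi] = b
            oi += 1
        else:
            out[oi:oi + len(rep)] = rep  # same-length slice, written in place
            oi += len(rep)
    return oi


out = bytearray(64)
n = escape(b'a<b & "c">', out)
escaped = bytes(out[:n])
```

The single quote is left untouched here because it is not among the characters the reference lists, so output from this scheme is only safe inside double-quoted attribute values or element content.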