commit | 2b392a4b8d739edeac3552dff3f614219a733955 | |
---|---|---|
author | cvolzke4 <45087979+cvolzke4@users.noreply.github.com> | Fri Sep 20 09:06:20 2019 +1000 |
committer | Nicholas Shahan <nshahan@google.com> | Thu Sep 19 16:06:20 2019 -0700 |
tree | 03a867819af1d72e181a77f4612d7c7c82493fc0 | |
parent | d37f5887e327a245abb9fc3675446dd96b172957 | |
Fix spans generated for HTML with higher-plane unicode characters (#109)
This is a pure Dart HTML5 parser. It's a port of html5lib from Python.
Parsing HTML is easy!
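
    import 'package:html/parser.dart' show parse;
    import 'package:html/dom.dart';

    void main() {
      // The parser is lenient: the unclosed <body> and <a> tags are
      // closed automatically in the resulting tree.
      final document = parse(
          '<body>Hello world! <a href="www.html5rocks.com">HTML5 rocks!');
      print(document.outerHtml);
    }
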
You can pass a `String` or a list of bytes to `parse`. There's also `parseFragment` for parsing a document fragment, and `HtmlParser` if you want more low-level control.
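
The three entry points mentioned above can be sketched roughly as follows. This is a minimal illustration, not part of the original README: it assumes a null-safe release of the package, and the helper names `firstLinkHref` and `fragmentNodeCount` are hypothetical, introduced only for this example.

```dart
import 'dart:convert' show utf8;

import 'package:html/dom.dart';
import 'package:html/parser.dart' show HtmlParser, parse, parseFragment;

/// Hypothetical helper: parses [html] as a full document and returns the
/// href of the first <a> element, or null if there is none.
String? firstLinkHref(String html) =>
    parse(html).querySelector('a')?.attributes['href'];

/// Hypothetical helper: parses [html] as a fragment and returns the number
/// of top-level nodes in the resulting DocumentFragment.
int fragmentNodeCount(String html) => parseFragment(html).nodes.length;

void main() {
  // parse accepts a String...
  print(firstLinkHref('<a href="www.html5rocks.com">HTML5 rocks!</a>'));

  // ...or a list of bytes. The encoding is stated explicitly here rather
  // than left to detection.
  final Document fromBytes =
      parse(utf8.encode('<p>Hello world!'), encoding: 'utf-8');
  print(fromBytes.body?.text);

  // parseFragment returns a DocumentFragment instead of a full Document.
  print(fragmentNodeCount('<span>one</span><span>two</span>'));

  // HtmlParser exposes the parser object itself, so the recorded parse
  // errors can be inspected after parsing.
  final parser = HtmlParser('<p>Unclosed paragraph');
  final Document document = parser.parse();
  print(document.outerHtml);
  print('parse errors recorded: ${parser.errors.length}');
}
```

Using `HtmlParser` directly is mainly useful when you need something the convenience functions do not return, such as the list of parse errors.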