[dart2wasm] Refactor int parsing

This refactors dart2wasm's `int.parse` and `int.tryParse` implementations.

Current implementation is copied from VM, which supports 63-bit "smi"
integers and 32-bit integers. In dart2wasm we only support 64-bit
integers. This change updates the int parsing to handle 64-bit ints.

The changes can be summarized as refactorings plus one change.

Refactorings:

- Remove all mentions of "smi"s and 32-bit integer parsing support.

- Move patched public members `parse` and `tryParse` to the beginning of
  the patched class, to allow top-down reading of the file and separate
  entry points from internal functions.

- Refactor the `last` (inclusive last character index) parameter of
  `_tryParseSmi` (renamed to `_tryParseIntRadix10` in this CL) as `end`
  (exlusive last character index), to be consistent with the rest of the
  function in the same file.

- Remove redundant `null` checks from the pre-sound-null-safe days.

- Remove 32-bit constants from the `_PARSE_LIMITS` table.

Change:

Current code, when the input string is larger than the max. number of
digits that fit into the `int` type, parses one "block" at a time, then
combines the blocks.

(A block is a substring in the input that can be parsed as `int` without
overflows)

This makes the code very complicated (with lazily generated "overflow
limits" table, complicated logic to combine blocks while checking for
overflows) to handle just one digit after a block.

With this change we do something simpler: first we skip all leading
zeros. This part is new, current code does not skip leading zeros and
handle them as a part of a block. After the zeros we parse one block as
usual. After the block, we can parse at most two more digits without an
overflow (or underflow if the number is negative). Handling of these two
digits does not need to be optimized with special checks and table
lookups, because the amount of work done for the digits is small, and
branching and the cost of table lookups followed by more efficient code
will probably be slower than just handling digits in a simple way.

This change is done in `_parseRadix`. Rest of the changes in the file
are refactoring, as described above.

The `_PARSE_LIMITS` table with max. number of digits that fit into an
`int` is updated using this program:

```
void main() {
  final maxI64 = 9223372036854775807;
  for (int radix = 2; radix <= 36; radix += 1) {
    final str = maxI64.toRadixString(radix);
    print("Max I64 in radix $radix = $str, num digits = ${str.length}");
  }
}
```

For example, max. 64-bit signed integer in radix 20 is
"5cbfjia3fh26ja7", which has 15 digits. Unless all of the digits are the
largest digit of the radix, we need to subtract one. So the max. number
of digits for radix 20 is 14.

The only radixes where all digits are the largest digit are 2 and 8. In
these cases we can handle 63 and 21 digits respectively (instead of 62
and 20). In all other bases we subtract the number of digits printed by
the program above by one in the table.

# Benchmarks

Golem reports 61% improvement in the benchmark `Int.parse.0032.bits`.

Golem also reports 18% slowdown in Utf8Encode.sk.10M, however the Wasms
for that benchmark before and after this change are identical, so it
must be noise.

Change-Id: Ia35a50a0328e680be2d494405e13caaded1b7ad9
Reviewed-on: https://dart-review.googlesource.com/c/sdk/+/372281
Commit-Queue: Ömer Ağacan <omersa@google.com>
Reviewed-by: Martin Kustermann <kustermann@google.com>
1 file changed
tree: 3ef177cf468ba1c97540d7a6d4c1e49a34c776ca
  1. .dart_tool/
  2. .github/
  3. benchmarks/
  4. build/
  5. docs/
  6. pkg/
  7. runtime/
  8. samples/
  9. sdk/
  10. tests/
  11. third_party/
  12. tools/
  13. utils/
  14. .clang-format
  15. .gitattributes
  16. .gitconfig
  17. .gitignore
  18. .gn
  19. .mailmap
  20. .style.yapf
  21. AUTHORS
  22. BUILD.gn
  23. CHANGELOG.md
  24. codereview.settings
  25. CONTRIBUTING.md
  26. DEPS
  27. LICENSE
  28. OWNERS
  29. PATENT_GRANT
  30. PRESUBMIT.py
  31. README.dart-sdk
  32. README.md
  33. sdk.code-workspace
  34. sdk_args.gni
  35. sdk_packages.yaml
  36. SECURITY.md
  37. WATCHLISTS
README.md

Dart

An approachable, portable, and productive language for high-quality apps on any platform

Dart is:

  • Approachable: Develop with a strongly typed programming language that is consistent, concise, and offers modern language features like null safety and patterns.

  • Portable: Compile to ARM, x64, or RISC-V machine code for mobile, desktop, and backend. Compile to JavaScript or WebAssembly for the web.

  • Productive: Make changes iteratively: use hot reload to see the result instantly in your running app. Diagnose app issues using DevTools.

Dart's flexible compiler technology lets you run Dart code in different ways, depending on your target platform and goals:

  • Dart Native: For programs targeting devices (mobile, desktop, server, and more), Dart Native includes both a Dart VM with JIT (just-in-time) compilation and an AOT (ahead-of-time) compiler for producing machine code.

  • Dart Web: For programs targeting the web, Dart Web includes both a development time compiler (dartdevc) and a production time compiler (dart2js).

Dart platforms illustration

License & patents

Dart is free and open source.

See LICENSE and PATENT_GRANT.

Using Dart

Visit dart.dev to learn more about the language, tools, and to find codelabs.

Browse pub.dev for more packages and libraries contributed by the community and the Dart team.

Our API reference documentation is published at api.dart.dev, based on the stable release. (We also publish docs from our beta and dev channels, as well as from the primary development branch).

Building Dart

If you want to build Dart yourself, here is a guide to getting the source, preparing your machine to build the SDK, and building.

There are more documents in our repo at docs.

Contributing to Dart

The easiest way to contribute to Dart is to file issues.

You can also contribute patches, as described in Contributing.

Roadmap

Future plans for Dart are included in the combined Dart and Flutter roadmap on the Flutter wiki.