Open access research
publication repository
Nizipli, Yagiz, & Lemire, Daniel (2024). Parsing Millions of URLs per Second. Software: Practice and Experience, 54 (5). https://doi.org/10.1002/spe.3296
File(s) available for this item:
PDF
- urlparser-14.pdf
Content: Submitted Version. License: Creative Commons Attribution.
Item Type: | Journal Articles |
---|---|
Refereed: | Yes |
Status: | Published |
Abstract: | URLs are fundamental elements of web applications. By applying vector algorithms, we built a fast standard-compliant C++ implementation. Our parser uses three times fewer instructions than competing parsers following the WHATWG standard (e.g., Servo's rust-url) and up to eight times fewer instructions than the popular curl parser. The Node.js environment adopted our C++ library. In our tests on realistic data, a recent Node.js version (20.0) with our parser is four to five times faster than the last version with the legacy URL parser. |
Depositor: | Lemire, Daniel |
Owner / Manager: | Daniel Lemire |
Deposited: | 20 Nov 2023 15:59 |
Last Modified: | 05 Apr 2024 15:25 |