LogoTeluq
Français
Logo
Open access research
publication repository

Parsing Millions of URLs per Second [r-libre/3098]

Nizipli, Yagiz, & Lemire, Daniel (2024). Parsing Millions of URLs per Second. Software: Practice and Experience, 54 (5). https://doi.org/10.1002/spe.3296

File(s) available for this item:
[img]  PDF - urlparser-14.pdf
Content : Submitted Version
License : Creative Commons Attribution.
 
Item Type: Journal Articles
Refereed: Yes
Status: Published
Abstract: URLs are fundamental elements of web applications. By applying vector algorithms, we built a fast standard-compliant C++ implementation. Our parser uses three times fewer instructions than competing parsers following the WHATWG standard (e.g., Servo's rust-url) and up to eight times fewer instructions than the popular curl parser. The Node.js environment adopted our C++ library. In our tests on realistic data, a recent Node.js version (20.0) with our parser is four to five times faster than the last version with the legacy URL parser.
Depositor: Lemire, Daniel
Owner / Manager: Daniel Lemire
Deposited: 20 Nov 2023 15:59
Last Modified: 05 Apr 2024 15:25

Actions (login required)

RÉVISER RÉVISER