LogoTeluq
Français
Logo
Open access research
publication repository

Parsing Millions of URLs per Second [r-libre/3098]

Nizipli, Yagiz, & Lemire, Daniel (In Press). Parsing Millions of URLs per Second. Software: Practice and Experience. https://doi.org/10.1002/spe.3296

File(s) available for this item:
[img]  PDF - urlparser-14.pdf
Content : Submitted Version
License : Creative Commons Attribution.
 
Item Type: Journal Articles
Refereed: Yes
Status: In Press
Abstract: URLs are fundamental elements of web applications. By applying vector algorithms, we built a fast standard-compliant C++ implementation. Our parser uses three times fewer instructions than competing parsers following the WHATWG standard (e.g., Servo's rust-url) and up to eight times fewer instructions than the popular curl parser. The Node.js environment adopted our C++ library. In our tests on realistic data, a recent Node.js version (20.0) with our parser is four to five times faster than the last version with the legacy URL parser.
Depositor: Lemire, Daniel
Owner / Manager: Daniel Lemire
Deposited: 20 Nov 2023 15:59
Last Modified: 09 Dec 2023 15:36

Actions (login required)

RÉVISER RÉVISER