Posted 26 Feb 2012

Counting lines in a string

Andreas Gieriet, 28 Feb 2012
Great analysis!

I found out that Regex can be accelerated by a factor of about two.

Instead of
new Regex(@"\n", RegexOptions.Compiled|RegexOptions.Multiline);

you can speed up by using:
new Regex(@"^.*?$", RegexOptions.Compiled|RegexOptions.Multiline);

But admittedly, nothing beats the native methods (IndexOf).

My statement above is wrong: I compared "$" (and not "\n") against "^.*?".
The measurements show that "\n" is the fastest of all the Regex patterns, while "$" is the slowest (5 times slower than "\n").
That's a real surprise to me.
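
To make the three patterns concrete, here is a minimal sketch that counts the matches of each one on a small string. The class name `RegexLineCount`, the helper `CountMatches`, and the sample text are assumptions made for illustration; they are not taken from the original benchmark. Note that the patterns do not necessarily return the same count for the same text, because "$" and "^.*?$" can also match at the very end of the string.

```csharp
using System;
using System.Text.RegularExpressions;

public static class RegexLineCount
{
    // Same options as in the tip above.
    private const RegexOptions Options = RegexOptions.Compiled | RegexOptions.Multiline;

    // The three patterns under discussion.
    public static readonly Regex Newline = new Regex(@"\n", Options);
    public static readonly Regex LazyLine = new Regex(@"^.*?$", Options);
    public static readonly Regex EndOfLine = new Regex(@"$", Options);

    // Hypothetical helper: walks the match chain and counts successful matches.
    public static int CountMatches(Regex regex, string text)
    {
        int count = 0;
        for (Match m = regex.Match(text); m.Success; m = m.NextMatch())
        {
            count++;
        }
        return count;
    }

    public static void Main()
    {
        // Illustrative sample only; the original test used 2.500.000 lines.
        string sample = "first\nsecond\nthird\n";

        Console.WriteLine("\\n    : " + CountMatches(Newline, sample));
        Console.WriteLine("^.*?$ : " + CountMatches(LazyLine, sample));
        Console.WriteLine("$     : " + CountMatches(EndOfLine, sample));
    }
}
```

Only the "\n" pattern counts line terminators directly, so it is the one to compare against an IndexOf-based count.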

The comparison:

Regex | Match [ms] for 2.500.000 lines | RegexOptions

For comparison: IndexOf('\n') takes only 237 ms.
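
For reference, an IndexOf-based count could look roughly like the sketch below. This is not the code from the original article; the class name `IndexOfLineCount`, the method `CountNewlines`, and the generated test text are assumptions, and timings will differ from machine to machine.

```csharp
using System;
using System.Diagnostics;
using System.Text;

public static class IndexOfLineCount
{
    // Hypothetical helper: counts '\n' characters by jumping from hit to hit with IndexOf.
    public static int CountNewlines(string text)
    {
        int count = 0;
        int position = 0;
        while ((position = text.IndexOf('\n', position)) >= 0)
        {
            count++;
            position++; // continue the search just after the newline found
        }
        return count;
    }

    public static void Main()
    {
        // Assumed test data: 2.500.000 short lines, mirroring the line count quoted above.
        var builder = new StringBuilder();
        for (int i = 0; i < 2500000; i++)
        {
            builder.Append("some line of text\n");
        }
        string text = builder.ToString();

        var watch = Stopwatch.StartNew();
        int lines = CountNewlines(text);
        watch.Stop();

        Console.WriteLine("{0} lines in {1} ms", lines, watch.ElapsedMilliseconds);
    }
}
```

Passing a start index equal to the string length is allowed by `String.IndexOf(char, int)`, so the loop ends cleanly when the text finishes with a newline.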



This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


About the Author

Andreas Gieriet
Founder eXternSoft GmbH
Switzerland
I feel comfortable on a variety of systems (UNIX, Windows, cross-compiled embedded systems, etc.) in a variety of languages, environments, and tools.
I have a particular affinity to computer language analysis, testing, as well as quality management.

More information about what I do for a living can be found at my LinkedIn Profile and on my company's web page (German only).


Comments and Discussions

Re: So Regex("^.*?$") is not faster than Regex("\n"), as you ori...
Ronald M. Martin, 29-Feb-12 7:06

Re: Ah, I see your initial question. "*" is greedy match (match ...
Andreas Gieriet, 28-Feb-12 16:48

Re: Let me rephrase my question. Assuming that your syntax (@"^....
Ronald M. Martin, 28-Feb-12 3:50

I don't understand the use of the question mark (?) in this ...
Ronald M. Martin, 27-Feb-12 17:27

Re: I simple measured a difference of a factor of about two. No ...
Andreas Gieriet, 27-Feb-12 21:21

Miller4, 26-Feb-12 6:54
Can anyone please explain why LinesCount2 is so slow? I thought it should be as fast as the LinesCountIndexOf.

Last Updated 28 Feb 2012
Article Copyright 2012 by Andreas Gieriet
Everything else Copyright © CodeProject, 1999-2018