
Wildcard string compare (globbing)

15 Feb 2005
Matches a string against a wildcard pattern such as "*.*" or "bl?h.*", where '*' matches any run of characters (including none) and '?' matches exactly one character. This is useful for file globbing or for matching hostmasks.

Usage:

This is a fast, lightweight, and simple pattern matching function.

if (wildcmp("bl?h.*", "blah.jpg")) {
  //we have a match!
} else {
  //no match =(
}

Function:

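#include <stddef.h>  /* for NULL */

int wildcmp(const char *wild, const char *string) {
  // Written by Jack Handy - jakkhandy@hotmail.com
  const char *cp = NULL, *mp = NULL;

  // Match character by character until the first '*' (or the end of string).
  while ((*string) && (*wild != '*')) {
    if ((*wild != *string) && (*wild != '?')) {
      return 0;
    }
    wild++;
    string++;
  }

  while (*string) {
    if (*wild == '*') {
      if (!*++wild) {
        return 1;  // a trailing '*' matches whatever is left
      }
      mp = wild;      // pattern position just after the '*'
      cp = string+1;  // where to resume in string if this attempt fails
    } else if ((*wild == *string) || (*wild == '?')) {
      wild++;
      string++;
    } else {
      // Mismatch: rewind to the last '*' and let it absorb one more character.
      wild = mp;
      string = cp++;
    }
  }

  // Only trailing '*' characters may remain in the pattern for a match.
  while (*wild == '*') {
    wild++;
  }
  return !*wild;
}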
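
The discussion below asks for unit tests. What follows is a minimal table-driven sketch, not something supplied by the author. It assumes the function above is used unchanged; the function is repeated so the block compiles on its own. The names wild_case, wild_cases and wildcmp_failures are hypothetical, and the cases are only a small sample of edge conditions (empty strings, backtracking across several '*', a hostmask-style pattern).

```c
#include <stddef.h>
#include <stdio.h>

/* Copy of the article's function, repeated so this block compiles on its own. */
int wildcmp(const char *wild, const char *string) {
  const char *cp = NULL, *mp = NULL;

  while ((*string) && (*wild != '*')) {
    if ((*wild != *string) && (*wild != '?')) {
      return 0;
    }
    wild++;
    string++;
  }

  while (*string) {
    if (*wild == '*') {
      if (!*++wild) {
        return 1;
      }
      mp = wild;
      cp = string + 1;
    } else if ((*wild == *string) || (*wild == '?')) {
      wild++;
      string++;
    } else {
      wild = mp;
      string = cp++;
    }
  }

  while (*wild == '*') {
    wild++;
  }
  return !*wild;
}

/* Hypothetical test-table entry: pattern, input, expected result (1 = match). */
struct wild_case {
  const char *wild;
  const char *string;
  int expected;
};

static const struct wild_case wild_cases[] = {
  {"bl?h.*", "blah.jpg", 1},      /* the usage example from the article */
  {"*.*", "a.b", 1},
  {"*.*", "readme", 0},           /* no '.' in the string */
  {"*", "", 1},                   /* '*' matches the empty string */
  {"", "", 1},
  {"?", "", 0},                   /* '?' needs exactly one character */
  {"a*b*c", "aXbXbYc", 1},        /* backtracking across two '*' */
  {"*.jpg", "photo.jpg.bak", 0},  /* pattern must match to the end */
  {"*!*@*.example.com", "nick!user@host.example.com", 1},
};

/* Runs every case, reports mismatches, and returns the number of failures. */
int wildcmp_failures(void) {
  size_t i;
  int failures = 0;

  for (i = 0; i < sizeof wild_cases / sizeof wild_cases[0]; i++) {
    const struct wild_case *c = &wild_cases[i];
    int got = wildcmp(c->wild, c->string) != 0;
    if (got != c->expected) {
      printf("FAIL: wildcmp(\"%s\", \"%s\") = %d, expected %d\n",
             c->wild, c->string, got, c->expected);
      failures++;
    }
  }
  return failures;
}
```

Calling wildcmp_failures() from main and returning its result as the exit status is enough to wire this into a build script. Keeping the cases in a table makes it easy to add one line for each bug report raised in the discussion.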

License

This article has no explicit license attached to it but may contain usage terms in the article text or the download files themselves. If in doubt please contact the author via the discussion board below.



Written By
Web Developer
United States

Comments and Discussions

 
Question: Fastest wildcard function benchmarked with 3 compilers (Sanmayce, 28-Nov-22 14:55)
Question: Unit tests please (richarno, 2-Dec-19 22:03)
Suggestion: nice algorithm, but with a weakness (senzabandiera, 7-Oct-16 23:24)
General: delphi port (vovach777, 23-Dec-14 3:38)
Question: My vote of 5+ (Sanmayce, 29-Nov-13 7:47)
Hi Mr. Handy,
It is so good to see etude developers in C. I don't know how, but I hadn't seen your function until three or four days ago.

I postponed all my activities in an attempt to come up with some gem.
Last night you kicked my ass: my amateurish interest in wildcard matching led me to write my own function (in fact a semi-port of Igor Pavlov's code).
I did my best to make it superfast, and I succeeded, but it still failed to outperform yours: your function is faster than mine for both short and long strings. BRAVO!

I already had a wildcard searcher that works just fine, a very versatile one, but slow. Therefore I added a FAST add-on to my 3-in-1 searcher Kazahana, which now offers VERSATILE (9 wildcards) and FAST (the classic 2 wildcards) modes. I also tested mine against yours using 2 threads. In short, both are really fast: the 2 threads utilize 180-192% of the CPU and achieve 140-170MB/s TOTAL traversal speed; see further below.

Having failed to "kick your ass", I bend a knee before you, but only temporarily; I need more time to clear my sight. In the meantime, it would be nice if some real programmer(s) helped me speed up my etude.
If you can speed up my function, please do so; I will appreciate your broad-mindedness. Dethroning your own with your own is a sweet feeling.

Since I am fond of benchmarking and endless results logs, you are welcome to visit my dedicated Kazahana article to see them.

Being a UFC fan, I see my defeat in the light of Johny Hendricks' defeat by the champion Georges St-Pierre some weeks ago. Johny rocks; I like his style, and his interviews are worth seeing, such as:
Johny Hendricks: "I Am the Champion" (UFC 167 Post-Press Conference)

Best,
Georgi 'Kaze'

P.S.
I couldn't help it; here are just some of them:

The big benchmark, searching all lines in a 1024MB Wikipedia dump:
My function is used in executable: Kazahana_r1-++fix+nowait_critical_nixFIX_WolfRAM+fixITER_HEXADECAD-Threads_IntelV12.exe
Your function is used in executable: Kazahana_r1-++fix+nowait_critical_nixFIX_WolfRAM+fixITER_HEXADECAD-Threads_IntelV12_JH.exe

The runs follow; my wildcards '&'/'+' stand for '*'/'?':
Speed results for pattern "&karolina&wydra&":
D:\_KAZE>timer Kazahana_r1-++fix+nowait_critical_nixFIX_WolfRAM+fixITER_HEXADECAD-Threads_IntelV12.exe "&karolina&wydra&" enwiki-20130904-pages-articles.7z.001 1536 >>Results.txt
Kazahana, a superfast exact & wildcards & Levenshtein Distance (Wagner-Fischer) searcher, r. 1-++fix+nowait_critical_nixFIX_Wolfram+fixITER, copyleft Kaze 2013-Nov-29.
Enforcing FAST wildcard mode ...
omp_get_num_procs( ) = 2
omp_get_max_threads( ) = 2
Enforcing HEXADECAD i.e. hexadecuple-threads ...
Allocating Master-Buffer 1536KB ... OK
/; 00,000,160,591 bytes/clock
Kazahana: Total/Checked/Dumped xgrams: 9,382,307/7,914,526/0
Kazahana: Performance: 156 KB/clock
Kazahana: Performance: 1,401 xgrams/clock
Kazahana: Performance: Total/fread() clocks: 6,694/654
Kazahana: Performance: I/O time, i.e. fread() time, is 9 percents
Kazahana: Performance: RDTSC I/O time, i.e. fread() time, is 1,334,917,342 ticks
Kazahana: Done.
Timer 9.01 : Igor Pavlov : Public domain : 2009-05-31

Kernel Time  =     0.717 =    9%
User Time    =    13.041 =  178%
Process Time =    13.759 =  188%
Global Time  =     7.298 =  100%

D:\_KAZE>timer Kazahana_r1-++fix+nowait_critical_nixFIX_WolfRAM+fixITER_HEXADECAD-Threads_IntelV12_JH.exe "&karolina&wydra&" enwiki-20130904-pages-articles.7z.001 1536 >>Results.txt
Kazahana, a superfast exact & wildcards & Levenshtein Distance (Wagner-Fischer) searcher, r. 1-++fix+nowait_critical_nixFIX_Wolfram+fixITER, copyleft Kaze 2013-Nov-29.
Enforcing FAST wildcard mode ...
omp_get_num_procs( ) = 2
omp_get_max_threads( ) = 2
Enforcing HEXADECAD i.e. hexadecuple-threads ...
Allocating Master-Buffer 1536KB ... OK
/; 00,000,167,227 bytes/clock
Kazahana: Total/Checked/Dumped xgrams: 9,382,307/7,914,526/0
Kazahana: Performance: 163 KB/clock
Kazahana: Performance: 1,459 xgrams/clock
Kazahana: Performance: Total/fread() clocks: 6,428/639
Kazahana: Performance: I/O time, i.e. fread() time, is 9 percents
Kazahana: Performance: RDTSC I/O time, i.e. fread() time, is 1,308,754,183 ticks
Kazahana: Done.
Timer 9.01 : Igor Pavlov : Public domain : 2009-05-31

Kernel Time  =     0.748 =   11%
User Time    =    12.230 =  181%
Process Time =    12.979 =  192%
Global Time  =     6.729 =  100%
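
The runs above pass patterns written with '&' and '+' in place of '*' and '?'. A pattern in that notation has to be mapped back before it can be handed to wildcmp. Here is a minimal sketch of such a mapping; the helper name amp_plus_to_glob is hypothetical, and this is not Kazahana's actual code, which is not shown in this comment.

```c
#include <stddef.h>

/*
 * Hypothetical helper (not part of Kazahana or the article): copies src into
 * dst, replacing '&' with '*' and '+' with '?'. dst_size is the size of dst in
 * bytes, including room for the terminating '\0'.
 * Returns 0 on success, or -1 if dst is too small.
 */
int amp_plus_to_glob(const char *src, char *dst, size_t dst_size) {
  size_t i;

  for (i = 0; src[i] != '\0'; i++) {
    if (i + 1 >= dst_size) {
      return -1;  /* no room for this character plus the terminator */
    }
    if (src[i] == '&') {
      dst[i] = '*';
    } else if (src[i] == '+') {
      dst[i] = '?';
    } else {
      dst[i] = src[i];
    }
  }
  if (i >= dst_size) {
    return -1;  /* only reachable when dst_size is 0 */
  }
  dst[i] = '\0';
  return 0;
}
```

With this helper, "&karolina&wydra&" becomes "*karolina*wydra*". One limitation of so simple a mapping is that a literal '&' or '+' can no longer appear in the pattern.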



General: My vote of 5 (Franc Morales, 29-May-13 15:47)
Question: help required for wilcard matching * and # (SaimaAsif, 23-Feb-12 23:56)
General: My vote of 5 (Plamen Petrov, 13-Dec-11 21:37)
Suggestion: Modification with '#' as wildcard joker for digits (Thomas Haase, 25-Sep-11 23:16)
Question: Licence Question (randommark, 23-Nov-10 0:33)
Answer: Another C# version, with a twist (Thomas Levesque, 29-Jun-10 14:50)
General: Obscurity (Chuck O'Toole, 25-Apr-10 18:18)
Answer: My C# contribution - recursive, of course! (RenniePet, 26-Mar-10 5:21)
General: Re: My C# contribution - recursive, of course! (Erwin de GRoot, 29-Mar-10 1:58)
General: Depends on whether you need to optimize the last few nanoseconds out of it... (RenniePet, 29-Mar-10 7:45)
General: Sorry - revised numbers (RenniePet, 29-Mar-10 8:35)
General: Re: Depends on whether you need to optimize the last few nanoseconds out of it... (Erwin de GRoot, 29-Mar-10 8:37)
General: Yet another version - 25% faster, I think [modified] (RenniePet, 1-Apr-10 8:24)
General: Re: Yet another version - 25% faster, I think (aleks1k, 21-Sep-11 2:47)
Question: I used this function but I how I can catch variables from the * ??? (moh.hijjawi, 20-Oct-09 1:55)
Answer: Re: I used this function but I how I can catch variables from the * ??? (RenniePet, 1-Apr-10 11:27)
Question: any updates ? (kiquenet.com, 2-Jul-09 5:12)
General: Improved matching with end-of-text (Anders Heie, 11-May-09 15:20)
General: Re: Improved matching with end-of-text: some cases don't work properly! (roadrunner314, 12-Aug-09 3:35)
