Pete Freitag Pete Freitag

Regular Expressions - Named Capture Groups

Published on March 21, 2005
By Pete Freitag
javamisc

I am porting some code from .NET to Java today, and found out about a very cool feature in regular expressions called Named Groups. Most regular expression implementations let you group text (which can then be used for back-references) using parenthesis, for example: my name is (pete). You then refer to the groups with numbers such as \1 or $1 to match the first group.

Python was the first language to introduce the ability to name groups in regular expressions, PHP along with .NET also supports them. For example my name is (?<fisrtname>pete) although it looks more complicated you can reference the group by name rather than by number. This is really handy in the code I'm porting because there are tons of regular expressions and the ordering of the groups may be different based on the pattern.

However it doesn't look like Java supports this feature, which is a bummer because we are going to have to implement it ourself by preparsing the pattern, and storing the order of the named groups.

It would be really cool if Java 1.6 supported this... Though I don't see any JSR's on the topic.



regex java dotnet

Regular Expressions - Named Capture Groups was first published on March 21, 2005.

If you like reading about regex, java, or dotnet then you might also like:

Weekly Security Advisories Email

Advisory Week is a new weekly email containing security advisories published by major software vendors (Adobe, Apple, Microsoft, etc).

Comments

Yes Anthony, I am aware of \1, what I am talking about is named backreferences.
by Pete Freitag on 03/21/2005 at 3:31:32 PM UTC
It looks like JRegex does support named groups:
http://jregex.sourceforge.net/

Marius
by Marius Scurtescu on 03/22/2005 at 1:00:52 PM UTC
Sorry, I took your reply about \1 to mean you didn't understand that Java uses $1. As for named groups, that's another matter.
by Jon on 01/18/2006 at 2:52:34 PM UTC