Android Tutorial [B4X] Regular expressions (RegEx) tutorial

WZSun · Dec 29, 2010

Hi Erel,
Thanks for the insight... it sure does help inspired me to think harder..

Below is a quick StringParse sub that I did to retrieve a sample date

s = "12/31/2010"
s1 = StrParse(s,"/",2)
msgbox(s1,"Info") ' returns 2010

Sub StrParse(FirstStr As String, sSeparator As String, idx As Int) As String
Dim strArray() As String, l As List
strArray = Regex.Split("[" & sSeparator & "\s]", FirstStr)
l.Initialize2(strArray)
Return l.Get(idx)
End Sub

Rgds
WZSun

devjet · Apr 26, 2011

RegexBuddy

Btw, this is a very good tool to speed up Regex issues.
RegexBuddy: Learn, Create, Understand, Test, Use and Save Regular Expression
Regards
Hans

Foz · Sep 27, 2011

I think I'm missing something here...

If I do a Split, into a dynamic string array, how do I then get the resulting array size?

Erel · Sep 27, 2011

B4X:

Dim arr() As String
arr = Regex.Split(...)
For i = 0 To arr.Length - 1
 Log(arr(i))
Next

Foz · Sep 27, 2011

:sign0161:
Thank you Erel!

sigh... I was doing an inline assign which you obviously can't do, and it didn't like it and thus never showed the Length field and wouldn't compile.

One of these days I'll get my head screwed on straight...

ChrShe · Feb 17, 2013

Some quick Regex.Matcher help...

Good day!

I've been tinkering around with the Regex.Matcher and have run into a bit of a snag that I was hoping I could get some help with.

I'm parsing a web page with the following lines:

<div class="list-animal-info-block">
<div class="list-animal-name"><a href="wsAdoptableAnimalDetails.aspx?id=13069119&css=adoptableSearch.css" >Jed</a></div>
<div class="list-animal-id">13069119</div>
<div class="list-anima-species">Dog</div>
<div class="list-animal-sexSN">Male/Neutered</div>
<div class="list-animal-breed">Terrier, American Pit Bull/Mix</div>
<div class="list-animal-age">2 years 9 months</div>
<div class="hidden">Dog Large</div>

What I need to get is the InnerText of each div line. So, for example, for the Line-animal-id, I'd like to have "13069119" returned.

Using the following, I've been able to get the matcher to find the line, but can't seem to figure out returning the portion of the line that I'm interested in.

B4X:

 Regex.Matcher("class=\""list-animal-name\""",page)

So, basically, how do I get the matcher to return the portion of the found line that I want?

Any help is greatly appreciated.
THANK YOU!!!
~Chris

Erel · Feb 18, 2013

It is better to start a new thread for such questions.

If the string is a valid XML (XHTML) then you can use an XML parser to parse it.

With Regex you need something like:

B4X:

"class=\""([^""]+)\"">([^>]+)</div>" 'group 1 will hold the class attribute and group 2 the text.

LucaMs · Jan 27, 2014

Erel said:
Regular expressions are very powerful and make complicate parsing challenges much easier...

The groups feature is very useful. If you find yourself calling String.IndexOf together with String.Substring multiple times, it is a good hint that you should move to a Regex and Matcher.

I found

When I met regular expressions, I quickly abandoned them.
I thought: "Too much time to learn them, I hurry faster with string functions."

This your last sentence makes me think, though.

Am I wrong or they could be very useful for creating a command line parser and for break (split, group or grrrr) HTML blocks/Tags?

Erel · Jan 27, 2014

The recommended way to parse HTML is with the jTIDY library.

Alberto Michelis · Jul 7, 2015

How to check only alphabetical chars and spaces?
Alberto Michelis OK
Alberto,Michelis Wrong
Alberto2Michelis Wrong
Thanks

Erel · Jul 8, 2015

Please start a new thread for this question.

Sandman · Jun 17, 2017

I know this is a very old thread, but I thought it might be useful to post a link to a webpage showing RegEx patterns:

https://docs.oracle.com/javase/7/docs/api/java/util/regex/Pattern.html

(I searched the forum and couldn't find a list.)

MaFu · Jun 19, 2017

I use this tool to create and test RegEx pattern:
http://www.ultrapico.com/ExpressoDownload.htm
Very useful imho.

Erel · Jun 19, 2017

There is a tool written especially for B4X: https://b4x.com:51041/regex_ws/index.html

victormedranop · Nov 1, 2017

I need to parse this string "Resultado : Q;11#1;P;12#1;T;13#23;Q;14#2;Q;21#2;P;22#2;T;23#3;Q;31#3;P;32#3;T;33#9;SP;34#10;SP;35#6;Q;41#6;P;42#6;T;43#12;SP;44#11;SP;45#7;Q;51#7;P;52#7;T;53#13;SP;54#14;SP;55#20;Q;61#20;P;62#20;T;63#21;Q;71#21;P;72#21;T;73#22;C;81

the result should be

Q;11#1;
P;12#1;
T;13#23;
Q;14#2;
Q;21#2;
P;22#2;
T;23#3;
Q;31#3;
P;32#3;
T;33#9;
SP;34#10;
SP;35#6;
Q;41#6;
P;42#6;
T;43#12;
SP;44#11;
SP;45#7;
Q;51#7;
P;52#7;
T;53#13;
SP;54#14;
SP;55#20;
Q;61#20;
P;62#20;
T;63#21;
Q;71#21;
P;72#21;
T;73#22;
C;81

any help will be appreciated.

Victor

MaFu · Nov 1, 2017

This regex pattern should work:
(([A-Z]+;\d+#\d+

|([A-Z]+;\d+))

Erel · Nov 2, 2017

This is not the correct place to post such questions. Always start a new thread for your question.

epiCode · May 24, 2022

Erel said:
Basic4android uses Java regular expression engine. See this page for specific nuances related to this engine: Pattern (Java Platform SE 6)

Sorry for posting in old thread... but is this still relevant ? Need to check if negative lookahead is supported.

Erel · May 25, 2022

Android Tutorial [B4X] Regular expressions (RegEx) tutorial

WZSun

Member

devjet

Member

Foz

Member

Erel

B4X founder

Foz

Member

ChrShe

Member

Erel

B4X founder

LucaMs

Expert

Erel

B4X founder

Alberto Michelis

Well-Known Member

Erel

B4X founder

Sandman

Expert

MaFu

Well-Known Member

Erel

B4X founder

victormedranop

Well-Known Member

MaFu

Well-Known Member

Erel

B4X founder

epiCode

Active Member

Erel

B4X founder

Similar Threads