Hello everyone, i need some help i'm working on a new app, but i'm stuck on something, i don't really understand the REGEX function and i really need some help, the HTML information i need to extract is for example, file size, uploaded by: Seeders: Leechers: and Quality.
Attached is a text file with the HTML information, and here's is a portion of what i need to extract.
I can extract this information by using this pattern:
but i also need to get the information that goes after each of those tags.
Can anyone please help me with this, or does anyone have any examples i can use?
:BangHead:
View attachment HTML_Code.txt
Thanks Everyone!
Cheers,
Walter
Attached is a text file with the HTML information, and here's is a portion of what i need to extract.
<dt>Size:</dt>
<dd>704.4 MiB (738613989 Bytes)</dd>
<br />
<dt>Info:</dt>
<dd><a href="http://www.imdb.com/title/tt1931533/" target="_blank" title="IMDB" rel="nofollow">IMDB</a></dd> <dt>Spoken language(s):</dt>
<dd>English</dd>
<dt>Tag(s):</dt>
<dd><a href="/tag/Seven">Seven</a> <a href="/tag/Psychopaths">Psychopaths</a> <a href="/tag/2012">2012</a> <a href="/tag/DVDSCR">DVDSCR</a> <a href="/tag/XviD">XviD</a> <a href="/tag/AbSurdiTy">AbSurdiTy</a> </dd>
<dt>Quality:</dt>
<dd id="rating" class="">
+9 / -1 (+8) </dd>
<br />
<dt>Uploaded:</dt>
<dd>2013-01-07 15:18:04 GMT</dd>
<dt>By:</dt>
<dd>
<a href="/user/scene4all/" title="Browse scene4all">scene4all</a> <img src="//static.thepiratebay.se/img/vip.gif" alt="VIP" title="VIP" style="width:11px;" border='0' /></dd>
<br />
<dt>Seeders:</dt>
<dd>19546</dd>
<dt>Leechers:</dt>
<dd>2974</dd>
I can extract this information by using this pattern:
<dt>.*</dt>
[1] => <dt>Files:</dt>
[2] => <dt>Size:</dt>
[3] => <dt>Info:</dt>
[4] => <dt>Spoken language(s):</dt>
[5] => <dt>Tag(s):</dt>
[6] => <dt>Quality:</dt>
[7] => <dt>Uploaded:</dt>
[8] => <dt>By:</dt>
[9] => <dt>Seeders:</dt>
[10] => <dt>Leechers:</dt>
[11] => <dt>Comments</dt>
[12] => <dt>Info Hash:</dt>
but i also need to get the information that goes after each of those tags.
Can anyone please help me with this, or does anyone have any examples i can use?
:BangHead:
View attachment HTML_Code.txt
Thanks Everyone!
Cheers,
Walter