Stan Schwertly on 28 Oct 2011 08:03:23 -0700


[Date Prev] [Date Next] [Thread Prev] [Thread Next] [Date Index] [Thread Index]

[Philadelphia-pm] Perl one liner, regex capture group problem


I wrote this one-liner for a friend, and wanted to replace the end with Perl. It pulls out the image sources from a URL:

command: curl -so- http://www.wikihow.com/Make-Easy-Homemade-Biscuits|egrep -o "src="" -c 5-
abbreviated output:
'http://pad2.whstatic.com/images/thumb/3/31/Gfrollsonplate_198.jpg/-crop-44-33-40px-Gfrollsonplate_198.jpg'
'http://pad1.whstatic.com/skins/WikiHow/images/corner_sprite.png'
'http://pad3.whstatic.com/images/thumb/7/71/Bread-rolls-2126.jpg/-crop-44-33-44px-Bread-rolls-2126.jpg'
'http://pad1.whstatic.com/skins/WikiHow/images/corner_sprite.png'

I tried to replace it with the following command, but it doesn't seem to be respecting the capture group:

command: curl -so- http://www.wikihow.com/Make-Easy-Homemade-Biscuits|perl -nE "say $1 if /src="">
abbreviated output:
                                <a href="" href="http://www.wikihow.com/Make-Pineapple-Biscuits">http://www.wikihow.com/Make-Pineapple-Biscuits'><img class='rounders2_img' alt='' src="" href="http://pad2.whstatic.com/images/thumb/b/be/Pineapple-tart.jpg/-crop-44-33-36px-Pineapple-tart.jpg">http://pad2.whstatic.com/images/thumb/b/be/Pineapple-tart.jpg/-crop-44-33-36px-Pineapple-tart.jpg' />

                                <img class='rounders2_sprite' alt='' src="" href="http://pad1.whstatic.com/skins/WikiHow/images/corner_sprite.png'/">http://pad1.whstatic.com/skins/WikiHow/images/corner_sprite.png'/>

It's printing the matched line, but doesn't populate $1 correctly. What should I change?

BR
Stan Schwertly
_______________________________________________
Philadelphia-pm mailing list
Philadelphia-pm@pm.org
http://mail.pm.org/mailman/listinfo/philadelphia-pm