To complete the SSRank.io app, I needed to scrape the current player lists from Scoresheet.com. However, the bodies of their player list pages are nearly devoid of HTML elements; they are just huge blocks of text.
So how do you pull only specific elements out of each line while keeping groups of lines separate? Regular expressions, of course. Treat the whole page body as one string, and manipulate away.
I can’t recommend the tool enough.
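Here is a minimal sketch of the idea in Python, not the actual SSRank.io code. The sample text, the line layout (id, position, "Last, First", three-letter team code), the blank-line group separator, and the names `SAMPLE_BODY`, `GROUP_SPLIT`, `PLAYER_LINE`, and `parse_player_groups` are all assumptions made for illustration; the real Scoresheet pages may be laid out differently.

```python
import re

# Hypothetical sample of a page body; the real layout may differ.
SAMPLE_BODY = """\
PITCHERS
101 P Adams, Al NYA
102 P Baker, Bo BOS

CATCHERS
201 C Clark, Cy CHN
"""

# Assumption: groups of lines are separated by one or more blank lines.
GROUP_SPLIT = re.compile(r"\n\s*\n")

# Assumption: each player line is "id position Last, First TEAM".
PLAYER_LINE = re.compile(
    r"^(?P<id>\d+)\s+(?P<pos>[A-Z0-9]+)\s+"
    r"(?P<name>[^,\n]+,\s*\S+)\s+(?P<team>[A-Z]{3})$",
    re.MULTILINE,
)


def parse_player_groups(body):
    """Split the body into groups, then pull selected fields from each line."""
    groups = {}
    for block in GROUP_SPLIT.split(body.strip()):
        # The first line of each block is treated as the group heading.
        heading, _, rest = block.partition("\n")
        groups[heading.strip()] = [
            (m["id"], m["name"], m["team"]) for m in PLAYER_LINE.finditer(rest)
        ]
    return groups


if __name__ == "__main__":
    for heading, players in parse_player_groups(SAMPLE_BODY).items():
        print(heading, players)
```

Splitting on blank lines first keeps each group separate, and the named capture groups make it easy to keep only the fields you want (here the position is matched but dropped).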