Equivalent scrapbook

…scrap on Regexp

Rexp extract content inside body tag

html_string.scan(/<body>(.*)<\/body>/im)

custom links

url.match('^https*:[\w._]*github')
# http://github
# https://github
# https://www.github
# http://www.github

get rid of all non ASCII characters

s.gsub!(/\P{ASCII}/, '')

or

s.delete!("^\u{0000}-\u{007F}")