wallabag/inc/3rdparty/site_config/standard/blogs.smithsonianmag.com.txt

# metadata
author://div[@class = 'post']/div[@class='meta']/a[1]
date://div[@id = 'rap']/h2[1]
body://div[@class = 'post']
# wrapping caption and image
wrap_in(fieldset)://div[contains(@class, 'wp-caption')]
# clean up
strip://div[@class = 'post']/h3[@class = 'storytitle']
strip://div[@class = 'post']/div[@class = 'social']
strip://img[@style = 'display:none;']
strip://img[@height='0' and @width='0']
test_url: http://blogs.smithsonianmag.com/adventure/2011/10/tips-for-women-traveling-in-turkey/