Re: [arch-dev-public] [RFC] archweb nvchecker integration

2 Feb 2022

On 2/1/22 22:54, George Rawlinson via arch-dev-public wrote:
...
On 22-02-01 08:21, Morten Linderud via arch-dev-public wrote:
At this stage, the following [community] packages that I maintain
require massaging of HTML sources:
* html-xml-utils
* oil
* parallel
* libmilter (bundled with sendmail source)
* time
I suppose if a nvchecker plugin existed that utilised bs4 (beautiful
soup), that would work. But I assume that would still fit your
definition of "arbitrary script". :p

There is a regex plugin and an htmlparser plugin for this.

The htmlparser plugin accepts XPath, but if you need to process the
result further, the regex plugin may just work better.

Examples for your packages:

[html-xml-utils]
source = "regex"
url = "https://www.w3.org/Tools/HTML-XML-utils/"
regex = "html-xml-utils-(.*?).tar.gz"

[oil]
source = "htmlparser"
url = "https://www.oilshell.org/release/latest/"
xpath = "//h1/text()"
prefix = "Oil "
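
To make the two entries above more concrete, here is a rough Python sketch of what the regex capture and the prefix stripping amount to. It is only an illustration and not nvchecker's actual implementation. The HTML fragment, the version numbers, the "Oil 0.9.7" heading text and the helper names candidate_versions and strip_prefix are all made up for the example.

```python
import re

# Hypothetical fragment of a download page; the real page will differ.
SAMPLE_HTML = """
<a href="html-xml-utils-7.9.tar.gz">html-xml-utils-7.9.tar.gz</a>
<a href="html-xml-utils-8.0.tar.gz">html-xml-utils-8.0.tar.gz</a>
"""

# Same pattern as in the [html-xml-utils] entry above.
PATTERN = r"html-xml-utils-(.*?).tar.gz"


def candidate_versions(html: str, pattern: str) -> list:
    """Return each distinct string captured by group 1, in page order."""
    seen = []
    for match in re.finditer(pattern, html):
        version = match.group(1)
        if version not in seen:
            seen.append(version)
    return seen


def strip_prefix(text: str, prefix: str) -> str:
    """Drop a leading prefix, roughly what `prefix` does for the [oil] entry."""
    return text[len(prefix):] if text.startswith(prefix) else text


versions = candidate_versions(SAMPLE_HTML, PATTERN)
print(versions)
# Hypothetical text of the page's <h1>, as the XPath would return it.
print(strip_prefix("Oil 0.9.7", "Oil "))
```

Note that the dots in the pattern are unescaped, so they match any character. That is harmless here, but escaping them would make the match stricter. As far as I understand, nvchecker then picks the newest of the candidates it finds; the sketch stops at listing them.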

-- 
Regards,
Felix Yan