Say you have HTML similar to the following:

<div style="background-image: url('https://some.domain/image')"></div>

and you want to extract https://some.domain/image using XPath. With XPath 2.0, you can select the URL with something like

select-before(select-after(//div/@style, "backgound-image: url("), ")")

but, when using XPath 1.0, this fails — I think it’s due to nested functions not being supported in XPath 1.0, but I have been unable to find documentation to confirm that. Is there a way to accomplish this using XPath 1.0?

  • KalciferOP
    link
    fedilink
    English
    arrow-up
    1
    ·
    6 months ago

    If the code you have quoted is verbatim what you have tried, seems like you need to extract the parentheses and possibly a single or double quote, depending on the source css. […] But I don’t think that would cause xpath to fail… It would just extract the wrong value

    Ah, yeah, that wasn’t intentional — my bad! But, yeah, that would just result in the wrong value being returned, not the expression failing.