D28921: Search for JSON-LD inside of microdata tree

Kai Uwe Broulik noreply at phabricator.kde.org
Fri Apr 17 15:31:13 BST 2020


broulik created this revision.
broulik added a reviewer: vkrause.
Herald added a project: KDE PIM.
Herald added a subscriber: kde-pim.
broulik requested review of this revision.

REVISION SUMMARY
  I've noticed it quite often that websites annotate their `<body>` as `WebPage` and then have a `<script type="application/ld+json>` inside. We would not find such a tag since we're only on the lookout for more microdata once processing one.

TEST PLAN
  - Existing unittests still pass
  - Comes with a new one to cover this usecase
  
  - We find `NewsArticle` annotations in articles on rheinpfalz.de

REPOSITORY
  R1003 KItinerary: Travel Reservation handling library

REVISION DETAIL
  https://phabricator.kde.org/D28921

AFFECTED FILES
  autotests/structureddata/empty-microdata-with-json-ld-inside.html
  autotests/structureddata/empty-microdata-with-json-ld-inside.json
  src/generic/structureddataextractor.cpp

To: broulik, vkrause
Cc: kde-pim, fbampaloukas, dcaliste, dvasin, rodsevich, winterz, vkrause, mlaurent, knauss, dvratil
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.kde.org/pipermail/kde-pim/attachments/20200417/c4e62b06/attachment.html>


More information about the kde-pim mailing list