Jump to content

Topic on Talk:Citoid/Archive 2

Please, return more data: PMC, PMID, ISSN, Publisher

7
Summary by Arthurfragoso

There was a communication problem in the main WP servers with the NIH servers.

The SysOp team fixed it. :)

Arthurfragoso (talkcontribs)

As I have asked in https://en.wikipedia.org/wiki/Wikipedia_talk:ProveIt#Automatic_retrieve_other_information:_PMID,_PMC,_ISSN_and_publisher.


If I do a DOI search, it returns me the articles' title, authors etc. But it won't return PMID, PMC, ISSN, publisher and other data that I can easily get manually.


It's possible to get PMID and PMC from the DOI using this API: https://www.ncbi.nlm.nih.gov/pmc/tools/id-converter-api/


Google Scholar always return me the ISSN and Publisher, although they don't have an API, but probably there are others that do.

AManWithNoPlan (talkcontribs)

Citoid only returns primary data. It’s doesn’t do the next step of asking PubMed do you recognize this doi. ~~~~

Arthurfragoso (talkcontribs)

Ok, now I understand. Who retrieves the data is Zetero. I installed it and did some tests:

If I do a DOI search, it returns me fewer data: https://pastebin.com/2rY86CVf

If I do an URL search, it returns me DOI, PMID, PMC, etc: https://pastebin.com/8mLQ696X


I tried to install citoid, but it failed to build, so I tested the wikipedia server:

DOI search: https://en.wikipedia.org/api/rest_v1/data/citation/mediawiki/10.1053%2Fj.ackd.2013.08.006

URL search: https://en.wikipedia.org/api/rest_v1/data/citation/mediawiki/https%3A%2F%2Fwww.ncbi.nlm.nih.gov%2Fpubmed%2F24206604


The URL returns me unknown_error, that's probably why I don't use it. :(


"type":"https://mediawiki.org/wiki/HyperSwitch/errors/unknown_error"

Arthurfragoso (talkcontribs)
Mvolz (WMF) (talkcontribs)

Thanks for reporting, that nothing is getting through from the pubmed website at all is separate issue from the one above! And a much more serious problem, unfortunately.

Mvolz (WMF) (talkcontribs)

So, we actually added support for this using the pubmed api in 2014 (https://phabricator.wikimedia.org/T1088). Unfortunately, the NIH api has a long, long history of falling over and causing citoid performance issues (https://phabricator.wikimedia.org/T133696). As a result we added a config variable and turned off requesting extra identifiers from their service in production in 2017 (https://phabricator.wikimedia.org/T162886). Since citoid has to be snappy enough to work in real time on user request, I don't see us changing that unless a more reliable / faster service could be found to supply the info.

My advice would be for a bot to do this work, because that can work in the background and therefore response time isn't as critical.

Arthurfragoso (talkcontribs)

It's now working! Cheers! Yay! :)