I’m just over 20% of the way through the initial journal scan for GOA6 (3,200 of 15,676–I discovered another duplicate), so it’s a good time for a quick progress report.
Note that, as before, I sort journals by publisher before checking–because many multijournal publishers use the same templates for all journals, making it easier for me to find fee data and do article counts.
For GOA6, that means I’ve now checked partway through Elsevier (Medicine in Novel Technology and Devices); so far, the 2020 article count is 226,676, but that will almost certainly go up.
Last year, that range of publishers included 2,965 journals, which published 195,025 articles in 2019. (THE 2019 FIGURE IN THE FIRST PROGRESS REPORT WAS WRONG: that figure should be 100,055.)
This year, problematic journals include 41 malware cases, one that’s not OA, and 65 unreachable/unworkable (half of them Cambridge). These will all be rechecked. In addition to all those 503 errors, I see 13 404, one 403, 9 SSL certificate problems, 1 (other) database error, 7 DNS failures, 3 cases of fraud, one apparently hijacked case, 9 malware, one phishing, 18 trojans, and a few others.
For what it’s worth, the same range of publishers last year wound up with 7 journals that had malware but could be analyzed, 33 that had to be reached through an alternate address, 13 malware-not-countable cases, and one unreachable. I’d guess we’ll wind up with similar proportions this year.
So does doing one-tenth in the first 12 days of the year mean I’ll finish the first pass at the end of April (that is, around 120 days into the year)? Possible but unpredictable. Elsevier journals can be checked very rapidly, maybe even faster than BMC (once I figured out the right advanced-search strategy); I don’t believe most other large clusters are that easy. So we shall see: no predictions until I’m at least three-quarters finished! (Best guess is very late April or early to mid-May.)