Archive for March, 2021

GOA6: Update 8

Tuesday, March 30th, 2021


Time for another GOA6 checkpoint, at 12,800 of 15,676.

Note that, as always, I sort journals by publisher before checking–because many multijournal publishers use the same templates for all journals, making it easier for me to find fee data and do article counts.

For GOA6, that means I’ve now checked through publisher Universidade Federal do Rio de Janeiro and title Mana. So far, the 2020 article count is 911,525, and that will almost certainly go up. The 2019 total for this set of journals is 741,882 articles.

Last year, that range of publishers included 11,402 journals, which published 758,050 articles in 2019. So there’s a net gain of 1,398 added journals so far.

For this group of 1,600 journals–ignoring the first 11,200–problematic journals include 53 malware cases and 109 unreachable/unworkable.

Looking more closely at the malware cases for these 1,600 journals, there are eight security-certificate problems, five phishing and 39 Trojans–including seven at Universidade Federal de Alagoas and five at Universidade Estadual de Montes Claros.

How confident am I that we’ll reach a million articles? The remaining 2,866 journals had 95,177 articles in 2019, so it’s not certain, but likely. We shall see…
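
As a rough check on that estimate, here is a minimal Python sketch. It reads the 2019 figure for the remaining journals as 95,177 and assumes, purely for illustration, that those journals publish exactly as many articles in 2020 as they did in 2019. The function name and the flat-growth assumption are hypothetical, not part of the study's method.

```python
def project_total(counted_so_far: int, remaining_prior_year: int,
                  growth: float = 1.0) -> int:
    """Project a final article count.

    counted_so_far: 2020 articles already counted in checked journals.
    remaining_prior_year: 2019 articles for journals not yet checked.
    growth: assumed 2020/2019 ratio for the unchecked journals
            (1.0 means no change; this is an illustrative assumption).
    """
    return counted_so_far + round(remaining_prior_year * growth)


# Figures from this checkpoint: 911,525 counted so far,
# 95,177 articles in 2019 for the journals still to be checked.
projected = project_total(911_525, 95_177)
print(projected)  # 1006702
```

Under that flat assumption the total lands just over a million, which fits "not certain, but likely": a modest drop in the remaining journals would be enough to miss it.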

This is an interesting segment, nearly all university journals from Latin American countries or Spain and Portugal. [Actually, one from Sweden, 32 from Portugal, 160 from Spain and all the rest from 18 Latin American countries, with Brazil accounting for 743.] Unsurprisingly, that also means an even higher percentage of no-fee/diamond than overall (likely to be around 70%): of the 1,434 journals fully analyzed out of this 1,600, only 48 have fees.

I’d say it’s now very probable that the first scan will be done in late April, barring illness or other unexpected events–other things are taking up more time, but some 400 of the remaining 2,866 should be relatively fast. We shall see. That would leave some checking and the long rescans. (So far, about 1,800 journals need rechecking; the final number will probably exceed 2,000.)

So no overall target date yet…



Angry?

Friday, March 26th, 2021

Just for fun, I’ve been going through my listening collection–all ripped from owned CDs using MusicBee to FLAC, played back on a Cowon Plenue high-fidelity player–by “genre,” presumably supplied by crowdsourcing to whatever metadata database MusicBee uses. (Background)

Last night, I finished one odd genre and scrolled to the next: Angry.

So what’s included (from my collection, that is)?

One album: No Secrets, by Carly Simon.

Really? Angry? The album shows a confident, talented woman. One song (the basis for the album title) shows her disappointed in her lover/boyfriend/spouse/whatever. Another, the big hit, is “You’re So Vain.” Of the songs on the album, those are as close as I could come to anything even resembling anger, and you’d really be stretching it in either case (especially the latter, which I still love).

My thought went out to whoever supplied that genre: I hope you got help.

GOA6: Seventh note

Wednesday, March 17th, 2021

Time for another GOA6 checkpoint, at 11,200 of 15,676.

Note that, as before, I sort journals by publisher before checking–because many multijournal publishers use the same templates for all journals, making it easier for me to find fee data and do article counts.

For GOA6, that means I’ve now checked through publisher Universidad de Guadalajara and title Sincronía. So far, the 2020 article count is 860,506, and that will almost certainly go up. The 2019 total for this set of journals is 741,882 articles.

Last year, that range of publishers included 9,986 journals, which published 711,296 articles in 2019. So there’s a net gain of 1,214 added journals so far.

For this group of 1,600 journals–ignoring the first 9,600–problematic journals include 53 malware cases, 98 unreachable/unworkable, six non-OA journals (registration required), a few assorted situations, and 29 that had to be found at a different address. Some 20-odd of the 98 are almost certainly very temporary: the second half of a university’s journals all had DNS failures, the morning after the first half were fine.

Looking more closely at the malware cases for these 1,600 journals, there are seven security-certificate problems, three malware in general, six phishing and 36 Trojans.

How confident am I that we’ll reach a million articles? Well, the remaining 4,467 journals had 140,498 articles in 2019, so unless there are fewer articles in 2020 and no gain from the remaining newly-added journals (about 567 of them), it seems likely.
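
To put a number on "unless there are fewer articles in 2020," the following Python sketch computes how far the remaining journals could fall below their 2019 output and still bring the total to a million. It deliberately ignores any contribution from newly added journals, so it is a conservative illustration rather than a forecast; the function name is hypothetical.

```python
MILLION = 1_000_000


def break_even_ratio(counted_so_far: int, remaining_prior_year: int,
                     target: int = MILLION) -> float:
    """Return the 2020/2019 ratio the unchecked journals must reach
    for the overall total to hit the target."""
    shortfall = target - counted_so_far
    return shortfall / remaining_prior_year


# Figures from this checkpoint: 860,506 counted so far,
# 140,498 articles in 2019 for the journals still to be checked.
ratio = break_even_ratio(860_506, 140_498)
print(f"{ratio:.1%}")  # 99.3%
```

In other words, the remaining journals would need to publish about 99.3% of their 2019 volume, before counting anything from the newly added journals.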

I’ve been running a bit ahead of expected schedule. That may slow down for a bit, for reasons that relate to April 15 and for other non-study reasons. But it’s looking good to complete the first scan by the end of April, followed by the slower second scan and final malware check…

By the way: I *love* to see other studies based on this work, including the spreadsheet, but if you’re planning such a study, please read the book– https://waltcrawford.name/goa5.pdf , or blow $11 on the color paperback at Lulu. There are some useful caveats and other subtleties that aren’t in the spreadsheet.

GOA6: Sixth Report

Friday, March 5th, 2021


Time for another GOA6 checkpoint, at 9,600 of 15,676–and this one’s a mixed bag.

Note that, as before, I sort journals by publisher before checking–because many multijournal publishers use the same templates for all journals, making it easier for me to find fee data and do article counts.

For GOA6, that means I’ve now checked through publisher SpringerOpen and title Chinese Journal of Mechanical Engineering. So far, the 2020 article count is 789,685, and that will almost certainly go up slightly. The 2019 total for this set of journals is 672,583 articles.

Last year, that range of publishers included 8,614 journals, which published 640,867 articles in 2019. So there’s a net gain of 986 added journals so far.

For this group of 1,600 journals–ignoring the first 8,000–problematic journals include 46 malware cases, 86 unreachable/unworkable, one non-OA journal (registration required)–and the unfortunate part, 249 that had to be found at a different address. A few of those are DergiPark, but most are Sciendo, because parent company DeGruyter implemented a new website that broke all the links to journals it had moved to Sciendo, and hasn’t yet updated DOAJ records. (They’d all be unreachable, but I saw the problem and managed a workaround of sorts.)

I have thoughts about DeGruyter/Sciendo. I will not burden you with them.

Looking more closely at the malware cases for these 1,600 journals, there are eight security-certificate problems, one exploit, one spyware, one malware in general, seven phishing–and 28 Trojans.

Now, on to the next 1,600…