Archive for the ‘open access’ Category

Gold/Diamond OA usage stats

Wednesday, July 31st, 2024

As of today, here’s what I see:

Gold Open Access 2024: 101 PDF downloads, one paperback.

Diamond OA 2024: 2,500 PDF downloads, no paperbacks. (Without at least looking at Gold Open Access 2024, Diamond OA 2024 readers are lacking some context, but…

Dataset: 98 downloads.

Previous year (GOA8/Diamond 2023)

GOA8: 968 PDF copies, one paperback

Diamond 2023: 452 PDF downloads, no paperbacks

Dataset: 269 downloads.

GOA7:

1,476 PDF downloads (and one paperback).

Country7: 372 PDF downloads

Dataset: 389 downloads.

Gold Open Access Project: Usage Notes

Thursday, June 20th, 2024

The first notes for GOA24/DIA24 (“GOA9”), and the first note since February.

Gold Open Access 2024 (GOA9)

59 pdf ebooks and no paperbacks.

Dataset: 67 downloads, as far as I can tell (and 68 views at Figshare)

Diamond OA 2024

The big surprise (the power of Peter Suber and Mastodon?): 2,256 PDF ebooks. No paperbacks.

Gold Open Access 8

948 PDF ebooks, one print paperback (plus, of course, the copy I purchased)

Dataset: 264 downloads (and 664 views at Figshare)

Diamond OA 2023

435 PDF ebooks, no paperbacks.

Diamond OA 2024 is now available

Tuesday, June 18th, 2024

I’m delighted to announce that Diamond OA 2024: The World of No-Fee Open Access Publishing is now available as a free PDF or a nominally-priced ($8 US, as low as GBP6.25 or as high as CAD$13) trade paperback.

And that’s it for GOA9: I’ll provide stats in the next week or two and periodically through the rest of the year, and look at GOA10 in a few months. The bad news: diamond OA seems to be slipping somewhat. The good: still more than 400,000 articles in 2023.

Gold Open Access 2024 now available as book and dataset

Thursday, May 23rd, 2024

Gold Open Access 2024: Articles in Journals 2019-2023 (GOA9) is now available as an $11.50 color trade paperback or free PDF download. The dataset is also available at figshare (although it won’t currently preview) or directly from waltcrawford.name. For all links, go to https://waltcrawford.name/goaj.html

Diamond OA 2024: The World of No-Fee Open Access Publishing should be ready in July or very late June.

GOA9: Progress Report

Thursday, May 16th, 2024

I’m reworking on Gold Open Access 2024, and my best guess is that it will be complete some time this month–most likely between May 24 and May 31. “Complete” means that the paperback is available from Lulu, the free PDF is available on my website, and the dataset is on Figshare.

A best guess is just that.

If you’ve been using the preliminary dataset: there will be changes–around 30 additional rows, and one odd change that does not affect article counts.

Best guess for Diamond OA 2024, barring unusual crises, is three to five weeks after Gold Open Access 2024 is complete–maybe more, almost certainly not less. So very late June or early July, if crossing my fingers doesn’t make typing too slow.

GOA9: Preliminary dataset now available

Thursday, May 2nd, 2024

A preliminary version of the GOA9 dataset is now available at at https://waltcrawford.name/g9_prefig.xlsx. I believe this dataset will be identical to the final g9 dataset on Figshare, unless some surprises turn up during preparation of derivative data and the book.

The dataset proper includes 19,392 journals; the exclusions page has another 877 (including the “xd” journals). For more notes, see https://walt.lishost.org/2024/04/goa9-end-of-data-gathering/

[Note: crosschecking to prepare derivative data showed two keyboarding errors; they have been corrected and the preliminary dataset has been replaced.]

Do note that Malwarebytes seems to have restored its blanket categorization of all .name domains as suspicious…and I have no idea when or whether they’ll (once again) refine that overbroad warning.

GOA9: End of data gathering

Tuesday, April 30th, 2024

I’ve finished the smallest but most annoying (and slowest) part of data gathering: rechecking the xm2/xx2 journals. It wasn’t entirely worthless–53 were now a (normal), five were now inactive (bi), one was now xj (no longer in DOAJ), and one was now dead/defunct (xd).

Unfortunately, the malware problem is much worse than last year: 335 xm2 (compared to 260 last year) and, sigh, 313 xm (compared to 190 last year. It’s mostly Indonesia: almost precisely 2/3 of all malware cases (439 out of 648), including 157 of 313 new cases and 282 of 335 continuing (xm2). Next highest is Brazil with 42 new and five continuing, then Italy, Russia and Spain with a dozen each and Venezuela with 11. Nearly all Indonesia cases are in universities: either they’re not aware that they’re carrying malware (almost always in entry menus for Oa journals, not individual journals) or they just don’t care.

DOAJ has also been more actively removing journals this year than in the first four months of 2023: 249 journals compared to 41 last year. And there were more “not OA journals”–but only a dozen, mostly because when checked twice, they required login. (There are also two or three that are encyclopedias, not journals, and one that I find entirely mystifying–a journal devoted to one author that has neither dates nor issues but lists articles alphabetically; I’m unwilling to read each and every PDF to see what the dates are, if there are any.)

How often did I find a new URL (by searching on journal title + ISSN) that appeared to work properly as an entry point for the journal? 359, I believe.

The Big Numbers

In all, and barring cleanup during processing, 19,622 journals are fully analyzed, although 230 of those have no articles later than 2018 (xd). Those journals published 1,440,494 articles in 2023 and 1,445,733 in 2022–up just basely from last year and the first time there’s been a decline in overall articles (probably because a number of journals are very late with issue processing). NOTE: I’ve now moved xd journals to Excluded, where they belong. Doesn’t affect the article numbers, but does cut the overall number of fully-analyzed journals to 19,392.

18,430 journals are “normal” (a) and 510 had no 2022 or 2023 articles when checked (bi). The former is around 850 more journals than last year; the latter, 87 more.

Special Cases

Cases that will appear in regular tables:

  • bi (inactive): 510
  • xm (malware or certificate problems): 313
  • xx (unavailable or unworkable): 139

Cases that will not be in regular tables and will be on the Exclusions page of the dataset:

  • xd (dead/defunct): 230
  • xj (removed from DOAJ): 249
  • xm2 (continuing malware): 335
  • xn (not an OA journal or uncountable): 12
  • xx2 (continued unavailable/unworkable): 51

Next steps

It appears that I will make a preliminary version of the dataset available on my own website, probably on Thursday or Friday. It’s possible–but unlikely–that there will be slight changes in the final dataset, published when Gold Open Access 2024 is published.

Preliminary data now available

The preliminary data is now available at https://waltcrawford.name/g9_prefig.xlsx. I believe this dataset will be identical to the final g9 dataset on figshare, unless some surprises turn up during preparation of derivative data and the book.

Do be aware that Malwarebytes seems to think that all .name domains are potentially problematic. I’ve notified them that this isn’t so, and just as they fixed it previously I hope they’ll do so again–I’m not prepared to rebuild all of my files on another domain.

And beyond…

Now I’ll check data for conformity, add derived data (e.g., revenue figures for journals, size, growth…) and make sure the templates are working.

Then comes the “writing”–mostly generating tables and figures, but with some text as well. [Modified to allow for other priorities:] The book and formal data posting should be ready in the spring (that is, before the end of June).

Then, Diamond OA 2024, covering the 13,139 diamond OA journals that aren’t exclusions (almost precisely 2/3, but exactly 66.9605545%), mostly focusing on country-by-country. Figure another three to five weeks for that.

 

GOA9: End of Pass 2 (start of Pass 2b)

Saturday, April 27th, 2024

I’ve completed Pass 2, problematic journals and those where it seemed plausible that a new 2023 issue might emerge. Along the way, I also retested 35 of the xm2/xx2 journals, and–to my surprise–found five that could be resurrected looking for other URLs. So, given that there are only 456 of these, I’ll check them one more time.

At this point–omitting those 456–there are 19,563 included journals and 250 excluded journals. The 250 exclusions account for only 438 articles in 2023, which is hardly surprising (the total peaked at 8,511 in 2020). Nearly all of the exclusions so far, 239 of them, are journals removed from DOAJ since 1/1/2024; the other 11 are not OA journals–four encyclopedias and seven journals that now require login with no other obvious means of access (checked twice).

The 19,563 account for 1,441,817 articles in 2023 and 1,443,423 in 2022. 333 of them are at different URLs than the ones that appear in DOAJ. 6,470 have fees and 13,093–just over two-thirds–do not.

Special cases

  • bi [inactive: no articles since 2021]: 521
  • xd [dead: no articles since 2018]: 229
  • Note that many bi and xd have been continued under other names
  • xm [malware, but not in 2023]: 313
  • xx [unavailable or unworkable, but not in 2023]: 138

Yes, there are still far too many malware cases…especially given the xm2 still to be rechecked.

Now, on to the mercifully brief final pass. Then, some time off, and time to massage data, add derived data, and write the book(s).

Oh, and you’ve still got a few days to chime in on https://walt.lishost.org/2024/04/goa9-should-i-post-a-preliminary-dataset/

GOA9: Should I post a preliminary dataset?

Friday, April 26th, 2024

Here’s the question: Should I post a preliminary dataset for GOA9 when I finish the final data-gathering pass, rather than waiting until the book is ready (my usual practice)?

I’ll finish Pass 2 (rechecking most problematic cases that weren’t also problematic last year) this weekend. I’ve looked at enough of the “xm2/xx2” cases (ones that were problematic this year and last) to believe that it may be worth the few days required to recheck those. So the data will be ready some time next week–probably May 2-4.

At that point, after a day off, I do some normalization, then start adding derivative data (e.g., revenue for each fee-charging journal, size and cost brackets), then prepare the book–probably a four or five week process. (The Diamond OA by Country book gets done after I’ve published the book and uploaded the dataset to Figshare–probably in mid-July.)

Of course, I’d really like people to read the text treatment before using the data, and it is possible that normalization could result in some data changes (probably very few, maybe none). Ideally, I’d like a few people to buy the print book (always priced at the nearest half-dollar over actual production costs), but that seems like a lost cause…

So: If I post this, there could be some changes–and I would NOT be tracking or posting those changes.

I’ll decide around May 2, based on feedback. [The early post would add maybe an hour or two of work; that’s not a decision point.] Please leave feedback in comments, in email (waltcrawford@gmail.com), or to waltcrawford on Mastodon. (I’ve been off the deadbird for months now.)

What say you?

Gold/Diamond Scan, Toward Second Pass

Friday, April 12th, 2024

I’ve now done several cleanup things:

  • Checked to see whether any of the journals removed from DOAJ since 1/1/2024 had been restored by 4/12/2024, None had; those journals are all treated as excluded.
  • Compared codes for last year’s study with those this time around. If a journal is “xm” or “xx” and was also “xm” or “xx” last year, I stop checking it. There were a lot of these–adding those to the “xj” (removed) journals yields 948 journals that I marked as excluded and won’t recheck. Perhaps worth noting that those 948 only account for 9,757 articles in 2024 (where articles could be counted through other means) and 16,574 in 2023.
  • At this point, there are 2,510 journals to be rechecked and 17,177 complete (and not excluded, but including journals no longer publishing). Those journals account for 1,393,000 articles in 2023; 1,381,039 in 2022.
  • The rescan will start today and cover 2,510 journals. I’ll do the usual daily Mastodon posts. Hoping for two weeks…but I do some additional checking, so it could take longer.