Playing with O*NET visualizations for degree program proposals
Edit: Found it! Paul Ingemi located the corresponding "this is for programmers!" website, which I have no idea how I overlooked. But onetcenter.org has everything for download in different db formats. Yesssss.
I'm working on a small project involving how to propose new STEM-related degree programs, and wanted to jot down this resource and think-aloud about my attempts to understand how I might use it.
The Department of Labor has a website front-end for O*NET, a database with information about jobs. Job outlooks (is this field expected to grow, etc.?) are one of the pieces of information, number of people in that industry, and other stats I expected to see -- but then others I hadn't thought of, such as "which skills does this job require?"
That last bit allows comparisons of easy lateral moves -- if you're trained in career A, and want to switch, what professions are most similar (but may not be in an obviously similar industry)? For instance, it seems logical to me that a truck driver would also probably be a good train operator, but I wouldn't have thought they would be using skills similar to an explosives worker... but okay, yes, big risky pieces of equipment that you have to operate... yes.
This NYT article on career-switching has interactive visualizations that play with the O*NET data. I can't immediately figure out how -- there's no source code or obvious API on the O*NET side. I wonder how easy it would be to hook to the database, or even scrape it if need be (but that seems silly; there ought to be an interface -- at the same time, that doesn't mean there actually is one). I wonder how we might use this information to think about how to choose new degree programs to start.
For instance, we could look at popular degree programs that cannot possibly accept all of the students that apply, and find skill-adjacent careers to try and expand the number of things students with similar interests can go into (if expanding existing degree programs is not an option). We could screen potential new programs not only by job growth outlook, but by lateral move possibilities - what sorts of degrees will give students the widest range of options if they want to do something different post-graduation, or make it easier to switch majors before graduation? (Engineering degrees are notoriously bad for switching-into from another major, since they require so many specific prerequisite course chains.)
So there's that one question on my mind -- what might we do with this data if we could play with it? And then there's the second question, which is: how might we (on a technical level) play with it?
The O*NET website doesn't have a listed API that I can find. I cannot figure out how to directly query the data, short of going all Ryan Mitchell on it and scraping the heck out of a lot of pages. This seems extremely silly. Maybe I'm missing something. But when Sebastian and I looked (thanks for sanity-checking me, Sebastian!) there wasn't any indication how the NYT piece pulled data from the O*NET database to make the visualizations. We suspect it may be a case of "we threw programmer-hours at the problem" as opposed to "this dataset was easy to manipulate."
So as to not wander down rabbit holes, I've messaged the article authors asking if there's an easy way to learn more about their methodology. And then... onwards, to working on degree program proposals. Ah, research life.