This is really interesting, thanks.
To point out to those who perhaps haven't looked at the file, many of these are hostnames, not domain names. And many of these are for non-web services, ie is a z39.50 server (api) which is probably used to allow third parties to search the library catalogue. Pointing this out before anyone jumps in with "15 thousand gov websites!"