Difference between revisions of "ISFDB:Data Consistency/Disallowed URLs"
Jump to navigation
Jump to search
(Added links to the problem publication records) |
Mhhutchins (talk | contribs) |
||
Line 8: | Line 8: | ||
! Allowed? | ! Allowed? | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
|- | |- | ||
| amazon.com | | amazon.com | ||
| 38016 | | 38016 | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
| | | | ||
|- | |- | ||
| bestsf.net | | bestsf.net | ||
| 46 | | 46 | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
| | | | ||
|- | |- | ||
| bookscans.com | | bookscans.com | ||
| 131 | | 131 | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
| | | | ||
|- | |- |
Revision as of 00:13, 13 October 2010
Here is how many URLs we point to on a domain by domain basis. We may want to check with the owners of the domains that we haven't secured permission to point to yet and, if they refuse, zap the URLs. (Whether the URLs are still valid is a different question, one that I will explore in a later script.)
Domain | Number of URLs | Allowed? |
---|---|---|
amazon.com | 38016 | |
bestsf.net | 46 | |
bookscans.com | 131 | |
cox.net | 26 | |
dac-editions.com | 1: | |
danielarenson.com | 1: | |
deeden.co.uk | 1: | |
delsdog.co.uk | 1: | |
demon.co.uk | 2: | |
drwhoguide.com | 6: | |
earthlink.net | 1: | |
eclipse.co.uk | 94 | |
edcox.com | 1: | |
edirectory.co.uk | 6: | |
emory.edu | 1: | |
eskimo.com | 1: | |
eternalnight.co.uk | 1: | |
fantasticfiction.co.uk | 173 | |
flickr.com | 1: | |
forgottenfutures.com | 1: | |
fsbusiness.co.uk | 1: | |
generationterrorists.com | 1: | |
geocities.com | 3: | |
googlepages.com | 6: | |
gorelets.com | 2: | |
guptara.net | 1: | |
homestead.com | 1: | |
hostigos.com | 2: | |
houdinination.de | 1: | |
iblist.com | 1: | |
images-amazon.com | 5253 | |
isficpress.com | 2: | |
jpoc.net | 1: | |
kayobooks.com | 1: | |
kristine-smith.com | 1: | |
ksu.edu | 3: | |
librarything.com | 5: | |
lynnabbey.com | 1: | |
majipoor.com | 1: | |
mmedia.is | 1: | |
myspacecdn.com | 1: | |
nasa.gov | 1: | |
nesfa.org | 1: | |
net.au | 1: | |
neweyestudio.com | 1: | |
nicolagriffith.com | 1: | |
nightshadebooks.com | 1: | |
nildram.co.uk | 4: | |
nnbh.com | 1: | |
no-ip.org | 1: | |
nohttp | 6: | |
noosfere.com | 10 | |
northatlanticbooks.com | 1: | |
oclc.org | 2: | |
ofearna.us | 1: | |
oivas.com | 4: | |
oldearthbooks.com | 6: | |
onza.net | 2: | |
papergolem.com | 1: | |
paraworlds.com | 1: | |
pen-paper.net | 1: | |
penguingroup.com | 1: | |
perfectblissmovie.com | 1: | |
philsp.com | 573 | |
photobucket.com | 32 | |
piers-anthony.com | 2: | |
pjfarmer.com | 4: | |
pspublishing.co.uk | 1: | |
quixmart.co.uk | 1: | |
raelori.com | 2: | |
randomhouse.com | 2: | |
randyasplund.com | 1: | |
redjackbooks.com | 1: | |
rockpublishing.com | 1: | |
rudysbooks.com | 1: | |
scifan.com | 1: | |
scifi-az.com | 1: | |
scifi.com | 1: | |
sfcovers.net | 2036 | |
sfcrowsnest.com | 1: | |
sff.net | 2: | |
sfreviews.net | 2: | |
sfrevu.com | 1: | |
sfsite.com | 4: | |
sjgames.com | 10 | |
snapfish.com | 2: | |
sondheimguide.com | 6: | |
sophiaswereld.nl | 1: | |
sorgentedelcielo.it | 1: | |
strangewords.com | 1: | |
subterraneanpress.com | 1: | |
sylviaengdahl.com | 4: | |
tematika.com | 3: | |
trafford.com | 1: | |
tripod.com | 1: | |
ttrantor.org | 1: | |
ucr.edu | 1: | |
umn.edu | 3: | |
uncw.edu | 66 | |
ursamajorawards.org | 1: | |
uwaterloo.ca | 1: | |
webscription.net | 1: | |
welaforlag.se | 1: | |
wheatlandpress.com | 1: | |
wikimedia.org | 1: | |
wildsidebooks.com | 2: | |
wildsidepress.com | 1: | |
worldcat.org | 1: | |
yet.org | 3: | |
yimg.com | 1: | |
zarthani.net | 3: | |
zone-sf.com | 2: |