Difference between revisions of "ISFDB:Data Consistency/Disallowed URLs"

From ISFDB
Jump to navigation Jump to search
(fixed ma.us)
 
(7 intermediate revisions by 2 users not shown)
Line 10: Line 10:
 
! Allowed?
 
! Allowed?
  
|-
 
| albin-michel.fr
 
| 11
 
| No
 
 
|-
 
|-
 
| amazon.ca
 
| amazon.ca
 
| 17
 
| 17
| ?
+
| Yes
 
|-
 
|-
 
| amazon.com
 
| amazon.com
Line 33: Line 29:
 
| eclipse.co.uk
 
| eclipse.co.uk
 
| 76
 
| 76
| ?
+
| Yes (Unapersson)
 
|-
 
|-
 
|-
 
|-
Line 49: Line 45:
 
|-
 
|-
 
| googlepages.com
 
| googlepages.com
| 3:
+
| 3
[http://www.isfdb.org/cgi-bin/pl.cgi?25002 25002]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?47306 47306]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?129951 129951]
 
 
| Yes? (Marc Kupper)
 
| Yes? (Marc Kupper)
 
|-
 
|-
Line 62: Line 55:
 
| 26022
 
| 26022
 
| D'oh!
 
| D'oh!
|-
 
| meow.fr
 
| 1:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?262371 262371]
 
| No
 
 
|-
 
|-
 
| mondourania.com
 
| mondourania.com
 
| 381
 
| 381
 
| Yes
 
| Yes
|-
 
| mottleshire.org
 
| 1:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?268421 268421]
 
| No
 
|-
 
| mpressbooks.co.uk
 
| 4:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?307693 307693]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?307694 307694]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?307695 307695]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?307696 307696]
 
| No
 
|-
 
| mushroom-ebooks.com
 
| 9:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?291255 291255]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?291256 291256]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?291321 291321]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?291331 291331]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?291334 291334]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?291335 291335]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?291336 291336]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?291337 291337]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?291380 291380]
 
| No
 
|-
 
| ndhansen-hill.com
 
| 3:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?273541 273541]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?273631 273631]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?273632 273632]
 
| No
 
|-
 
| netonecom.net
 
| 1:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?301704 301704]
 
| No
 
 
|-
 
|-
 
| nohttp
 
| nohttp
| 3:
+
| 3
| No
+
| ?
|-
 
| noosfere.org
 
| 2:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?271991 271991]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?271992 271992]
 
| ? (Collectors Showcase)
 
|-
 
| obversebooks.co.uk
 
| 3:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?317336 317336]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?317445 317445]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?317446 317446]
 
| No
 
 
|-
 
|-
 
| openlibrary.org
 
| openlibrary.org
| 1:
+
| 1
[http://www.isfdb.org/cgi-bin/pl.cgi?231837 231837]
 
 
| Yes
 
| Yes
|-
 
| orcabook.com
 
| 1:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?256422 256422]
 
| No
 
|-
 
| over-blog.com
 
| 3:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?261304 261304]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?261305 261305]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?261383 261383]
 
| No
 
|-
 
| penguingroup.com
 
| 1:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?255307 255307]
 
| No
 
 
|-
 
|-
 
| philsp.com
 
| philsp.com
 
| 2466
 
| 2466
 
| Yes (Galactic Central)
 
| Yes (Galactic Central)
|-
 
| pjfarmer.com
 
| 1:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?302971 302971]
 
| No
 
|-
 
| polluto.com
 
| 1:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?256138 256138]
 
| No
 
|-
 
| pspublishing.co.uk
 
| 1:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?272034 272034]
 
| No
 
|-
 
| quarante-deux.org
 
| 1:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?259017 259017]
 
| No
 
|-
 
| randomhouse.com
 
| 1:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?291014 291014]
 
| No
 
|-
 
| redrosepublishing.com
 
| 1:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?273544 273544]
 
| No
 
|-
 
| regalcrest.biz
 
| 1:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?257732 257732]
 
| No
 
|-
 
| rstuttle.com
 
| 24
 
| No
 
 
|-
 
|-
 
| sfcovers.net
 
| sfcovers.net
 
| 2263
 
| 2263
 
| Yes (Visco)
 
| Yes (Visco)
|-
 
| sfsite.com
 
| 4:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?286446 286446]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?286528 286528]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?302185 302185]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?304433 304433]
 
| No
 
|-
 
| shaunaroberts.com
 
| 2:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?321777 321777]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?321778 321778]
 
| No
 
|-
 
| sjgames.com
 
| 1:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?123101 123101]
 
| No
 
|-
 
| skyrock.com
 
| 1:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?262237 262237]
 
| No
 
|-
 
| smashwords.com
 
| 1:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?316708 316708]
 
| No
 
|-
 
| smithwriter.com
 
| 1:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?275613 275613]
 
| No
 
 
|-
 
|-
 
| thetrashcollector.com
 
| thetrashcollector.com
 
| 3
 
| 3
 
| Yes
 
| Yes
|-
 
| tout-resumer.fr
 
| 1:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?262372 262372]
 
| No
 
 
|-
 
|-
 
| uncw.edu
 
| uncw.edu
 
| 108
 
| 108
 
| Yes (Ace Image Library)
 
| Yes (Ace Image Library)
|-
 
| unl.edu
 
| 2:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?279422 279422]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?280314 280314]
 
| No
 
|-
 
| vanguardproductions.net
 
| 1:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?304553 304553]
 
| No
 
|-
 
| webscription.net
 
| 1:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?295917 295917]
 
| No
 
|-
 
| wildsidebooks.com
 
| 1:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?331033 331033]
 
| No
 
|-
 
| wordpress.com
 
| 2:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?257733 257733]
 
[http://www.isfdb.org/cgi-bin/pl.cgi?331418 331418]
 
| No
 
|-
 
| zarthani.net
 
| 1:
 
[http://www.isfdb.org/cgi-bin/pl.cgi?274972 274972]
 
| No
 
 
|}
 
|}

Latest revision as of 15:46, 21 October 2010

Here is how many URLs we point to on a domain by domain basis. We will want to check with the owners of the domains that we haven't secured permission to point to yet and, if they fail to grant permission, zap the URLs. (Whether the URLs are still valid is a different question, one that I will explore in a later script.) See ISFDB:Image linking permissions#List of sites granting permission for a list of sites that have granted permission.

URLs by domain determined as of 2010-10-16:


Domain Number of URLs Allowed?
amazon.ca 17 Yes
amazon.com 37198 Yes
bookscans.com 455 Yes
collectorshowcase.fr 200 Yes
eclipse.co.uk 76 Yes (Unapersson)
fantascienza.com 758 Yes
fantasticfiction.co.uk 1197 Yes
fatcow.com 43 Yes (Bookscans)
googlepages.com 3 Yes? (Marc Kupper)
images-amazon.com 37157 Yes
isfdb.org 26022 D'oh!
mondourania.com 381 Yes
nohttp 3 ?
openlibrary.org 1 Yes
philsp.com 2466 Yes (Galactic Central)
sfcovers.net 2263 Yes (Visco)
thetrashcollector.com 3 Yes
uncw.edu 108 Yes (Ace Image Library)