:::

詳目顯示

回上一頁
題名:論搜尋引擎以程式在網路上自動抓取資料時可能面臨之法律問題及其解決之道
書刊名:圖書館學與資訊科學
作者:廖先志陳鍾誠
出版日期:2007
卷期:33:1
頁次:頁53-66
主題關鍵詞:搜尋引擎Crawler程式侵權行為著作權法Search engineCrawlerTortsCopyright
原始連結:連回原系統網址new window
相關次數:
  • 被引用次數被引用次數:期刊(1) 博士論文(0) 專書(0) 專書論文(0)
  • 排除自我引用排除自我引用:1
  • 共同引用共同引用:0
  • 點閱點閱:50
搜尋引擎必須以crawler程式(又稱spider程式)來自動抓取網頁並建立索引,如果crawler程式僅僅循著網頁所提供的超連結來搜尋網頁並抓取內容,稱為一般性的crawler程式;如果不論網頁是否提供超連結,crawler程式會自行計算並找到網頁的所有內容並加以抓取,此種crawler則稱為深度crawler。而crawler抓取網頁內容的步驟可以細分為「取得資料」及「儲存並建立索引」二大步驟。在「取得資料」階段中,深度crawler雖然是自行透過演算法來取得網頁的所有內容,但本文認為仍不至於構成非法存取(unauthorized access)。此外,不論是一般的crawler或是深度crawler,如果取得網頁內容時會耗費網站資源而干擾網站的正常運作,就可能構成如美國eBay案中討論的財產侵害(trespass to chattel)。在「儲存並建立索引」階段中,原則上應該不會侵害網頁擁有者之重製權,然而、有些搜尋引擎 (例如 Google) 將其取得的內容以「庫存頁面」(cache)的方式允許使用者存取,此時即有爭議發生,但本文以為,由於搜尋引擎的主要目的是在使網路使用者更容易接觸網頁,所以此種「重製」與「散布」行為原則上應有著作權法「合理使用」原則的適用,故不會構成侵害著作權,但仍應考慮搜尋引擎與原網站之間是否處於競爭關係,以及所抓取之資料量佔原網站之比例等因素綜合判斷。要解決搜尋引擎與網站間可能發生的法律爭議,除可以強化現行的robot exclusion標準外,網站也可以考慮增強自動過濾crawler的功能,以杜絕爭議。
A Search engine uses a crawler (or a spider) to retrieve and index web pages. A “general crawler” crawls across the Internet by following only the hyperlinks directly provided by web pages. On the other hand, a “deep crawler” has the capability of retrieving all contents of web pages whether the hyperlinks are available or not. The process of a crawler’s work can be split into two major steps: the “retrieving” step and the “storage and indexing” step. As far as the retrieving step is concerned, even though a deep crawler can generate all URLs by itself, there should not be any unauthorized access at all. However, a crawler, either a general or deep one, will be charged with trespassing to chattel if it interferes in the website, whereas, in the “storage and indexing” step, owners’ copyright of web pages will in principle not be infringed upon by a crawler. Nevertheless, there are controversies about “cache”, which is provided by certain popular search engines, such as Google and Yahoo. Search engines may claim their fair use defense of “cache”, because one of the main goals of search engines is to improve access to information on the internet. Actually, we must consider many factors before we decide whether “cache” is fair or not, for instance, whether there is competition between the search engine and the retrieved web sites; and what percentage of the contents is retrieved by the crawler, etc. To resolve all possible legal conflicts between crawlers and web pages, among several economic resolutions, a newer and complete robot exclusion standard is required. In addition, a self-detecting mechanism adopted by web pages may be even a more powerful one.
期刊論文
1.林發立(20000800)。Internet的優勢與問題--從Ticketmaster Corp. v. Tickets.com Inc.「深入連結」一案談起。萬國法律,112,49-52。  延伸查詢new window
2.The Observer。The Observer。  new window
3.Samuelson, Pamela(2003)。Unsolicited Communications as Trespass?。Comm. ACM,46(10),15-20。  new window
4.Collins, Kevin Emerson(2006)。Cybertrespass and Trepass to Documents。Cleveland State Law Review,54,41-66。  new window
5.Hatcher, Michael、McDonnell, Jay、Ostfeld, Stacy(1999)。Computer Crimes。American Criminal Law Review,36(3),397-444。  new window
6.Tan, P.-N.、Kumar, V.(2002)。Discovery of web robot sessions based on their navigational patterns。Data Mining and Knowledge Discovery,6(1),9-35。  new window
會議論文
1.Ntoulas, Alexandros、Zerfos, Petros、Cho, Junghoo(2005)。Downloading Textual Hidden Web Content by Keyword Queries100-109。  new window
研究報告
1.Page, L.(1998)。The PageRank Citation Ranking: Brining Orderto the Web。  new window
圖書
1.羅明通(200208)。著作權法論。臺北:三民書局。  延伸查詢new window
2.Baeza-Yates, R.、Ribeiro-Neto, B.(1999)。Modern information retrieval。Addison-Wesley。  new window
3.Lemley, Mark A.(2009)。Software and Internet Law。Software and Internet Law。  new window
其他
1.郭寶明。網站搜索引擎提供者著作權侵權風險的法律分析。  延伸查詢new window
2.W3C。HTML 4.01specification。  new window
3.Lourenço, Anália G.,Belo, Orlando O.(2006)。Catching web crawlers in the act。  new window
4.Blake A. Field v. Google Inc.。  new window
5.(1927)。De Forest Radio Tel. & Tel. Co. v. United States。  new window
 
 
 
 
第一頁 上一頁 下一頁 最後一頁 top
QR Code
QRCODE