     * Topic: Frank van Harmelen comments on "The Unreasonable Effectiveness of Data" (published in IEEE Intelligent Systems, March/April 2009)
       Category: Semantic Web
     Posted by admin (W3China webmaster)

    http://blog.larkc.eu/?p=1331

    The unreasonable effectiveness of fake controversies

    (by Frank van Harmelen)

    The Halevy, Norvig & Pereira paper “[URL=http://www.computer.org/portal/cms_docs_intelligent/intelligent/homepage/2009/x2exp.pdf]The Unreasonable Effectiveness of Data[/URL]” (published in IEEE Intelligent Systems, and posted on the [URL=http://googleresearch.blogspot.com/2009/03/unreasonable-effectiveness-of-data.html]Google Blog[/URL]) has been much discussed in recent days.

    I had my finger on the trigger for a response when I stumbled across [URL=http://www.betaversion.org/%7Estefano/linotype/news/275/]Stefano Mazzocchi’s blog[/URL], which phrased my opinion of the piece exactly: Halevy (first author) makes his case by creating a controversy that isn’t really there. He pits a symbolic/structural approach to semantics against a statistical one, and makes it seem as if the two are mutually exclusive. Obviously that isn’t the case: it’s great if statistical analysis of humongous datasets can unearth important relationships, and I can see no reason why the results of such work could not then be used in structural/symbolic approaches. This is (potentially) a mutually beneficial relationship, not an antagonistic one.

    As Mazzocchi rightly points out, it’s rather ironic that the entire Google empire is built on ... guess what ... analysing a structural/symbolic network (namely the HREF links between webpages), which they then very successfully combine with all kinds of statistical measures. If this combination of structural and statistical approaches works for Web 1.0, why suddenly create this fake controversy when we talk about Web 3.0?

    The most fruitful way forward would be to investigate how the ace work that Halevy et al. are doing on statistical methods with huge datasets can be combined with approaches that exploit the explicit structure available in so many large datasets.

    (And as an aside: caricaturing the Semantic Web as being about “tagging web-pages” is resorting to a rhetorical device known as “[URL=http://en.wikipedia.org/wiki/Straw_man]setting up a straw man[/URL]”. Never a strong sign. I’m sure Alon et al. are familiar with Linked Open Data (LOD), but there is no mention of it in their paper…)

    To finish up, here are some quotes from [URL=http://www.betaversion.org/%7Estefano/linotype/news/275/]Stefano Mazzocchi’s excellent blog entry[/URL]:

        What upset me about that paper is not how they say “oh sure, structure is great, but look over here: there is a goldmine in all the sand” (which is something I fully resonate with) but they phrased it as a fight, deterministic vs. statistical, trying to convince people that adding structure is not the way to go, it’s basically a global waste of research resources

        ….

        Google uses all sorts of techniques, statistical and not, and they are very good at mixing them together, but that’s not what you get from the paper. What you get is an undertone of criticism for those who believe that what’s needed is a lot more explicit structure

        ….

        this confrontational undertone is coming across at best as hypocritical and at worst as toxic, especially when coming from the [URL=http://research.google.com/]research heads[/URL] of an [URL=http://www.google.com/]entity[/URL] that so much benefited from non-statistical amplification of minor distributed increases in data structure.

    Amen to that.


    Attachment: [URL=http://www.computer.org/portal/cms_docs_intelligent/intelligent/homepage/2009/x2exp.pdf]Download “The Unreasonable Effectiveness of Data”[/URL]


    Posted: 2009/4/2 11:33:00
     
     Posted by Humphrey
    A dedicated discussion on the Semantic Web? More good reading to look forward to.

    ----------------------------------------------
    鸿丰

    Posted: 2009/4/2 15:26:00
     