新书推介:《语义网技术体系》
作者:瞿裕忠,胡伟,程龚
   XML论坛     W3CHINA.ORG讨论区     计算机科学论坛     SOAChina论坛     Blog     开放翻译计划     新浪微博  
 
  • 首页
  • 登录
  • 注册
  • 软件下载
  • 资料下载
  • 核心成员
  • 帮助
  •   Add to Google

    >> Web服务(Web Services,WS), 语义Web服务(Semantic Web Services, SWS)讨论区: WSDL, SOAP, UDDI, DAML-S, OWL-S, SWSF, SWSL, WSMO, WSML,BPEL, BPEL4WS, WSFL, WS-*,REST, PSL, Pi-calculus(Pi演算), Petri-net,WSRF,
    [返回] 中文XML论坛 - 专业的XML技术讨论区W3CHINA.ORG讨论区 - Web新技术讨论『 Web Services & Semantic Web Services 』 → [讨论]语义搜索技术 查看新帖用户列表

      发表一个新主题  发表一个新投票  回复主题  (订阅本版) 您是本帖的第 515127 个阅读者浏览上一篇主题  刷新本主题   树形显示贴子 浏览下一篇主题
     * 贴子主题: [讨论]语义搜索技术 举报  打印  推荐  IE收藏夹 
       本主题类别: 信息检索 | Semantic Web    
     whfcarter 帅哥哟,离线,有人找我吗?
      
      
      
      威望:9
      等级:计算机学士学位(贵宾)
      文章:143
      积分:2145
      门派:XML.ORG.CN
      注册:2005/3/8

    姓名:(无权查看)
    城市:(无权查看)
    院校:(无权查看)
    给whfcarter发送一个短消息 把whfcarter加入好友 查看whfcarter的个人资料 搜索whfcarter在『 Web Services & Semantic Web Services 』的所有贴子 引用回复这个贴子 回复这个贴子 查看whfcarter的博客141
    发贴心情 

    to  Humphrey: 放假期间没有关注论坛,所以没有及时回答你的问题。
    Amit的全称是sheth amit, 他现在是Wright University的教授,他是ISWC 2006的local chair, ISWC 2008的Program Chair以及IJSWIS的chief-in-editor。你可以通过DBLP查到他的相关论文,应该是WWW 2007的(如果没有记错的话)。
    对于"查询约束",我没有听说过,英语是"query constraint"还是"search constraint"?
    同时,"查询扩展"和你所说的约束应该不是逆推的关系,这里我想就我对于查询扩展的理解简单说明一下,查询扩展故名思义就是扩展原有的查询,最简单的是google的relevance feedback,当然在Semantic Web中最一开始得应用是根据一个thesarus或者taxonomy对查询(最常见是关键字)进行disambiguation或者指定sense或context,从而增加搜索的recall (通过扩展获得的新的查询)获得原先找不到的相关结果。当然,在我原先的帖子中提到query interpretation,即从一种查询语义到另一种查询语义的转换。例如对于Semantic Web查询引擎要求输入SPARQL或者RQL等formal query,但是为了提高系统的受用群体或者改善易用性,我们可以支持natural language或keyword,将这些转换为formal query的过程可以看作是query rewritting,对于keyword 到sparql的转换,你补全了很多原本Keyword中缺失的语义信息等,可以看作是一种expansion,而这种应用可以提高search的precision。当然这种你可以看作是对于原有查询增加新的约束条件。

    另外和你说的相关的还有faceted browsing或者叫exploatory search,最简单的就是很多e-business的购物网站,提供各种product的属性等来约束搜索结果,同时,query relaxing是另外一个相关的topic,他的想法是根据用户的preference或者当前的search context对于某些搜索条件添加不同的权重,或放宽搜索约束条件。

    希望这些简单的解释对你有帮助。

    点击查看用户来源及管理<br>发贴IP:*.*.*.* 2009/1/4 15:12:00
     
     wangjp0702 帅哥哟,离线,有人找我吗?
      
      
      等级:大一新生
      文章:0
      积分:58
      门派:XML.ORG.CN
      注册:2007/10/19

    姓名:(无权查看)
    城市:(无权查看)
    院校:(无权查看)
    给wangjp0702发送一个短消息 把wangjp0702加入好友 查看wangjp0702的个人资料 搜索wangjp0702在『 Web Services & Semantic Web Services 』的所有贴子 引用回复这个贴子 回复这个贴子 查看wangjp0702的博客142
    发贴心情 
    真正的语义网应用还没到来
    点击查看用户来源及管理<br>发贴IP:*.*.*.* 2009/1/7 14:06:00
     
     Humphrey 帅哥哟,离线,有人找我吗?狮子座1981-7-23
      
      
      威望:1
      等级:研二(搞定了DL,再搞定F-Logic!)
      文章:937
      积分:5743
      门派:W3CHINA.ORG
      注册:2008/3/12

    姓名:(无权查看)
    城市:(无权查看)
    院校:(无权查看)
    给Humphrey发送一个短消息 把Humphrey加入好友 查看Humphrey的个人资料 搜索Humphrey在『 Web Services & Semantic Web Services 』的所有贴子 引用回复这个贴子 回复这个贴子 查看Humphrey的博客143
    发贴心情 
    “语义搜索”,或者说“语义搜索引擎”的出现是在语义网概念提出之前呢,还是在语义网概念提出之后呢?虽然我看了一些资料,试图找出结果,但是至今也没有获得明确的答案,只好拜托诸位前辈指教。虽然问题比较原始,但是似乎不是很容易能有理有据的说明的。

    ----------------------------------------------
    鸿丰

    点击查看用户来源及管理<br>发贴IP:*.*.*.* 2009/1/8 21:23:00
     
     whfcarter 帅哥哟,离线,有人找我吗?
      
      
      
      威望:9
      等级:计算机学士学位(贵宾)
      文章:143
      积分:2145
      门派:XML.ORG.CN
      注册:2005/3/8

    姓名:(无权查看)
    城市:(无权查看)
    院校:(无权查看)
    给whfcarter发送一个短消息 把whfcarter加入好友 查看whfcarter的个人资料 搜索whfcarter在『 Web Services & Semantic Web Services 』的所有贴子 引用回复这个贴子 回复这个贴子 查看whfcarter的博客144
    发贴心情 
    语义搜索,就是semantic search,指更加智能的搜索引擎,这是所有搜索引擎的一致目标。他代表支持用户表达复杂的查询需求,精确定位并给出答案。这个概念在Semantic Web之前就已经出现。大家都知道,搜索引擎的核心技术是信息检索(Information Retrieval),这最早在digital library中得到应用,并且在早期的搜索中主要使用基于逻辑表示的boolean匹配。在近年中,随着自然语言技术的成熟以及现有syntax-based技术的缺陷,在一些企业搜索应用(如IBM)或者站点搜索(如Wikipedia, Freebase)甚至垂直搜索(如专家搜索,机票搜索等)中,语义技术(不仅仅局限于Semantic Web technology)被越来越多的提到和应用。PowerSet被称为成功的semantic search engine (主要基于自然语言处理的),之后被微软高价收购。而很多基于metadata的语义搜索引擎原型也被提出,其中包括Yahoo的microsearch和searchMonkey等。我觉得Semantic Web的出现使得语义搜索更加流行也更加受到关注。但同时也使得semantic search这个词更加具有歧义了,:)
    点击查看用户来源及管理<br>发贴IP:*.*.*.* 2009/1/9 0:47:00
     
     viaphone 帅哥哟,离线,有人找我吗?
      
      
      等级:大三(要不要学学XML呢?)
      文章:149
      积分:674
      门派:XML.ORG.CN
      注册:2005/3/15

    姓名:(无权查看)
    城市:(无权查看)
    院校:(无权查看)
    给viaphone发送一个短消息 把viaphone加入好友 查看viaphone的个人资料 搜索viaphone在『 Web Services & Semantic Web Services 』的所有贴子 引用回复这个贴子 回复这个贴子 查看viaphone的博客145
    发贴心情 
    现在问题是网络里没有有效的充足的语义知识基础设施,sematic search 还只能做为传统IR的一个补充部份来做。也许还只某些具体的领域,或具体的环节改进查询结果。要真正实现所谓语义查询估计还有时日。即使真正这一天到来,传统IR我相信依旧会在其中拌演相当重要的角色。
    点击查看用户来源及管理<br>发贴IP:*.*.*.* 2009/1/10 20:37:00
     
     Avansky 帅哥哟,离线,有人找我吗?
      
      
      威望:1
      等级:大三(研究MFC有点眉目了!)
      文章:103
      积分:675
      门派:W3CHINA.ORG
      注册:2008/12/3

    姓名:(无权查看)
    城市:(无权查看)
    院校:(无权查看)
    给Avansky发送一个短消息 把Avansky加入好友 查看Avansky的个人资料 搜索Avansky在『 Web Services & Semantic Web Services 』的所有贴子 引用回复这个贴子 回复这个贴子 查看Avansky的博客146
    发贴心情 
    2008十大语义网产品

    Top 10 Semantic Web Products of 2008
    Written by Richard MacManus / December 2, 2008 9:57 AM

    In 2008 we saw the Semantic Web gain traction, giving us plenty of choice when selecting the 10 best Semantic Web products of 2008.
    This is the first in a series of posts we'll publish over December, listing our choices for the top web products of the year. Then at the end of December, we'll post a Top 100 list - which we'll be promoting over 2009 and opening up at some point for public voting. Without further ado, let's jump into the top 10 Semantic Web products of 2008.
    Earlier this month we posted an update to 10 Semantic Web applications that we have been tracking for a year now. Some of those make this list, as well as some from our follow-up post 10 More Semantic Apps to Watch. We also have a couple of other products in this list, which for one reason or another didn't get mentioned in our watch-lists.
    You may disagree with our selections, so do tell us in the comments what you think.
    Note: the products listed below are in no particular order
    1. Yahoo! SearchMonkey
    In May this year Yahoo! launched an open developer platform for search called SearchMonkey. Yahoo hasn't had the happiest of years, but its willingness to innovate in search is to be commended. As we reported at the Web 2.0 Expo in April, SearchMonkey is a component of a major overhaul at Yahoo! across all of its properties to "rewire" for the social graph and data portability. SearchMonkey allows developers to build applications on top of Yahoo! search, including allowing site owners to share structured data with Yahoo!, using semantic markup (microformats, RDF), standardized XML feeds, APIs (OpenSearch or other web services), and page extraction.
    We think this is the best use of Semantic Web by an Internet bigco this year. So for that reason SearchMonkey makes our top 10 list. Related: The Story of SearchMonkey.
    2. Powerset (acquired by Microsoft in '08)
    Powerset (see our initial coverage here and here) is a natural language search engine. It's fair to say that Powerset has had a great 2008, having been acquired by Microsoft in July this year.
    At the time of the acquisition, Powerset said that it needed a bigger partner to expand its product beyond its current state of only searching Wikipedia - something we had speculated about when the rumors of the acquisition first appeared. In its own statement, Microsoft stressed how useful Powerset's technology will be for improving Microsoft's own search products and to "take Search to the next level." In our analysis of the deal, we noted that it was a "bold play requiring exact execution" by Microsoft.
    3. Open Calais (Thomson Reuters)
    At the end of 2007, ClearForest had been recently acquired by Reuters and at that point it had a Web Service and a Firefox extension. What a change a year brings! ClearForest went on to release Calais, a toolkit of products that enable users to incorporate semantic functionality within their blog, content management system, website or application.
    Since launching the Open Calais API early this year, over 6,000 developers have registered with it and the service is doing more than 1 million transactions a day. Version 3.0 was released earlier this month and version 4 is expected by January 09.
    4. Dapper MashupAds
    In November we wrote about the recent improvement in Dapper MashupAds, a product we first spotted over a year ago. The idea is that publishers can tell Dapper: this is the place on my web page where the title of a movie will appear, now serve up a banner ad that's related to whatever movie this page happens to be about. That could be movies, books, travel destinations - anything. We remarked that the UI for this has grown much more sophisticated in the past year.
    The company believes that its new ad network will provide monetary incentive for publishers to have their websites marked up semantically. We think this has plenty of promise, so it makes our year-end list.
    5. Hakia
    Hakia is a search engine focusing on natural language processing methods to try and deliver 'meaningful' search results. Hakia attempts to analyze the concept of a search query, in particular by doing sentence analysis. Over the past year Hakia has been busy extending its reach - licensing its proprietary OntoSem technology to other companies in March and announcing a Semantic API in June. It was also one of the first companies to utilize Yahoo! BOSS, by integrating their semantic parsing with the Yahoo! search index.
    We think Hakia has made good progress getting its technology into the hands of third parties and making use of Yahoo's broader index, so for that reason it's among our top 10 for the year.
    6. TripIt
    Tripit is an app that manages your travel planning. With TripIt, you forward incoming bookings to plans@tripit.com and the system manages the rest.
    Over the past year TripIt has continued to iterate on its feature set - introducing LinkedIn integration, better mobile functionality, more social networking features, and other goodies. In short, it's user experience continues to rock!
    7. BooRah
    BooRah is a restaurant review site that we first reviewed earlier this year and has come on in leaps and bounds over 2008. BooRah uses semantic analysis and natural language processing to aggregate reviews from food blogs. Because of this, BooRah can recognize praise and criticism in these reviews and then rates restaurants accordingly. BooRah also gathers reviews from Citysearch, Tripadvisor and other large review sites.
    BooRah also announced last month the availability of an API that will allow other web sites and businesses to offer online reviews and ratings from BooRah to their customers. The API will surface most of BooRah's data about a given restaurant, including ratings, menus, discounts, and coupons.
    8. AdaptiveBlue
    Disclosure: AdaptiveBlue's founder Alex Iskold is a feature writer at RWW.
    AdaptiveBlue are makers of the Firefox plugin, BlueOrganizer. As we wrote in January this year, the basic idea behind BlueOrganizer is that it gives you added information about webpages you visit and offers useful links based on the subject matter.
    Over the past year the company has been working on a new product, called Glue. Launched last month, Glue is a more social networking oriented version of BlueOrganizer - it connects you to your friends based around things like books, music, movies, stars, artists, stocks, wine, restaurants, and more. We think the company has diversified smartly in 2008, by integrating social networking and mobile functionality into its products.
    9. Zemanta
    Zemanta is a blogging tool which harnesses semantic technology to add relevant content to your posts. While it didn't make either of our 'Semantic Apps to Watch' lists in November, a number of commenters pointed it out as something they use. In September we covered a major upgrade to Zemanta's service, allowing users to specify the sources they want to see in the suggestions list that Zemanta provides. Users can now incorporate their own social networks, RSS feeds, and photos into their blog posts. As we noted, this makes Zemanta a lot more appealing to established bloggers who are in less need of suggestions and more in need of automation.
    Zemanta's API is also being used by startups, including semantic bookmarking service Faviki - which we mentioned in our second Watch-list. So all up, we think Zemanta has done enough this year to be included in our top 10 list.
    10. UpTake
    Semantic search startup UpTake (formerly Kango) aims to make the process of booking travel online easier. In our review in May, we explained that UpTake is a vertical search engine that has assembled what it says is the largest database of US hotels and activities - over 400,000 of them - from more than 1,000 different travel sites. Using a top-down approach, UpTake looks at its database of over 20 million reviews, opinions, and descriptions of hotels and activities in the US and semantically extracts information about those destinations.
    And now please let us know in the comments what you think of our selections. Do you think we've picked the best 10 Semantic Web products of the year?

    ----------------------------------------------
    本人的论文是基于语义网的搜索引擎技术。
    望同路人多交流!
    Email:avan1017@163.com

    点击查看用户来源及管理<br>发贴IP:*.*.*.* 2009/1/11 22:10:00
     
     Avansky 帅哥哟,离线,有人找我吗?
      
      
      威望:1
      等级:大三(研究MFC有点眉目了!)
      文章:103
      积分:675
      门派:W3CHINA.ORG
      注册:2008/12/3

    姓名:(无权查看)
    城市:(无权查看)
    院校:(无权查看)
    给Avansky发送一个短消息 把Avansky加入好友 查看Avansky的个人资料 搜索Avansky在『 Web Services & Semantic Web Services 』的所有贴子 引用回复这个贴子 回复这个贴子 查看Avansky的博客147
    发贴心情 
    大家看看这个,或许对你们有点帮助

    ----------------------------------------------
    本人的论文是基于语义网的搜索引擎技术。
    望同路人多交流!
    Email:avan1017@163.com

    点击查看用户来源及管理<br>发贴IP:*.*.*.* 2009/1/11 22:10:00
     
     Humphrey 帅哥哟,离线,有人找我吗?狮子座1981-7-23
      
      
      威望:1
      等级:研二(搞定了DL,再搞定F-Logic!)
      文章:937
      积分:5743
      门派:W3CHINA.ORG
      注册:2008/3/12

    姓名:(无权查看)
    城市:(无权查看)
    院校:(无权查看)
    给Humphrey发送一个短消息 把Humphrey加入好友 查看Humphrey的个人资料 搜索Humphrey在『 Web Services & Semantic Web Services 』的所有贴子 引用回复这个贴子 回复这个贴子 查看Humphrey的博客148
    发贴心情 
    whfcarter同志的意思是,语义搜索的产生要比语义网概念的提出要早,而垂直搜索、元搜索即属于早期语义搜索范畴。我的理解没错吧?
    自己感觉:一方面,回答这种问题需要对整个语义搜索领域的发展史比较熟悉,或者占有一些较明确的直接相关资源;另一方面,这种“先有鸡还是先有蛋”的问题的答案似乎也依赖于对“语义搜索”概念的界定。
    感谢whfcarter同志的热心解答,谢谢。

    ----------------------------------------------
    鸿丰

    点击查看用户来源及管理<br>发贴IP:*.*.*.* 2009/1/12 8:42:00
     
     Humphrey 帅哥哟,离线,有人找我吗?狮子座1981-7-23
      
      
      威望:1
      等级:研二(搞定了DL,再搞定F-Logic!)
      文章:937
      积分:5743
      门派:W3CHINA.ORG
      注册:2008/3/12

    姓名:(无权查看)
    城市:(无权查看)
    院校:(无权查看)
    给Humphrey发送一个短消息 把Humphrey加入好友 查看Humphrey的个人资料 搜索Humphrey在『 Web Services & Semantic Web Services 』的所有贴子 引用回复这个贴子 回复这个贴子 查看Humphrey的博客149
    发贴心情 
    最近一段时间被一些家庭琐事所困,没能抽出时间到论坛参与讨论。不过相关的问题还是一直都在考虑着的,就是不能上网查资料,也不能写什么,好闷啊!
    言归正传,语义搜索事实上应该是可以用于全网络的,就是包括万维网、互联网和内部网的。前一段我们主要是概略地讨论语义搜索的,没针对哪一种网络。不过语义搜索总要有一个对象,所以我想先从针对语义网的语义搜索开始和大家一块儿讨论。
    我粗略地看了一些东西,总体感觉语义网的语义搜索目前比较窄,或许是和语义网的规模有关。凡是成型的语义网语义搜索引擎无非都是对本体库或者对社会网络(不知道这样称谓是否合适,就是类似FOAF的那种结构)进行简单搜索;或基于关键词,或基于简单描述列表。甚至给我最深刻的印象就像一个在线词典。不知道,对于语义网的语义搜索给各位留下什么印象?欢迎诸位跟帖讨论。
    感谢大家对本论题的关注,祝大家新春愉快!

    ----------------------------------------------
    鸿丰

    点击查看用户来源及管理<br>发贴IP:*.*.*.* 2009/2/4 12:22:00
     
     Humphrey 帅哥哟,离线,有人找我吗?狮子座1981-7-23
      
      
      威望:1
      等级:研二(搞定了DL,再搞定F-Logic!)
      文章:937
      积分:5743
      门派:W3CHINA.ORG
      注册:2008/3/12

    姓名:(无权查看)
    城市:(无权查看)
    院校:(无权查看)
    给Humphrey发送一个短消息 把Humphrey加入好友 查看Humphrey的个人资料 搜索Humphrey在『 Web Services & Semantic Web Services 』的所有贴子 引用回复这个贴子 回复这个贴子 查看Humphrey的博客150
    发贴心情 
    补充几个由whfcarter同志发表的与语义搜索相关的话题,以资参考:
    Our vision of semantic Web search
    对2007年到2008年之间whfcarter同志所在研究组的工作总结
    http://bbs.w3china.org/dispbbs.asp?boardID=2&ID=71338
    Evolving Web, Evolving Search
    whfcarter同志对语义搜索的看法
    http://bbs.w3china.org/dispbbs.asp?boardID=2&ID=71295
    whfcarter同志发布的其他搜索领域相关材料:
    1st Call for Papers SEMSEARCH'09
    一篇征稿启事,但是对热点研究方向有所涉及
    http://bbs.w3china.org/dispbbs.asp?boardID=2&ID=71208
    Google Researcher Targets Web's Structured Data
    来自著名计算机刊物《微电脑世界》的一篇文章,末尾附有whfcarter同志的简要评论
    http://bbs.w3china.org/dispbbs.asp?boardID=2&ID=71707

    ----------------------------------------------
    鸿丰

    点击查看用户来源及管理<br>发贴IP:*.*.*.* 2009/2/24 19:07:00
     
     GoogleAdSense狮子座1981-7-23
      
      
      等级:大一新生
      文章:1
      积分:50
      门派:无门无派
      院校:未填写
      注册:2007-01-01
    给Google AdSense发送一个短消息 把Google AdSense加入好友 查看Google AdSense的个人资料 搜索Google AdSense在『 Web Services & Semantic Web Services 』的所有贴子 访问Google AdSense的主页 引用回复这个贴子 回复这个贴子 查看Google AdSense的博客广告
    2024/5/8 5:20:52

    本主题贴数164,分页:[1] ... [12] [13] [14] [15] [16] [17]

    管理选项修改tag | 锁定 | 解锁 | 提升 | 删除 | 移动 | 固顶 | 总固顶 | 奖励 | 惩罚 | 发布公告
    W3C Contributing Supporter! W 3 C h i n a ( since 2003 ) 旗 下 站 点
    苏ICP备05006046号《全国人大常委会关于维护互联网安全的决定》《计算机信息网络国际联网安全保护管理办法》
    3,296.875ms