News

Yahoo! Search BOSS™

BOSS (Build your Own Search Service) is Yahoo!’s open search web services platform. The goal of BOSS is simple: to foster innovation in the search industry. Developers, start-ups, and large Internet companies can use BOSS to build and launch web-scale search products that utilize the entire Yahoo! Search index. BOSS gives you access to Yahoo!’s investments in crawling and indexing, ranking and relevancy algorithms, and powerful infrastructure. By combining your unique assets and ideas with our search technology assets, BOSS is a platform for the next generation of search innovation, serving hundreds of millions of users across the Web.

How Do I Get Started?

  1. Check out BOSS specs and mash-up examples below
  2. Review the documentation
  3. Get a BOSS Application ID

About the API

Overview

Search APIs are nothing new, but typically they’ve included rate limits, strict terms of service regarding the re-ordering and presentation of results, and provided little or no opportunity for monetization. These constraints have limited the innovation and commercial viability of new search solutions.

BOSS (Build your Own Search Service) is different – it’s a truly open API with as few rules and limitations as possible. With BOSS, developers and start-ups now have the technology and infrastructure to build next generation search solutions that can compete head-to-head with the principals in the search industry. BOSS will grow and evolve with a focus on providing additional functionality, tools, and data for developers.

  Previously Available with Yahoo! Search API Available with BOSS
Queries Per Day 5,000 Unlimited*
No Restrictions on Presentation no yes
Re-Ordering Allowed no yes
Blending of Proprietary and Yahoo! Search Content Allowed no yes
Monetization no Coming Soon!
White-Label Attribution Required yes

* BOSS offers developers unlimited daily queries, though Yahoo! reserves the right to limit unintended usage, such as automated querying by bots.

BOSS Screencast

BOSS developer Vik Singh walks through the basics of using the BOSS API and Mashup Framework, including two live examples.

Examples

hakia

hakia, a leading semantic search engine, uses Yahoo! Search BOSS to accelerate its semantic analysis of the Web by accessing the Yahoo! index’s vast amounts of web documents.

Me.dium Search

Me.dium combined the BOSS API with its insight into the real time surfing activity of the crowds to build a unique “Crowd-Powered” social search engine prototype.

Daylife

Daylife To-Go is a new self-service, hosted publishing platform from Daylife. Anyone can use this platform to automatically generate 100% customizable pages and widgets. Daylife To-Go uses the BOSS API platform to power its Web search module.

Cluuz

Cluuz generates easier to understand search results through patent pending semantic cluster graphs, image extraction, and tag clouds. The Cluuz analysis is performed in real-time on results returned from BOSS API.

Revenue Sharing

In the near future, we will launch a monetization platform enabling Yahoo! and partners to jointly participate in the economics of BOSS-powered search products. Either Yahoo! sponsored search integration, with certain implementation and exclusivity requirements, or potentially a payment model, will be required above a specified query threshold.

Terms of Use

Use of this service is subject to the BOSS API Terms of Use.

Learn More

BOSS Mashup Framework

The BOSS Mashup Framework is an experimental python library that provides developers with tools for mashing up the BOSS API with other third-party data sources.

BOSS Custom

BOSS Custom is an invite-only program focused on building highly scalable next generation search products. This program is designed for consumer web businesses with unique assets (such as extensive user data or novel technologies) that want to develop truly innovative search products using Yahoo!’s core search technology.

Frequently Asked Questions

Have questions about what BOSS can and can’t do? Check out the FAQ for a quick overview of the basics.

Interested in Working on BOSS?

Passionate about building open platforms? Join Yahoo! and work one of the top priorities in Search. Email us here bossfeedback@yahoo-inc.com and please include “BOSS Jobs” in the subject line.


近期做了几个网站的SEO效果明显提升

通过为期几个月的研究,潜心阅读了一些关于SEO的TIPS,发现了做好SEO必不可忽略几个主要要点:
1、网站代码的规范化。检查HTML代码语法、Javascript语法及CSS规范。
2、网站结构的明晰化或层次化。把握好内容的主次,突出重点。
3、有规律性的网站更新。
4、网站内容的时效价值。提供某一类对象最具有时效性的信息。
5、被动、主动的有效结合。提高网站的交互性、加强对外的交换信息。

MKT-CHINA 网站经过这两周的优化,近期许多与主要关键字的相关网页均在Yahoo、Google排名靠前。更是令人兴奋的事情是原来在ALEXA的排名为6,700,000,如今排到 1,997,808 在ALEXA网站这一周的排名已经达到596,089。
http://www.alexa.com/data/details/traffic_details/mkt-china.com

期盼下一个高峰…


Flash 的内容将被Google搜索?

我们知道做站点首页最好不要放flash页面,因为搜索引擎不认识,但是很多flash做的页面却很漂亮,让我们爱不释手,如何才能两全?

Adobe宣布与Google和Yahoo达成协议,优化的Adobe Flash Player将被添加到两家互联网公司的搜索引擎中。该工具将帮助搜索引擎更好地索引包括Flash或Shockwave Flash(SWF)的动态网页内容和富网络应用(RIA)。

尽管搜索引擎能够索引包括SWF文件的静态文字和链接,但由于经常变化,搜索引擎难以扑捉RIA和动态网页内容。

通过这三家公司合作,基于Flash的RIA,包括加载时的内容,不必由开发者修改而直接可以搜索到。Google已经在网站上增加了优化的Flash Player,其搜索引擎将从今天开始能够进入SWF文件。雅虎计划在Yahoo搜索的未来更新中增加该技术,但没有透露时间表。

Adobe Flash Player的高级主管Justin Everett-Church表示,“终端用户将获得更好的信息,更相关的结果和更好的体验。我们的目标是使世界上的每个搜索引擎都能够搜索SWF文件。”

搜索引擎专家Danny Sullivan在博客中写到,设计者和网页开发者长期以来被Flash中的内容排除在搜索引擎之外而困扰。“这个改变将使原来隐藏的信息解锁,搜索者获得更好的体验。
摘自:Google


Ping Google Blog Search以加速网页收录

尽管按照严格的SEO(搜索引擎优化)理论,让网站内的页面尽 可能多、尽可能快地被搜索引擎收录未必是最佳的选择,也不会在根本上改善网站整体的SEO效果——部分人甚至认为,过于强调让网页进入搜索引擎的索引数据 库,则会造成因存在大量的相关度不高或低价值的页面而影响网站整体优化的结果,特别是降低网站内重要页面的排名,毕竟,让网页出现在搜索引擎的索引数据库 并不是目的,在SERP(搜索结果页面)获得高排名才是努力方向,正因为此,许多人提出不应向或最好不要向Google网站管理员中心提交 sitemaps——但是,在大多数情况下,保证网站能够被搜索引擎正确索引仍然是SEO的基础。

此外,在Internet上抄袭或“转载”已成普遍行为的今天,让自己的网页能够在第一时间内被搜索引擎索引还有另一层意义:众所周知,虽然搜 索引擎在如何判断内容的原始出处可能有十分复杂的机制,但网页被其索引、收录的时间总是一个相当重要的判断因子——这也是我们关注这一问题的主要原因,近 来对Vista天地的抄袭已经到了肆无忌惮的程度——从这个角度,保证自己的原创内容页面能够先于转载网站被搜索引擎收录的重要性不言而喻。当然,这并不能保证原创页面最终出现在SERP的排名会高于转载网站,那取决于很多因素。

在Sitemaps标准走向统一后,通过向Google、Yahoo!提交sitemaps可以在一定程度上提高网页被索引、收录的速度,但结果并不理想,这倒也可以理解,从根本上说,Sitemaps的的主要目标在于提高搜索引擎索引页面的全面性而非时效性。那么,是不是还有别的途径?

最近,我们进行的一项测试表明,通过Google的Blog Search可以有效地实现这一点。自去年10月份,Google Blog Search便已开始支持Ping 服务, 即当Blog上增加了新内容或内容改变后可直接通知Google Blog Search,以帮助其索引、收录。测试中使用2个网站做比对,二者同时建立,均使用WordPress,域名均为新注册域名,在各搜索引擎中均不存在任 何记录,均没有任何外部链接,惟一的区别便是在其中之一,姑且称为网站A吧,中设置了Ping Google Blog Search,即在WordPress的“Update Services”设置中添加了“http://blogsearch.google.com/ping/RPC2”,而另一则保持WordPress的 默认设置即仅Ping “http://rpc.pingomatic.com/”。

测试结果相当令人震憾,网站A除第一篇文章用了一天时间才被收录外,其后均在一个小时内被Google blog search收录,并旋即出现在Google的主索引库中(即Google网页搜索),其中最快的一次用了不到一分钟,连文章中的错别字都未来得及修改便 已被Google缓存。而网站B,则直到半个月后才被Google收录了一个页面,差距甚远。——至于没有外部链接的网站B为何也能被索引,猜测可能缘于 Ping “http://rpc.pingomatic.com/”而在Technorati中出现了链接,不过因Technorati被封,未详细检查。

当然,由于没有外部链接,网站A出现在Google网页搜索中的页面均为“补充结果”,但相信这并不是什么大问题,随着内容的逐步充实,获得足够的链接,其自然会从补充结果中逃出

稍许令人郁闷的是,这仅对Google有效。至于如何提高搜索引擎索引、收录网页的速度,仍有待解决。

注1:在SEO、原创内容与独特内容的留言中,Cloudream认为“借助adsense可以让搜索引擎第一个抓取你的文章(发表完自己刷几次页面即可)”,但MediaBot不能收录、索引新的网页,而只在某种情况下对索引数据库中已存在页面进行更新是公认的事实,虽然我们也曾对Google的官方说法提出过质疑,但客观分析,Google在这件事上应该是没有说谎的,不然,同时维护两个功能相同或相近的索引库,在技术实现上存在很大难度,也有自找麻烦的嫌疑

注2:虽然我们的测试在WordPress下进行,不过,使用其他blog平台甚至传统的CMS,只要能提供RSS输出,均可通过Ping Google Blog Search——或自动Ping,如无相应设置也可手工Ping——加速网页的收录与索引


MKT BLOG

Welcome to MKT new website. now you can start blogging!


  • Share |
  • Latest Articles From MKT

    Copyright © 2010 MKT CHINA. All rights reserved.
    Powered by WordPress