Web Search Sciences is responsible for continually improving the search experience of Yahoo customers. Our scientists combine a diverse set of scientific disciplines, from information retrieval and machine learning to text and data mining, to create new algorithms and data models for crawling, indexing, query and content understanding, ranking, and presenting results. They work with the engineering and product groups to deliver innovation into Yahoo search, which serves millions of users across the world, thousands of times every second.
Disciplines & Areas of Expertise
Scientific Disciplines include Information Retrieval, Machine Learning, Data & Text Mining, and Natural Language Processing.
Areas of Expertise include Ranking, Classification, Information Extraction, and Summarization.
Team
- Aitao Chen
- Alex Cozzi
- A. Rochette
- Alpa Jain
- Amit Madaan
- Ana-Maria Popescu
- Anlei Dong
- Anne Zhang
- A. Raghuveer
- Ashwin Tengli
- Barbara Poblete
- Ben Shahshahani
- Benoit Dumoulin
- Bill Pentney
- Bo Long
- Byron Dom
- Changsung Kang
- Charu Tiwari
- Chi-Hoon Lee
- Ciya Liao
- Daniel Boies
- Daniel Dechelotte
- Danny Levinson
- D. Farrar
- Deepa Paranjpe
- Dun Liu
- Emre Velipasaoglu
- Eric Crestan
- Fan Li
- Fernando Diaz
- Flavian Vasile
- Fuchun Peng
- George Mills
- Georges Dupret
- Gilad Mishne
- Gilbert Leung
- Harsha Khattri
- Hugues Bouchard
- Huihsin Tseng
- James Shaw
- Jan Pfeifer
- J. Crespo
- J. Paiement
- J. Langlois
- Jerry Ye
- Jianzhang He
- Jing Bai
- John Blackmer
- Jyh-Herng Chow
- Karolina Buchner
- Kimberly Farrell
- K. Tsioutsiouliklis
- Krishnan S. Kumar
- Lan Nie
- Larry Lai
- Lei Duan
- Longbin Chen
- M. Pennacchiotti
- Mike Ching
- Mingrui Wu
- Nadia Ghamrawi
- N. Sadagopan
- Nicolas Stroppa
- Nicolas Torzec
- Pankaj Gulhane
- Patrick Pantel
- Pavel Dmitriev
- Pranam Kolari
- Priyanka Garg
- Rao Shen
- Remi Kwan
- Richard Kasperski
- Romain Vinot
- Rosie Jones
- Ruiqiang Zhang
- Rupesh Mehta
- S.R. Jeyashankher
- S. Bhuramal Satpal
- Scott Gaffney
- Seokkyung Chung
- Shailesh Kumar
- Shihao Ji
- Siva Gurumurthy
- Soo-Min Pantel
- Srihari Reddy
- Srinivas Vadrevu
- S. H. Sengamedu
- Su Chan
- Su-Lin Wu
- Subhajit Sanyal
- S. Lamkhede
- Suju Rajan
- Taesup Moon
- Tanuja Bompada
- Tina Liu
- Uma Sawant
- Umut Ozertem
- Xavier Dupre
- Xiaofeng He
- Xin Li
- Xing Wei
- Yi Chang
- Yiping Zhou
- Yoshiyuki Inagaki
- Youssef Billawala
- Yumao Lu
- Zhaohui Zheng
- Zhenzhen Kou
- Zhigang Hua
- Ziming Zhuang
Publications
Shape Classification Through Structured Learning of Matching Measures, CVPR 2009, 06/2009, Miami, FL, (2009)
Web Search Result Summarization: Title Selection Algorithms and User Satisfaction, Conference on Information and Knowledge Management, November, 2009, Hong Kong, (2009)
Stochastic Gradient Boosting Distributed Decision Trees, The 18th ACM Conference on Information and Knowledge Management (CIKM), 11/2009, Hong Kong, (2009)
Information Theoretic Regularization for Semi-Supervised Boosting, ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 06/2009, (2009)
Improving Web Page Classification by Label-propagation over Click Graphs, Conference on Information and Knowledge Management (CIKM-2009), 11/2009, Hong Kong, (2009)
Threshold selection for web-page classification with highly skewed class distribution, World Wide Web 2009, Madrid, Spain, p.1081-1082, (2009)
The Dynamic Retrieval of XML Elements, Advances in XML Information Retrieval and Evaluation, Volume 39, (2006)
A dual coordinate descent method for large-scale linear SVM, ICML 2008, 2008, (2008)
Semi-Supervised Classification Using Sparse Gaussian Process Regression, IJCAI 2009, 2009, (2009)
Network Flow for Collaborative Ranking, The 10th European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD), 09/2006, Berlin, Germany, (2006)
Re-Ranking Search Results Using Query Logs, The ACM 15th Conference on Information and Knowledge Management (CIKM), 11/2006, Arlington, VA, (2006)
Collaboration Over Time: Characterizing and Modeling Network Evolution, The First ACM International Conference on Web Search and Data Mining (WSDM), 02/2008, Stanford, CA, (2008)
Real-time Automatic Tag Recommendation, The 31st Annual International ACM SIGIR Conference (SIGIR), 07/2008, Singapore, (2008)
Towards Click-based Models of Geographic Interests in Web Search, ACM/IEEE/WIC International Conference on Web Intelligence (WI), 12/2008, Sydney, Australia, (2008)
Joint categorization of queries and clips for web-based video search, Multimedia Information Retrieval 2006, (2006)
Internet-scale collection of human-reviewed data, WWW 2007, (2007)
Incorporating query difference for learning retrieval functions in world wide web search, CIKM 2006, (2006)
A Regression Framework for Learning ranking functions using relative relevance judgments, SIGIR 2007, (2007)
A General Boosting Method and its Application to Learning Ranking Functions for Web Search, NIPS 2008, (2008)
Query-Level Learning to Rank Using Isotonic Regression, Proceedings of the 46th Annual Allerton Conference on Communication, Control and Computing 2008, (2008)
Enhancing Topical Ranking with Preferences from Click-Through Data, SIGIR 2009 poster, (2009)
Search Engine Adaptation by Feedback Control Adjustment for Time-sensitive Query, NAACL-HLT 2009, (2009)
Web Search Engine Metrics: Direct Metrics to Measure User Satisfaction, WWW (2009), Madrid, Spain, (2009)
Search result reranking by feedback control adjustment for time-sensitive query, HLT-NAACL 2009, (2009)
Document Preprocessing For Naive Bayes Classification and Clustering with Mixture of Multinomials, Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining (KDD-2004), (2004)
