Lee Giles
C. Lee Giles is the David Reese Professor at the College of Information Sciences and Technology at the Pennsylvania State University. He is also Professor of Computer Science and Engineering, Professor of Supply Chain and Information Systems, and Director of the Intelligent Systems Research Laboratory. His graduate degrees are from the University of Michigan and the University of Arizona, and his undergraduate degrees are from Rhodes College and the University of Tennessee. His PhD is in optical sciences; his advisor was Harrison H. Barrett. His academic genealogy includes two Nobel laureates and prominent mathematicians.

Research

He has been associated with Princeton University, the University of Pennsylvania, Columbia University, the University of Pisa, the University of Trento and the University of Maryland, College Park. Previous positions were at NEC Research Institute (now NEC Labs), Princeton, NJ; the Air Force Research Laboratory; and the United States Naval Research Laboratory. He is best known for his work on the creation of novel scientific and academic search engines and digital libraries.

His research interests are in intelligent web and cyberinfrastructure tools, search engines and information retrieval, digital libraries, web services, knowledge and information management and extraction, machine learning, and information and data mining. In these areas he has over 300 publications, some of them in Nature, Science and the Proceedings of the National Academy of Sciences. His research is well cited, with an h-index of 49 according to Google Scholar and over 10,000 total citations as evidenced in CiteSeerX, ISI and Google Scholar.

He is a Fellow of the Association for Computing Machinery (ACM), IEEE and INNS.

CiteSeer and Search Engines

His early work with Steve Lawrence, published in Science and Nature, estimated the size of the web and showed that search engines indexed only a limited portion of it. This work also showed that the web had significantly matured and held a diversity of material and resources.

With Steve Lawrence and Kurt Bollacker, Giles was responsible for the creation in 1997 of automatic citation indexing and CiteSeer, a public academic search engine and digital library for computer and information science. Under his direction CiteSeer was moved to and is being maintained at the Pennsylvania State University. CiteSeer has been replaced by the Next Generation CiteSeer, CiteSeerX.

He is the director of the Next Generation CiteSeer project, CiteSeerX, also at the Pennsylvania State University. In addition, he was responsible for the creation of an academic business search engine and digital library, BizSeer (previously known as SmealSearch). With Isaac Councill, he created automatic acknowledgement indexing, permitting for the first time the automatic search and indexing of acknowledged entities in scholarly and research documents.

His recent research in collaboration with Professors Prasenjit Mitra, Karl Mueller, Barbara Garrison and James Kubicki has resulted in the development of a search engine and data portal for chemistry, ChemXSeer. With Yang Sun, he designed BotSeer, a novel search engine that searches and indexes robots.txt files on web sites. The Next Generation CiteSeer, CiteSeerX, came online in February 2008 with over one million articles indexed and, with active crawling, is now approaching 2 million. These new services are based on SeerSuite, a package of open source tools for searching and indexing academic documents and data.


The source of this article is Wikipedia, the free encyclopedia. The text of this article is licensed under the GFDL.
 