
The Intelligent Web: Search, Smart Algorithms, and Big Data


DOCUMENT INFORMATION

Basic information

Pages: 320
File size: 1.54 MB

Content

THE INTELLIGENT WEB

The Intelligent Web: Search, Smart Algorithms, and Big Data

GAUTAM SHROFF

Great Clarendon Street, Oxford, OX2 6DP, United Kingdom

Oxford University Press is a department of the University of Oxford. It furthers the University’s objective of excellence in research, scholarship, and education by publishing worldwide. Oxford is a registered trade mark of Oxford University Press in the UK and in certain other countries.

© Gautam Shroff 2013

The moral rights of the author have been asserted.

First Edition published in 2013
Impression:

All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, without the prior permission in writing of Oxford University Press, or as expressly permitted by law, by licence or under terms agreed with the appropriate reprographics rights organization. Enquiries concerning reproduction outside the scope of the above should be sent to the Rights Department, Oxford University Press, at the address above.

You must not circulate this work in any other form and you must impose this same condition on any acquirer.

Published in the United States of America by Oxford University Press, 198 Madison Avenue, New York, NY 10016, United States of America

British Library Cataloguing in Publication Data: Data available
Library of Congress Control Number: 2013938816
ISBN 978-0-19-964671-5

Printed in Italy by L.E.G.O. S.p.A.-Lavis TN

Links to third party websites are provided by Oxford in good faith and for information only. Oxford disclaims any responsibility for the materials contained in any third party website referenced in this work.

To my late father, who I suspect would have enjoyed this book the most

ACKNOWLEDGEMENTS

Many people have contributed to my thinking and encouraged me while writing this book. But there are a few to whom I owe special thanks. First, to V. S. Subrahmanian, for reviewing the chapters as they came along and supporting my endeavour with encouraging words. I am also especially grateful to Patrick Winston and Pentti Kanerva for sparing the time to speak with me and share their thoughts on the evolution and future of AI.

Equally important has been the support of my family. My wife Brinda, daughter Selena, and son Ahan: many thanks for tolerating my preoccupation on numerous weekends and evenings that kept me away from you. I must also thank my mother for enthusiastically reading many of the chapters, which gave me some confidence that they were accessible to someone not at all familiar with computing.

Last but not least, I would like to thank my editor Latha Menon, for her careful and exhaustive reviews, and for shepherding this book through the publication process.

CONTENTS

List of Figures
Prologue: Potential

Look
  The MEMEX Reloaded
  Inside a Search Engine
  Google and the Mind
  Deeper and Darker

Listen
  Shannon and Advertising
  The Penny Clicks
  Statistics of Text
  Turing in Reverse
  Language and Statistics
  Language and Meaning
  Sentiment and Intent

Learn
  Learning to Label
  Limits of Labelling
  Rules and Facts
  Collaborative Filtering
  Random Hashing
  Latent Features
  Learning Facts from Text
  Learning vs ‘Knowing’

Connect
  Mechanical Logic
  The Semantic Web
  Limits of Logic
  Description and Resolution
  Belief albeit Uncertain
  Collective Reasoning

Predict
  Statistical Forecasting
  Neural Networks
  Predictive Analytics
  Sparse Memories
  Sequence Memory
  Deep Beliefs
  Network Science

Correct
  Running on Autopilot
  Feedback Control
  Making Plans
  Flocks and Swarms
  Problem Solving
  Ants at Work
  Darwin’s Ghost
  Intelligent Systems

Epilogue: Purpose
References
Index

LIST OF FIGURES

  Turing’s proof
  Pong games with eye-gaze tracking
  Neuron: dendrites, axon, and synapses
  Minutiae (fingerprint)
  Face painting
  Navigating a car park
  Eight queens puzzle

... world wide web. In other words, rather than ‘traditional’ artificial intelligence, the successes we are witnessing are better described as those of ‘web intelligence’ arising... scale

***

The web is believed to have well over a trillion web pages, of which at least 50 billion have been catalogued and indexed by search engines such as Google, making them searchable by...

Date posted: 5 March 2019, 08:32
