Amazon.com
Of all the tasks programmers are asked to perform, storing, compressing, and retrieving information are some of the most challenging--and critical to many applications. Managing Gigabytes: Compressing and Indexing Documents and Images is a treasure trove of theory, practical illustration, and general discussion in this fascinating technical subject.
Ian Witten, Alistair Moffat, and Timothy Bell have updated their original work with this even more impressive second edition. This version adds recent techniques such as block-sorting, new indexing techniques, new lossless compression strategies, and many other elements to the mix. In short, this work is a comprehensive summary of text and image compression, indexing, and querying techniques. The history of relevant algorithm development is woven well with a practical discussion of challenges, pitfalls, and specific solutions.
This title is a textbook-style exposition on the topic, with its information organized very clearly into topics such as compression, indexing, and so forth. In addition to diagrams and example text transformations, the authors use "pseudo-code" to present algorithms in a language-independent manner wherever possible. They also supplement the reading with mg--their own implementation of the techniques. The mg C language source code is freely available on the Web.
Alone, this book is an impressive collection of information. Nevertheless, the authors list numerous titles for further reading in selected topics. Whether you're in the midst of application development and need solutions fast or are merely curious about how top-notch information management is done, this hardcover is an excellent investment. --Stephen W. Plain
Topics covered: Text compression models, including Huffman, LZW, and their variants; trends in information management; index creation and compression; image compression; performance issues; and overall system implementation.
Book Description
In this fully updated second edition of the highly acclaimed
Managing Gigabytes, authors Witten, Moffat, and Bell continue to provide unparalleled coverage of state-of-the-art techniques for compressing and indexing data. Whatever your field, if you work with large quantities of information, this book is essential reading--an authoritative theoretical resource and a practical guide to meeting the toughest storage and access challenges. It covers the latest developments in compression and indexing and their application on the Web and in digital libraries. It also details dozens of powerful techniques supported by mg, the authors' own system for compressing, storing, and retrieving text, images, and textual images. mg's source code is freely available on the Web.
* Up-to-date coverage of new text compression algorithms such as block sorting, approximate arithmetic coding, and fat Huffman coding
* New sections on content-based index compression and distributed querying, with 2 new data structures for fast indexing
* New coverage of image coding, including descriptions of de facto standards in use on the Web (GIF and PNG), information on CALIC, the new proposed JPEG Lossless standard, and JBIG2
* New information on the Internet and WWW, digital libraries, web search engines, and agent-based retrieval
* Accompanied by a public domain system called MG which is a fully worked-out operational example of the advanced techniques developed and explained in the book
* New appendix on an existing digital library system that uses the MG software
Customer Reviews:
one of the best book on search engineering.......2007-04-20
It has been 8 years since it was published and I could see it is still one of the best in IR field. Without much long magic equations, it is not hard for common user to pick it up. There are mainly 2 parts in the book, the first book is compression, most of them are just principle introduction since it does not make sense for the read to invent or implement an algorithm. The second part is indexing (plus some query) which I highly recommended because it is "practical".
The authors are smart guys who could do sth, google mg for their website and mg4j for the ported java implementation.
A Comprehensive Introduction To Text Retrieval Systems.......2005-07-30
A wonderful feature of this book spans out practicality for various topics including compresion algorithms and theory, document and imaging system and information retrieval. On my personal interest, the authors highlight a vast list of not only the theory but present it in a simple common sense logic.
There are several examples that break down complex processes into simple and easy to understand logic and the pages provides a smooth flow of the structured topics. Well organised, presented and fully informative.
Truly an ideal book. This serves as a superior text for students studying document and imaging systems, processing and information and multimedia retrieval subjects. Beautiful!!!
Just on a personal note, it would be great to see some emphasis in the future editions in regards to web mining applications.
Great Book on Information Retrieval.......2004-05-03
Managing Gigabytes is the best book out there on information retrieval. If you're interested in implementing your own IR system, there's nothing available that comes close to this book. But the book is good not just because it's the only one out there: the writing is excellent, the algorithms are presented clearly and explained well, and the coverage is thorough. Additionally, the coverage of compression algorithms is the best I've found in any book. All algorithms and pseudo-code in the book are presented clearly enough such that any competent programmer should be able to implement them. If all else fails, however, the free downloadable source code for the mg system can fill in any gaps.
All in all, this is the best computer science book I've purchased in years. I wish all CS books were written like this one: it doesn't skimp on the theory or on the implementation details.
The Wonderful Thing Is: It's the Only One.......2001-12-21
This is the only book there is that will actually teach you how to build an information retrieval system (aka search engine). It discusses all the algorithms and tradeoffs, and comes with free downloadable source code to experiment with. Some of the material is standard, but covered in more implementation detail here than anywhere else. Some of the material is novel: you won't find better coverage of compression unless you hand-assemble twenty research papers, and reverse-engineer them to figure out how they're implemented. But with "Managing Gigabytes", it's all here. (Although, after a particularly envigorating discussion of how to string together a bunch of techniques to compress their corpus and save a couple 100MB, I did a check and found you could buy 512MB of RAM for less than the cost of the book. Knowledge is Power, but sometimes a little cash is more powerful.) The only negative is that this book is not called "Managing Terabytes", as the first edition promised/threatened it might be. RAM and disk are cheap, but not that cheap, and for now terabytes (and sometimes petabytes) are managed only by NASA, Google, and a few others. I can't wait to see the third edition!
Very clear, but misses some key real-world issues.......2001-08-15
As others have said, MG is a good introductory text for Information Retrieval. However I think it spends a little too much time on compression techniques and lacks a good discussion of incremental or on-line indexing. The book tends to assume that the set of texts to be searched is static - if new documents can be added or old ones deleted it makes the whole problem much harder and many of MG's techniques are no longer relevant. That said, I strongly look forward to Managing Terabytes (if it ever appears).
Book Description
Since 1994, Nancy Mulvany's Indexing Books has been the gold standard for thousands of professional indexers, editors, and authors. This long-awaited second edition, expanded and completely updated, will be equally revered.
Like its predecessor, this edition of Indexing Books offers comprehensive, reliable treatment of indexing principles and practices relevant to authors and indexers alike. In addition to practical advice, the book presents a big-picture perspective on the nature and purpose of indexes and their role in published works. New to this edition are discussions of "information overload" and the role of the index, open-system versus closed-system indexing, electronic submission and display of indexes, and trends in software development, among other topics.
Mulvany is equally comfortable focusing on the nuts and bolts of indexing—how to determine what is indexable, how to decide the depth of an index, and how to work with publisher instructions—and broadly surveying important sources of indexing guidelines such as The Chicago Manual of Style, Sun Microsystems, Oxford University Press, NISO TR03, and ISO 999. Authors will appreciate Mulvany's in-depth consideration of the costs and benefits of preparing one's own index versus hiring a professional, while professional indexers will value Mulvany's insights into computer-aided indexing. Helpful appendixes include resources for indexers, a worksheet for general index specifications, and a bibliography of sources to consult for further information on a range of topics.
Indexing Books is both a practical guide and a manifesto about the vital role of the human-crafted index in the Information Age. As the standard indexing reference, it belongs on the shelves of everyone involved in writing and publishing nonfiction books.
Customer Reviews:
positive.......2006-11-06
I liked the book overall, most notably the expanded coverage of software used in indexing. I didn't agree with every word I read, but the value of Ms Mulvany's 30 years of experience is apparant in her treatment of style guides and her knowledge of industry practices. I'd recommend this book for anyone in the business.
Essential Guide to Learn how to index.......2005-10-14
Mulvany's Indexing guidebook is a must have for anyone seeking to become an indexer. I purchased the book in connection with the United State Deparment of Agriculture Basic Indexing course that I am taking.
How make better book indexes.......2003-06-14
Es importante expresar en idioma español mi más amplia recomendación acerca del libro Indexing Books (Chicago Guides to Writing, Editing, and Publishing) by Nancy C. Mulvany, del cual estoy convencido es el mejor libro para definir metodologías a seguir en la indización de documentos unitarios como es el caso de las monografías. Especialmente se desea recomendar para instituciones de educación suprior que ofrezcan carreras de Bibliotecología, Biblioteconomía, Documentación y en general de las Ciencias de la Información, ya que funciona perfectamente como libro de texto dentro de las materias de Indices y Abstracts. Es importante indicar, que este tipo de libros, deberán complementarse con otros más que oferta Amazon respecto a elaboración de índices de conujuntos de documentos, así también en la profundización de elaboración de resumenes o abstracts, mismos que complementarían el desarrollo de materias académicas que permitan a nuestros estudiantes dominar la panarámica de esta disciplina.
Very informative.......2003-03-12
This books tells you everything about indexing, its standards, how to cross-reference, which mistakes to avoid. The author discusses the traditional indexing method with cards. Her advice on software is outdated, we are far beyond DOS. Otherwise the book provides all the information you need and gives a good idea of what's involved, e.g. that it's not a leisurely job, why not everyone who can read can index, about indexing associations, etc. A very useful book for the professional.
You're an indexer? You should have this book.......2002-11-15
Mulvaney's "Indexing Books" is one of the two books most recommended by professional indexers (the other being the indexing chapter from the "Chicago Manual of Style"). Mulvaney writes in easy to understand, no-nonsense language to first explain what is an index, why you need a professional indexer (vs. say, a computer program), and then goes into detail about how to index books. "Indexing Books" is a must-have for anyone wanting to be a professional indexer.
Book Description
Basic Information Services, Volume I of
Introduction to Reference Work, explains the essential reference processes and sources in today’s libraries. It is a tool for understanding and mastering fundamental reference forms - online, in print, and elsewhere. This eighth edition is completely rewritten to reflect the radical changes library science has undergone since the advent of widely available electronic databases and the Internet.
Customer Reviews:
Introduction to Reference Work, Volume I.......2007-10-11
Just when you think you know all about reference resources, you are bound to learn something new in this book. For a beginner or an experienced person if you are looking for ways to reach different references , this book is great! I think it is worth every cent I paid for it... now looking to buy the second Volume.
Introduction to Reference Work leaves much to be desired.......2006-11-14
As one pursuing a master's degree in Library Science, I was required to buy this book for my reference course. I have found it to be poorly written and organized and of little practical use. Someone out there must be able to know enough about reference techniques to write a more coherent and interesting text on the subject.
Embarrassing.......2006-02-01
This was used as a college textbook in my class, and to be blunt, I and several other classmates were shocked at the poor quality.
Spelling mistakes, poor grammar, etc. abound. If ever a book needed a good editor, this is the one!
It escapes with 2 stars instead of 1 because the patient reader and beginner Library Science student may be able to glean some helpful advice from the text. At the very least, it seems the late William A. Katz knew a thing or two about the subject of reference work. I have to believe there are MANY better resources on this subject however. If not, then someone needs to start writing, there's money to be made!
(Review edited for spelling)
A waste of money.......2005-02-19
This book is terrible. I bought it for a class and by the end of the semester I was completely disgusted with it. It is full of errors: misspelled words, improper examples, inconsistencies in style and usage, and the like. It's hard to believe that this is the book's 8th edition! If you can ignore all of this then maybe you'll find some of the information useful. I'm not sure about its companion volume, I didn't bother reading it. All I know is that when I pay $120 for two thin books I expect far better quality than this.
The Literary Equivalent of Valium.......2005-02-03
One of the dullest, driest reference books I have ever been forced to read (vols. 1 and 2). When you hear "it reads like a doctoral thesis," this is what they mean.
Poor, poor students. All that money ($140 for both volumes) for such mental torture.
Average customer rating:
|
Essential Thesaurus Construction
Vanda Broughton
Manufacturer: Facet Publishing
ProductGroup: Book
Binding: Paperback
Etymology
| Words & Language
| Reference
| Subjects
| Books
Linguistics
| Words & Language
| Reference
| Subjects
| Books
General
| Computers & Internet
| Subjects
| Books
General
| Library & Information Science
| Social Sciences
| Nonfiction
| Subjects
| Books
Indexing & Abstracting
| Library & Information Science
| Social Sciences
| Nonfiction
| Subjects
| Books
Look Inside Reference Books
| Trip
| Specialty Stores
| Books
All Titles
| Qualifying Textbooks - Fall 2007
| Stores
| Books
Similar Items:
-
Don't Make Me Think: A Common Sense Approach to Web Usability, 2nd Edition
ASIN: 185604565X |
Average customer rating:
- A Jolly Good Read
- Unique and appealing
|
Indexers and Indexes in Fact and Fiction (Studies in Book and Print Culture)
Hazel Bell
Manufacturer: University of Toronto Press
ProductGroup: Book
Binding: Paperback
General
| Books & Reading
| Literature & Fiction
| Subjects
| Books
History of Books
| Books & Reading
| Literature & Fiction
| Subjects
| Books
Literacy
| Books & Reading
| Literature & Fiction
| Subjects
| Books
General
| Criticism & Theory
| History & Criticism
| Literature & Fiction
| Subjects
| Books
Byatt, A.S.
| ( B )
| Authors, A-Z
| Literature & Fiction
| Subjects
| Books
General
| Publishing & Books
| Reference
| Subjects
| Books
General
| Reference
| Subjects
| Books
Library Management
| Library & Information Science
| Social Sciences
| Nonfiction
| Subjects
| Books
Cataloging
| Library & Information Science
| Social Sciences
| Nonfiction
| Subjects
| Books
Literacy
| Education
| Nonfiction
| Subjects
| Books
ASIN: 080208494X |
Book Description
The index, taken for granted, perhaps considered boring - or not considered at all - is an essential part of a book. Indexers and Indexing takes a wry look at the history, uses and implications of this little-considered element of the book, and offers an anthology of amusing index extracts. Compiled by a professional indexer, it examines the history and development of the index, and highlights the debate and comment that the index has invited over the years. The author examines indexes from earlier centuries: some endearingly quaint; some deliberately humorous; some plain awful; and some which are astonishing in the vehemence of the views they present. Bell also examines the depiction of indexers in fiction - and the picture she finds is not encouraging to the professional indexer - variously portrayed as diffident, domestic drudges or incompetent and fallen pedants. A wonderful book for editors, indexers and bibliophiles.
Customer Reviews:
A Jolly Good Read.......2004-03-27
It is not often that a book by an indexer about indexers and indexes can be described as fun. Hazel Bell's meticulously researched book is fun to read. Bell was editor of the British journal, The Indexer, for over 18 years. In this volume she has collected gems -- quotes about indexes, quotes from indexes, indexers that appear in fiction as characters, and much more. And, oh yes! The book includes an index.
Unique and appealing.......2002-02-05
No bibliophile will want to resist this instructive, entertaining and amusing anthology, which reveals indexes as whimsical (Lewis Carroll), enticing (Pepys), hilarious (Julian Barnes), or playful (Virginia Woolf). Indexers portrayed in fiction are noted to be everything from drunk (in Trollope) to meticulous (Sherlock Holmes) to romantic (in Barbara Pym). The entries are fascinating, the brief history of indexing is engrossing, and A.S.Byatt's foreword is brilliant. A splendid and unique 'must-have' for any book lover.
Average customer rating:
|
Beyond Book Indexing: How To Get Started in Web Indexing, Embedded Indexing, and Other Computer-Based Media
Marilyn, Ed. Rowland
Manufacturer: INFORMATION TODAY
ProductGroup: Book
Binding: Paperback
Library Management
| Library & Information Science
| Social Sciences
| Nonfiction
| Subjects
| Books
Cataloging
| Library & Information Science
| Social Sciences
| Nonfiction
| Subjects
| Books
Indexing & Abstracting
| Library & Information Science
| Social Sciences
| Nonfiction
| Subjects
| Books
Internet
| Home Computing
| Computers & Internet
| Subjects
| Books
| Internet & Education
| Online Searching
| Web Browsers
| Web for Kids
General
| Computers & Internet
| Subjects
| Books
General
| Reference
| Subjects
| Books
jp-unknown3
| Specialty Stores
| Books
All Titles
| Qualifying Textbooks - Fall 2007
| Stores
| Books
ASIN: 1573870811 |
Customer Reviews:
If you love indexing..........2000-06-16
We are all familiar with the book index. Located at the end of a paper volume, it contains an alphabetical listing of subjects covered. You can use it to quickly locate information on a topic of particular interest at a given moment. Professional indexers have been creating these lists for many years. However, the publishing industry has undergone rapid changes lately. Automation led to computerized layout and desktop publishing as well as fully online publishing. Indexers have moved into these new areas with confidence and have developed a variety of new skills. If you are an indexer unsure about how to move into the future, or a budding indexer just getting started in the field, "Beyond book indexing" is a good introduction to the topics listed in its subtitle: embedded indexing, web indexing, and other computer media.
Of course, the book includes an excellent index by Janet Perlman. I believe this is the first time I have seen an author given specific credit for an index in a book! A glossary of terms is also provided. Chapters include references and/or a list of relevant web sites for further information. I recommend "Beyond book indexing" for experienced indexers who want to keep up with current trends or find work in new areas of indexing. Beginners may find the book to be a bit overwhelming; the focus is on teaching new skills to the professional rather than explaining the basics of indexing.
Average customer rating:
|
Indexing, the Art of: A Guide to the Indexing of Books and Periodicals
G. Norman Knight
Manufacturer: Routledge
ProductGroup: Book
Binding: Hardcover
History
| Subjects
| Books
| Africa
| Americas
| Ancient
| Arctic & Antarctica
| Asia
| Audiobooks
| Australia & Oceania
| Europe
| Gay & Lesbian
| Historical Study
| Large Print
| Middle East
| Military
| Military Science
| Russia
| United States
| World
General
| Bibliographies & Indexes
| Publishing & Books
| Reference
| Subjects
| Books
History
| Bibliographies & Indexes
| Publishing & Books
| Reference
| Subjects
| Books
Art & Photography
| Bibliographies & Indexes
| Publishing & Books
| Reference
| Subjects
| Books
Science
| Bibliographies & Indexes
| Publishing & Books
| Reference
| Subjects
| Books
General
| Publishing & Books
| Reference
| Subjects
| Books
Library Management
| Library & Information Science
| Social Sciences
| Nonfiction
| Subjects
| Books
General
| Library & Information Science
| Social Sciences
| Nonfiction
| Subjects
| Books
Cataloging
| Library & Information Science
| Social Sciences
| Nonfiction
| Subjects
| Books
General
| Arts & Photography
| Subjects
| Books
ASIN: 0040290026 |
Average customer rating:
|
Alphabetic Indexing Rules
Joseph S. Fosegan
Manufacturer: Thomson South-Western
ProductGroup: Book
Binding: Paperback
Study Skills
| Education
| Nonfiction
| Subjects
| Books
Workbooks
| Education
| Reference
| Subjects
| Books
General
| Bibliographies & Indexes
| Publishing & Books
| Reference
| Subjects
| Books
Science
| Bibliographies & Indexes
| Publishing & Books
| Reference
| Subjects
| Books
Information Systems
| Software Engineering
| Computer Science
| Computers & Internet
| Subjects
| Books
General
| Computers & Internet
| Subjects
| Books
ASIN: 0538711698 |
Book Description
With this versatile product, learners can use the text-workbook, the programmed software, or both to become experts on alphabetic indexing rules and ARMA-compatible filing rules. Completion time: 15-20 hours.
Average customer rating:
|
High-Dimensional Indexing: Transformational Approaches to High-Dimensional Range and Similarity Searches (Lecture Notes Series)
Cui Yu
Manufacturer: Springer-Verlag
ProductGroup: Book
Binding: Paperback
General
| Medicine
| Subjects
| Books
General
| Programming
| Computers & Internet
| Subjects
| Books
Multimedia Information Systems
| Software Engineering
| Computer Science
| Computers & Internet
| Subjects
| Books
General
| Computers & Internet
| Subjects
| Books
General
| Databases
| Computers & Internet
| Subjects
| Books
General
| Software
| Computers & Internet
| Subjects
| Books
Mathematics
| Professional Science
| Professional & Technical
| Subjects
| Books
| Applied
| Chaos & Systems
| Geometry & Topology
| Mathematical Analysis
| Mathematical Physics
| Number Systems
| Pure Mathematics
| Transformations
| Trigonometry
Indexing & Abstracting
| Library & Information Science
| Social Sciences
| Nonfiction
| Subjects
| Books
All Amazon Upgrade
| Amazon Upgrade
| Stores
| Books
Computers & Internet
| Amazon Upgrade
| Stores
| Books
Medicine
| Amazon Upgrade
| Stores
| Books
Nonfiction
| Amazon Upgrade
| Stores
| Books
Professional & Technical
| Amazon Upgrade
| Stores
| Books
ASIN: 3540441999 |
Book Description
In this monograph, we study the problem of high-dimensional indexing and systematically introduce two efficient index structures: one for range queries and the other for similarity queries. Extensive experiments and comparison studies are conducted to demonstrate the superiority of the proposed indexing methods.
Many new database applications, such as multimedia databases or stock price information systems, transform important features or properties of data objects into high-dimensional points. Searching for objects based on these features is thus a search of points in this feature space. To support efficient retrieval in such high-dimensional databases, indexes are required to prune the search space. Indexes for low-dimensional databases are well studied, whereas most of these application specific indexes are not scaleable with the number of dimensions, and they are not designed to support similarity searches and high-dimensional joins.
Book Description
This book constitutes the refereed proceedings of the 4th International Conference on Image and Video Retrieval, CIVR 2005, held in Singapore, in July 2005. The 20 revised full papers and 42 poster papers presented together with an introduction and 4 invited papers were carefully reviewed and selected from 128 submissions. Besides the invited and industrial presentations the papers are organized in topical sections on video retrieval techniques, video story segmentation and event detection, semantics in video retrieval, image indexing and retrieval, image/video annotation and clustering, interactive video retrieval and others, image/video retrieval applications, and two sections comprising the poster presentations on video processing, retrieval and multimedia systems and on image feature extraction, indexing and retrieval.
Books:
- Mastering Italian: with 15 Compact Discs (Mastering Series: Level 1 CD Packages)
- Mathematics for Finance: An Introduction to Financial Engineering (SPRINGER UNDERGRADUATE MATHEMATICS SERIES)
- McGuffey's Eclectic Readers/Boxed
- MCSE Self-Paced Training Kit (Exams 70-290, 70-291, 70-293, 70-294): Microsoft Windows Server 2003 Core Requirements, Second Edition
- Nigella Bites: From Family Meals to Elegant Dinners -- Easy, Delectable Recipes for Any Occasion
- Novel & Short Story Writer's Market 2007 (Novel and Short Story Writer's Market)
- Only You Can Save Mankind (Johnny Maxwell Trilogy)
- Osho Zen Tarot: The Transcendental Game Of Zen
- Our 50 States: A Family Adventure Across America
- Photonics: Optical Electronics in Modern Communications (The Oxford Series in Electrical and Computer Engineering)
Books Index
Books Home
Recommended Books
- Upgrading and Repairing PCs
- The Great Earthquake and Firestorms of 1906: How San Francisco Nearly Destroyed Itself
- Negotiating Business Equipment Leases
- Song Of Eagles
- Study Guide to accompany Cost Management: Strategies for Business Decisions, Third Edition
- The Lost Boy: A Foster Child's Search for the Love of a Family
- The Explorations of Captain James Cook in the Pacific
- Complete Idiot's Guide to QuickBooks and QuickBooks Pro 99
- Selling the Free Market: The Rhetoric of Economic Correctness
- The Journey of Ibn Fattouma