US20110029545A1 - Syllabic search engines and related methods - Google Patents

Syllabic search engines and related methods Download PDF

Info

Publication number
US20110029545A1
US20110029545A1 US12/904,821 US90482110A US2011029545A1 US 20110029545 A1 US20110029545 A1 US 20110029545A1 US 90482110 A US90482110 A US 90482110A US 2011029545 A1 US2011029545 A1 US 2011029545A1
Authority
US
United States
Prior art keywords
syllable
words
word
document
database
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US12/904,821
Inventor
Edward O. Clapper
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Intel Corp
Original Assignee
Intel Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Intel Corp filed Critical Intel Corp
Priority to US12/904,821 priority Critical patent/US20110029545A1/en
Publication of US20110029545A1 publication Critical patent/US20110029545A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution

Definitions

  • the inventive subject matter relates generally to the field of data processing and, more particularly, to a search engine for finding information via a sequence of syllable counts.
  • Search engines are known computer-based programs for finding desired information.
  • Various Internet search engines such as AltaVista and Google, are well known.
  • Other search engines exist for different applications, e.g. search-and-find functions for applications such as word processors, spreadsheets, and data base managers.
  • known search engines look for text strings or phrases that match one that is input by a computer user. When a match is successfully made, the known search engine returns a list of “hits” containing the input text string.
  • known search engines do not perform well when searching for phrases having missing or incorrectly sequenced information, such as missing words, missing letters, or misspelled words.
  • the computer user When a computer user is attempting to find a desired document based upon a phrase or text string, the computer user may not know the complete phrase and may know only isolated words separated by unknown words. Or the computer user may know all of the words but not know the correct spelling of some or all of the words.
  • FIG. 1 illustrates a block diagram of a computing device, and of an optional computer network comprising remote computing devices, that either individually or collectively can provide a syllabic search function, in accordance with one embodiment of the invention
  • FIG. 2 illustrates a block diagram of elements of a computing device that provides a syllabic search function, in accordance with one embodiment of the invention
  • FIGS. 3A and 3B together illustrate a flow diagram of a method of creating a syllabic document database, which method is implemented in a computer system, in a computer network, or in a computational machine operating under control of instructions residing on a machine-accessible medium, in accordance with various embodiments of the invention
  • FIG. 4 illustrates a flow diagram of a method of performing a syllabic search based upon user input, which method is implemented in a computer system, in a computer network, or in a computational machine operating under control of instructions residing on a machine-accessible medium, in accordance with various embodiments of the invention.
  • FIG. 5 illustrates a word sequence, a corresponding syllabic sequence, and an example of a syllabic search sequence, in accordance with various embodiments of the invention.
  • an improved search engine for a computing device or computer network utilizes a search string comprising, in addition to known words, numbers representing a syllable count for each unknown or uncertain word.
  • One or more pattern-matching algorithms are utilized to search a document database for any document that matches the input search string.
  • the document database is constructed by analyzing a number of documents, utilizing document-analyzing algorithms.
  • each database record comprises a document that has been analyzed into one or more groups of word sequences.
  • Each word sequence comprises an ordered list of words in the word sequence, as well as a corresponding ordered list of the syllable count for each word in the word sequence.
  • the syllabic search engine can be implemented in different ways, such as through a software application, an operating system, network software, or a custom software module. Improved computers and computer networks for providing a syllabic search function are also described.
  • syllable means a segment of speech typically produced with a single pulse of air pressure from the lungs.
  • bit has a single syllable
  • data has two syllables
  • computer has three syllables
  • Search engines built in accordance with the inventive subject matter can effectively and quickly locate documents which otherwise could not be located, and they can significantly increase the commercial value of computer software, computer systems, and/or computer networks in which they are featured.
  • FIG. 1 illustrates a block diagram of a computing device 2 , and of an optional computer network 24 comprising remote computing devices 26 and 28 , that either individually or collectively can provide a syllabic search function, in accordance with one embodiment of the invention.
  • FIG. 1 and the following discussion are intended to provide a brief, general description of a suitable computing environment in which certain aspects of the illustrated inventive subject matter may be implemented.
  • An exemplary system to provide a syllabic search function includes a machine or computing device 2 having system bus 3 .
  • processors 4 attached to bus 3 are one or more processors 4 , a display 6 , and one or more data entry elements 8 such as a keyboard, mouse, trackball, joy stick, touch-sensitive screen, or other well-known user interface data entry element(s).
  • data entry elements 8 such as a keyboard, mouse, trackball, joy stick, touch-sensitive screen, or other well-known user interface data entry element(s).
  • a memory 10 which can include any suitable memory device(s) like read only memory (ROM); random access memory (RAM); hard drive; removable media drive for handling compact disks (CDs), digital video disks (DVDs), diskettes, magnetic tape cartridges, and other types of data storage; or the like. Additional elements can also be attached to bus 3 such as a modem 12 , a network interface unit 14 , one or more speakers 16 , and other suitable devices 18 .
  • ROM read only memory
  • RAM random access memory
  • CDs compact disks
  • DVDs digital video disks
  • Additional elements can also be attached to bus 3 such as a modem 12 , a network interface unit 14 , one or more speakers 16 , and other suitable devices 18 .
  • processor means any type of computational circuit, such as but not limited to a microprocessor, a microcontroller, a complex instruction set computing (CISC) microprocessor, a reduced instruction set computing (RISC) microprocessor, a very long instruction word (VLIW) microprocessor, a graphics processor, a digital signal processor, or any other type of processor or processing circuit.
  • CISC complex instruction set computing
  • RISC reduced instruction set computing
  • VLIW very long instruction word
  • graphics processor a digital signal processor
  • digital signal processor or any other type of processor or processing circuit.
  • embedded controllers such as Generic or Programmable Logic Devices or Arrays, Application Specific Integrated Circuits, single-chip computers, smart cards, and the like.
  • Suitable means having characteristics that are sufficient to produce the desired result(s). Suitability for the intended purpose can be determined by one of ordinary skill in the art using only routine experimentation.
  • Network 24 can be an intranet; the Internet; a local area network; a wide area network; a cellular, cable, laser, satellite, microwave, “Blue Tooth”, optical, or infrared network; or any other short-range or long-range wired or wireless network.
  • Program modules may be stored in memory 10 and associated storage media, e.g., hard-drives, floppy-disks, optical storage, magnetic cassettes, tapes, flash memory cards, memory sticks, digital video disks, chemical storage, and/or biological storage.
  • Program modules may be delivered over transmission environments, including network 24 , in the form of packets, serial data, parallel data, propagated signals, etc.
  • Program modules may be used in a compressed or encrypted format, and they may be used in a distributed environment and stored in local and/or remote memory, for access by single and multi-processor machines, portable computers, handheld devices (e.g., Personal Digital Assistants (PDAs)), cellular telephones, pagers, personal entertainment devices (e.g. digital music players, digital video players, etc.), one-way or two-way radios, or the like.
  • PDAs Personal Digital Assistants
  • cellular telephones e.g., cellular telephones, pagers, personal entertainment devices (e.g. digital music players, digital video players, etc.), one-way or two-way radios, or the like.
  • PDAs Personal Digital Assistants
  • personal entertainment devices e.g. digital music players, digital video players, etc.
  • one-way or two-way radios or the like.
  • remote computing devices 26 and 28 may be configured like computing device 2 , and they can include many or all of the elements discussed for computing device 2 .
  • One of remote computing devices 26 and 28 could also be a server comprising software for providing a syllabic search engine capability to computer users of computing device 2 and other computing devices coupled to network 24 .
  • devices 2 , 26 , and 28 are typically separate communicatively-coupled components they could be embodied within a single device.
  • Computing device 2 can be any type of computer or computer-based display for displaying information, such as an Internet appliance; a personal computer; a pay-for-use computer located in an Internet cafe, hotel, or the like; an electronic book or other device for displaying information; a pager; a cellular telephone; a palm-top computer; a notebook computer; a laptop computer; or the like.
  • Devices 2 , 26 , and 28 can further comprise a plurality of types of software programs.
  • computing device 2 can comprise software 20 that includes a basic input/output system (BIOS), operating system (O/S) software, one or more software applications, a user interface, one or more databases including a document database, a syllabic search engine, and any other types of software as required to perform the operational requirements of computing device 2 .
  • BIOS basic input/output system
  • O/S operating system
  • databases including a document database, a syllabic search engine, and any other types of software as required to perform the operational requirements of computing device 2 .
  • FIG. 2 illustrates a block diagram of elements of a computing device 100 that provides a syllabic search function, in accordance with one embodiment of the invention.
  • Computing device 100 comprises a display 102 of any suitable type, such as a cathode ray tube (CRT) display, liquid crystal display, plasma display, or the like.
  • a display 102 of any suitable type, such as a cathode ray tube (CRT) display, liquid crystal display, plasma display, or the like.
  • CTR cathode ray tube
  • LCD liquid crystal display
  • plasma display or the like.
  • a data entry element 104 can be of any suitable type(s), such as a keyboard, pointing device (e.g. a mouse, trackball, joy stick, touch pad, touch sensitive screen, etc.), voice-recognition mechanism, or the like.
  • a keyboard e.g. a keyboard, pointing device (e.g. a mouse, trackball, joy stick, touch pad, touch sensitive screen, etc.), voice-recognition mechanism, or the like.
  • Processor 106 can be of any type, including those mentioned earlier regarding the description of FIG. 1 .
  • processor 106 can access document-analyzing algorithm(s) 108 and pattern-matching algorithm(s) 110 , which reside in any suitable machine-accessible media, such as any type(s) of memory 10 mentioned earlier regarding the description of FIG. 1 .
  • suitable machine-accessible media such as any type(s) of memory 10 mentioned earlier regarding the description of FIG. 1 .
  • One of ordinary skill in the art is capable of writing suitable computer program instructions, for storage on suitable machine-accessible media, which when accessed by a processor or other computational machine result in documents being analyzed as described latter herein, and result in a search string input by a computer user being pattern-matched with search strings stored in a document database to retrieve the identity of one or more matching documents.
  • a resource database 114 includes a suitable syllable dictionary that provides a syllable count for each of a large number of words in the particular language being used by the user of the computing device.
  • the syllable dictionary can, if desired, also include multiple entries for certain words having alternative correct pronunciations, such as “real”, which can be pronounced with either one or two syllables.
  • Resource database 114 can also include, if desired, a grammar resource to assist in resolving pattern matching, and it can also include other resources, such as foreign language dictionaries or special purpose dictionaries, such as scientific, medical, legal, pharmaceutical, financial, or other types of dictionaries.
  • Memory storage 116 can be any type(s) of memory 10 mentioned earlier regarding the description of FIG. 1 .
  • Storage 116 can be used, for example, as temporary storage during processing.
  • Computing device 100 can also comprise additional elements (not shown), such as a printer, modem, network interface, speaker(s), and/or other components or peripherals.
  • additional elements such as a printer, modem, network interface, speaker(s), and/or other components or peripherals.
  • FIGS. 3A and 3B together illustrate a flow diagram of a method 200 of creating a syllabic document database, which method is implemented in a computer system, in a computer network, or in a computational machine operating under control of instructions residing on a machine-accessible medium, in accordance with various embodiments of the invention.
  • the method is merely one example of creating a database of analyzed documents, each comprising an ordered sequence of words, with each word having an associated syllable count.
  • Many other implementations are also possible, as will be apparent to those of ordinary skill in the art.
  • a document is selected for analysis.
  • a document can be of any type such as, but not limited to, any work of authorship as defined in Title 17 (United States Copyright Code) Section 101, “Definitions”, or the legal equivalent. These include literary works, song lyrics, dramatic works, motion picture scripts, and audiovisual scripts. Documents can also include uncopyrightable works of any type, as well as works on which copyrights have expired.
  • the inventive subject matter is used to identify a song title based on a search string that includes nothing more than an ordered sequence of syllable counts from the song lyric.
  • one or more known words can also be input as part of the ordered sequence of syllable counts, in place of the syllable count(s) for such known word(s).
  • the inventive subject matter is used to identify a movie or television episode based upon a search string from the dialog.
  • a search string can be used to identify a speech from a database containing speech documents; to retrieve a poem title from a database of poetry; to identify a book title from a quotation; and the like.
  • a document can also be synonymous with a file, such as a file containing text, an email, a message to a pager or cellular telephone, an Internet page, etc.
  • Each document can be assigned a unique document identifier, such as a number or alphanumeric label.
  • the first phrase in the document is analyzed.
  • a document may comprise only a single phrase.
  • the document may be parsed into a number of phrases in any suitable manner. For example, if the document is a song lyric, each line can be considered a phrase; alternatively, each stanza and the chorus can be considered a phrase. If the document is a book, each sentence can be considered a phrase. Each phrase can be assigned a sequential phrase number, if desired, to identify the order of the phrase within the document.
  • the first word in the first phrase is analyzed.
  • the word's numerical position within the phrase is recorded. This can be done, for example, by assigning a word sequence number, e.g. the number “1”, to the first word, and so forth for successive words.
  • the first word is looked up in the syllable dictionary to obtain the number of syllables, and this number is recorded for the first word.
  • a database record is constructed that may include the following recorded information: the document identifier, the phrase number, the word sequence number within the phrase, the text value of the word, and the syllable count for the word.
  • Each record comprises an ordered listing of words, and an ordered syllable count listing.
  • the process increments to the next word when analysis has been completed on the preceding word.
  • the process increments to the next phrase when analysis has been completed on the preceding phrase.
  • FIG. 4 illustrates a flow diagram of a method 300 of performing a syllabic search based upon user input, which method is implemented in a computer system, in a computer network, or in a computational machine operating under control of instructions residing on a machine-accessible medium, in accordance with various embodiments of the invention.
  • the method is merely one example of utilizing a user-provided search string, comprising an ordered sequence of syllable counts, and optionally one or more known words, to search a document database to retrieve the identification of a desired document.
  • Many other implementations are also possible, as will be apparent to those of ordinary skill in the art.
  • the user input sequence or search string is obtained.
  • the user input sequence comprises a logical ordering of the user's estimate of the syllable count for each unknown word in the desired phrase and/or document.
  • the user input sequence can optionally also include one or more known words of the phrase and/or document. An example of this is provided regarding FIG. 5 to be discussed below.
  • the user input sequence or search string is compared with the contents of the document database that was created using, for example, the process illustrated in FIGS. 3A and 3B .
  • a suitable algorithm can be used to perform this match, such as one or more pattern-matching algorithms. In this manner, an identical match may be found. Or, if an identical match is not found, one or more documents may be identified having similar but not identical search strings to the user's input, enabling the user to evaluate which, if any, satisfy the user's search.
  • the algorithm can include a checksum calculation with respect to the number of syllables in the user input sequence (i.e. the total number of syllables in the known words and in the syllable counts for unknown words), in order to cull out retrieved search strings having a syllable count exceeding a given margin of error. For example, if the checksum count of the number of syllables in the user input sequence is 25, and if the predetermined allowable margin of error is set at plus or minus 10 syllables, then retrieved search strings having fewer than 15 or more than 35 syllables, for example, could be automatically eliminated and not displayed to the user.
  • a checksum calculation with respect to the number of syllables in the user input sequence (i.e. the total number of syllables in the known words and in the syllable counts for unknown words), in order to cull out retrieved search strings having a syllable count exceeding a given margin of error
  • the identity of the retrieved document(s) is displayed on a display of the computing device. If a number of potential “hits” have been retrieved, a suitable algorithm can prioritize them according to the degree of match achieved, and a list of the best-matched hits can be displayed. The process ends at 308 .
  • FIG. 5 illustrates a word sequence, a corresponding syllabic sequence, and an example of a syllabic search sequence, in accordance with various embodiments of the invention.
  • an exemplary word sequence 402 is a phrase or stanza from a song lyric: “What the innocent feel, they never care to reveal”.
  • a corresponding syllabic sequence 404 is shown by the numerical sequence: 1, 1, 3, 1, 1, 2, 1, 1, 2.
  • Each number represents the syllable count for the corresponding word in word sequence 402 .
  • the word “What” has 1 syllable; the word “the” has 1 syllable; the word “innocent” has 3 syllables; and so on.
  • An example of a user search string or search sequence 406 is the sequence: what, ?, ???, ?, they, ??, ?, to, ??, wherein each “?” represents the number of syllables perceived by the user. If the user were unable to understand any word of the song phrase, the user could input just a syllable sequence, e.g. ?, ?, ???, ?, ?, ?, ?, ??.
  • Any suitable user interface can be used to interpret the user search string and to convert it into a digital format that is suitable for conducting a comparison search on the document database.
  • sequences of syllable counts possibly interspersed with known words, a relatively small number of numerical digits in a sequence, or possibly a single-digit total, can be used to locate a matching sequence without the expenditure of large amounts of processing power and/or time that would be required to compare strings of alphanumeric characters.
  • the inventive subject matter is tolerant of missing words, groups of words, or of all words in a user's search string, provided that every missing word is replaced with a corresponding syllable count in the same order as the word appears within the phrase or document. If a conventional search engine were used on only easily understood words, such as “Mississippi”, “computer”, “music”, it would return an enormous number of potential “hits”, whereas the inventive subject matter can narrow the number of hits considerably if syllable counts are substituted for unknown words.
  • the inventive subject matter also is tolerant of misspelled words in the user search string.
  • a conventional search engine can retrieve a large quantity of false hits if a search string contains a misspelled word, whereas the inventive subject matter need not give significant weight to misspelled words and can give proportionally greater weight to sequences of syllable counts.
  • the user can substitute a syllable count for any word whose spelling the user is uncertain about.
  • inventive subject matter could be used as a supplement or background to a conventional search engine, in order to improve the quality of its searches.
  • FIGS. 3A , 3 B, and 4 are merely representative and not exclusive, and that many other different alternative operations could be implemented using the concepts taught by the inventive subject matter.
  • an improved search engine for a computing device or computer network, utilizes a search string comprising, in addition to known words, numbers representing a syllable count for each unknown or uncertain word.
  • One or more pattern-matching algorithms are utilized to search a document database for a document that matches the input search string.
  • the document database is constructed by analyzing a number of documents, utilizing document-analyzing algorithms.
  • each database record comprises a document that has been analyzed into one or more groups of word sequences.
  • Each word sequence comprises an ordered list of words in the word sequence, as well as a corresponding ordered list of the syllable count for each word in the word sequence.
  • a syllabic search engine can be implemented in different ways, such as in a software application, in an operating system, through network software, or in a custom software module. Improved computers and computer networks for providing a syllabic search function have also been described.
  • Search engines built in accordance with the inventive subject matter can effectively and quickly locate documents which otherwise could not be located, and they can significantly increase the commercial value of computer software, computer systems, and/or computer networks in which they are featured.

Abstract

An improved search engine, for a computing device or computer network, utilizes search strings comprising complete words and numbers representing a syllable count for each unknown word. Pattern-matching algorithms are utilized to search a document database for documents that match the input search strings. The document database is constructed by analyzing a number of documents, utilizing document-analyzing algorithms. In one embodiment, each database record comprises a document that has been analyzed into one or more groups of word sequences. Each word sequence comprises an ordered list of words in the word sequence, as well as a corresponding ordered list of the syllable count for each word in the word sequence. The syllabic search engine can be implemented in different ways, such as through a software application, operating system, network software, or a custom software module. Improved computers and computer networks for providing a syllabic search function are also described.

Description

    DIVISIONAL APPLICATION
  • This application is a divisional of U.S. patent application Ser. No. 10/020,483, filed on Dec. 12, 2001, which is incorporated herein by reference in its entirety.
  • TECHNICAL FIELD
  • The inventive subject matter relates generally to the field of data processing and, more particularly, to a search engine for finding information via a sequence of syllable counts.
  • BACKGROUND INFORMATION
  • Search engines are known computer-based programs for finding desired information. Various Internet search engines, such as AltaVista and Google, are well known. Other search engines exist for different applications, e.g. search-and-find functions for applications such as word processors, spreadsheets, and data base managers.
  • Known search engines look for text strings or phrases that match one that is input by a computer user. When a match is successfully made, the known search engine returns a list of “hits” containing the input text string. However, known search engines do not perform well when searching for phrases having missing or incorrectly sequenced information, such as missing words, missing letters, or misspelled words.
  • When a computer user is attempting to find a desired document based upon a phrase or text string, the computer user may not know the complete phrase and may know only isolated words separated by unknown words. Or the computer user may know all of the words but not know the correct spelling of some or all of the words.
  • For the reasons stated above, and for other reasons stated below which will become apparent to those skilled in the art upon reading and understanding the present specification, there is a significant need in the art for systems and methods that support computer-based searching through the utilization of incomplete phrases and phrases containing misspelled words.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 illustrates a block diagram of a computing device, and of an optional computer network comprising remote computing devices, that either individually or collectively can provide a syllabic search function, in accordance with one embodiment of the invention;
  • FIG. 2 illustrates a block diagram of elements of a computing device that provides a syllabic search function, in accordance with one embodiment of the invention;
  • FIGS. 3A and 3B together illustrate a flow diagram of a method of creating a syllabic document database, which method is implemented in a computer system, in a computer network, or in a computational machine operating under control of instructions residing on a machine-accessible medium, in accordance with various embodiments of the invention;
  • FIG. 4 illustrates a flow diagram of a method of performing a syllabic search based upon user input, which method is implemented in a computer system, in a computer network, or in a computational machine operating under control of instructions residing on a machine-accessible medium, in accordance with various embodiments of the invention; and
  • FIG. 5 illustrates a word sequence, a corresponding syllabic sequence, and an example of a syllabic search sequence, in accordance with various embodiments of the invention.
  • DETAILED DESCRIPTION
  • In the following detailed description of embodiments of the invention, reference is made to the accompanying drawings that form a part hereof, and in which is shown by way of illustration specific preferred embodiments in which the inventive subject matter may be practiced. These embodiments are described in sufficient detail to enable those skilled in the art to practice them, and it is to be understood that other embodiments may be utilized and that logical, procedural, mechanical, and electrical changes may be made without departing from the spirit and scope of the inventive subject matter. Such embodiments of the inventive subject matter may be referred to, individually and/or collectively, herein by the term “invention” merely for convenience and without intending to voluntarily limit the scope of this application to any single invention or inventive concept if more than one is in fact disclosed. The following detailed description is, therefore, not to be taken in a limiting sense, and the scope of the inventive subject matter is defined only by the appended claims.
  • The inventive subject matter provides a syllabic search function for locating desired documents stored within computing devices and/or computer networks. Various embodiments are illustrated and described herein. According to one embodiment, an improved search engine for a computing device or computer network utilizes a search string comprising, in addition to known words, numbers representing a syllable count for each unknown or uncertain word. One or more pattern-matching algorithms are utilized to search a document database for any document that matches the input search string.
  • The document database is constructed by analyzing a number of documents, utilizing document-analyzing algorithms. In one embodiment, each database record comprises a document that has been analyzed into one or more groups of word sequences. Each word sequence comprises an ordered list of words in the word sequence, as well as a corresponding ordered list of the syllable count for each word in the word sequence. The syllabic search engine can be implemented in different ways, such as through a software application, an operating system, network software, or a custom software module. Improved computers and computer networks for providing a syllabic search function are also described.
  • The term “syllable”, as used herein, means a segment of speech typically produced with a single pulse of air pressure from the lungs. By way of example, the word “bit” has a single syllable; the word “data” has two syllables; the word “computer” has three syllables; and so on.
  • Search engines built in accordance with the inventive subject matter can effectively and quickly locate documents which otherwise could not be located, and they can significantly increase the commercial value of computer software, computer systems, and/or computer networks in which they are featured.
  • FIG. 1 illustrates a block diagram of a computing device 2, and of an optional computer network 24 comprising remote computing devices 26 and 28, that either individually or collectively can provide a syllabic search function, in accordance with one embodiment of the invention.
  • FIG. 1 and the following discussion are intended to provide a brief, general description of a suitable computing environment in which certain aspects of the illustrated inventive subject matter may be implemented. An exemplary system to provide a syllabic search function includes a machine or computing device 2 having system bus 3. Typically, attached to bus 3 are one or more processors 4, a display 6, and one or more data entry elements 8 such as a keyboard, mouse, trackball, joy stick, touch-sensitive screen, or other well-known user interface data entry element(s). Also attached to bus 3 is a memory 10, which can include any suitable memory device(s) like read only memory (ROM); random access memory (RAM); hard drive; removable media drive for handling compact disks (CDs), digital video disks (DVDs), diskettes, magnetic tape cartridges, and other types of data storage; or the like. Additional elements can also be attached to bus 3 such as a modem 12, a network interface unit 14, one or more speakers 16, and other suitable devices 18.
  • “Processor”, as used herein, means any type of computational circuit, such as but not limited to a microprocessor, a microcontroller, a complex instruction set computing (CISC) microprocessor, a reduced instruction set computing (RISC) microprocessor, a very long instruction word (VLIW) microprocessor, a graphics processor, a digital signal processor, or any other type of processor or processing circuit. The term also includes embedded controllers, such as Generic or Programmable Logic Devices or Arrays, Application Specific Integrated Circuits, single-chip computers, smart cards, and the like.
  • “Suitable”, as used herein, means having characteristics that are sufficient to produce the desired result(s). Suitability for the intended purpose can be determined by one of ordinary skill in the art using only routine experimentation.
  • Computing device 2 can optionally operate in a networked environment using a physical and/or a logical connection 22 to one or more remote computing devices or systems 26 and 28 via wired or wireless network 24. Network 24 can be an intranet; the Internet; a local area network; a wide area network; a cellular, cable, laser, satellite, microwave, “Blue Tooth”, optical, or infrared network; or any other short-range or long-range wired or wireless network.
  • The inventive subject matter may be implemented in conjunction with program modules, including functions, procedures, data structures, application programs, etc. for performing tasks, or defining abstract data types or low-level hardware contexts. Program modules may be stored in memory 10 and associated storage media, e.g., hard-drives, floppy-disks, optical storage, magnetic cassettes, tapes, flash memory cards, memory sticks, digital video disks, chemical storage, and/or biological storage. Program modules may be delivered over transmission environments, including network 24, in the form of packets, serial data, parallel data, propagated signals, etc. Program modules may be used in a compressed or encrypted format, and they may be used in a distributed environment and stored in local and/or remote memory, for access by single and multi-processor machines, portable computers, handheld devices (e.g., Personal Digital Assistants (PDAs)), cellular telephones, pagers, personal entertainment devices (e.g. digital music players, digital video players, etc.), one-way or two-way radios, or the like.
  • Thus, for example, with respect to the illustrated embodiments, assuming that computing device 2 provides a syllabic search engine capability, then one or both of remote computing devices 26 and 28 may be configured like computing device 2, and they can include many or all of the elements discussed for computing device 2. One of remote computing devices 26 and 28 could also be a server comprising software for providing a syllabic search engine capability to computer users of computing device 2 and other computing devices coupled to network 24. It should also be appreciated that while devices 2, 26, and 28 are typically separate communicatively-coupled components they could be embodied within a single device.
  • Computing device 2 can be any type of computer or computer-based display for displaying information, such as an Internet appliance; a personal computer; a pay-for-use computer located in an Internet cafe, hotel, or the like; an electronic book or other device for displaying information; a pager; a cellular telephone; a palm-top computer; a notebook computer; a laptop computer; or the like.
  • Devices 2, 26, and 28 can further comprise a plurality of types of software programs. For example, computing device 2 can comprise software 20 that includes a basic input/output system (BIOS), operating system (O/S) software, one or more software applications, a user interface, one or more databases including a document database, a syllabic search engine, and any other types of software as required to perform the operational requirements of computing device 2.
  • FIG. 2 illustrates a block diagram of elements of a computing device 100 that provides a syllabic search function, in accordance with one embodiment of the invention.
  • Computing device 100 comprises a display 102 of any suitable type, such as a cathode ray tube (CRT) display, liquid crystal display, plasma display, or the like.
  • A data entry element 104 can be of any suitable type(s), such as a keyboard, pointing device (e.g. a mouse, trackball, joy stick, touch pad, touch sensitive screen, etc.), voice-recognition mechanism, or the like.
  • Processor 106 can be of any type, including those mentioned earlier regarding the description of FIG. 1.
  • Still referring to FIG. 2, processor 106 can access document-analyzing algorithm(s) 108 and pattern-matching algorithm(s) 110, which reside in any suitable machine-accessible media, such as any type(s) of memory 10 mentioned earlier regarding the description of FIG. 1. One of ordinary skill in the art is capable of writing suitable computer program instructions, for storage on suitable machine-accessible media, which when accessed by a processor or other computational machine result in documents being analyzed as described latter herein, and result in a search string input by a computer user being pattern-matched with search strings stored in a document database to retrieve the identity of one or more matching documents.
  • Still referring to FIG. 2, a document database 112 can be implemented with any suitable database software program. A resource database 114 includes a suitable syllable dictionary that provides a syllable count for each of a large number of words in the particular language being used by the user of the computing device. The syllable dictionary can, if desired, also include multiple entries for certain words having alternative correct pronunciations, such as “real”, which can be pronounced with either one or two syllables. Resource database 114 can also include, if desired, a grammar resource to assist in resolving pattern matching, and it can also include other resources, such as foreign language dictionaries or special purpose dictionaries, such as scientific, medical, legal, pharmaceutical, financial, or other types of dictionaries.
  • Memory storage 116 can be any type(s) of memory 10 mentioned earlier regarding the description of FIG. 1. Storage 116 can be used, for example, as temporary storage during processing.
  • Computing device 100 can also comprise additional elements (not shown), such as a printer, modem, network interface, speaker(s), and/or other components or peripherals.
  • FIGS. 3A and 3B together illustrate a flow diagram of a method 200 of creating a syllabic document database, which method is implemented in a computer system, in a computer network, or in a computational machine operating under control of instructions residing on a machine-accessible medium, in accordance with various embodiments of the invention. The method is merely one example of creating a database of analyzed documents, each comprising an ordered sequence of words, with each word having an associated syllable count. Many other implementations are also possible, as will be apparent to those of ordinary skill in the art.
  • In 202, a document is selected for analysis. A document can be of any type such as, but not limited to, any work of authorship as defined in Title 17 (United States Copyright Code) Section 101, “Definitions”, or the legal equivalent. These include literary works, song lyrics, dramatic works, motion picture scripts, and audiovisual scripts. Documents can also include uncopyrightable works of any type, as well as works on which copyrights have expired.
  • In one embodiment, the inventive subject matter is used to identify a song title based on a search string that includes nothing more than an ordered sequence of syllable counts from the song lyric. In another embodiment, one or more known words can also be input as part of the ordered sequence of syllable counts, in place of the syllable count(s) for such known word(s). In yet another embodiment, the inventive subject matter is used to identify a movie or television episode based upon a search string from the dialog. In further embodiments, a search string can be used to identify a speech from a database containing speech documents; to retrieve a poem title from a database of poetry; to identify a book title from a quotation; and the like.
  • A document can also be synonymous with a file, such as a file containing text, an email, a message to a pager or cellular telephone, an Internet page, etc.
  • Each document can be assigned a unique document identifier, such as a number or alphanumeric label.
  • In 204, the first phrase in the document is analyzed. In some embodiments, a document may comprise only a single phrase. In other embodiments wherein the document comprises a plurality of phrases, the document may be parsed into a number of phrases in any suitable manner. For example, if the document is a song lyric, each line can be considered a phrase; alternatively, each stanza and the chorus can be considered a phrase. If the document is a book, each sentence can be considered a phrase. Each phrase can be assigned a sequential phrase number, if desired, to identify the order of the phrase within the document.
  • In 206, the first word in the first phrase is analyzed. The word's numerical position within the phrase is recorded. This can be done, for example, by assigning a word sequence number, e.g. the number “1”, to the first word, and so forth for successive words.
  • The first word is looked up in the syllable dictionary to obtain the number of syllables, and this number is recorded for the first word.
  • A database record is constructed that may include the following recorded information: the document identifier, the phrase number, the word sequence number within the phrase, the text value of the word, and the syllable count for the word. Each record comprises an ordered listing of words, and an ordered syllable count listing.
  • In 208, the process increments to the next word when analysis has been completed on the preceding word.
  • In 210, a determination is made whether all words in the phrase have been analyzed. If so, the process goes to 212; otherwise, it returns to 206.
  • In 212, the process increments to the next phrase when analysis has been completed on the preceding phrase.
  • In 214, a determination is made whether all phrases in the document have been analyzed. If so, the process goes to 216; otherwise, it returns to 204.
  • In 216, the database record for this particular document is closed.
  • In 218, the process increments to the next document.
  • In 220, a determination is made whether all documents requiring analysis have been analyzed. If so, the process finishes at 222; otherwise, it returns to 202.
  • FIG. 4 illustrates a flow diagram of a method 300 of performing a syllabic search based upon user input, which method is implemented in a computer system, in a computer network, or in a computational machine operating under control of instructions residing on a machine-accessible medium, in accordance with various embodiments of the invention. The method is merely one example of utilizing a user-provided search string, comprising an ordered sequence of syllable counts, and optionally one or more known words, to search a document database to retrieve the identification of a desired document. Many other implementations are also possible, as will be apparent to those of ordinary skill in the art.
  • In 302, the user input sequence or search string is obtained. The user input sequence comprises a logical ordering of the user's estimate of the syllable count for each unknown word in the desired phrase and/or document. The user input sequence can optionally also include one or more known words of the phrase and/or document. An example of this is provided regarding FIG. 5 to be discussed below.
  • Still with reference to FIG. 4, in 304, the user input sequence or search string is compared with the contents of the document database that was created using, for example, the process illustrated in FIGS. 3A and 3B. A suitable algorithm can be used to perform this match, such as one or more pattern-matching algorithms. In this manner, an identical match may be found. Or, if an identical match is not found, one or more documents may be identified having similar but not identical search strings to the user's input, enabling the user to evaluate which, if any, satisfy the user's search.
  • In one embodiment, the algorithm can include a checksum calculation with respect to the number of syllables in the user input sequence (i.e. the total number of syllables in the known words and in the syllable counts for unknown words), in order to cull out retrieved search strings having a syllable count exceeding a given margin of error. For example, if the checksum count of the number of syllables in the user input sequence is 25, and if the predetermined allowable margin of error is set at plus or minus 10 syllables, then retrieved search strings having fewer than 15 or more than 35 syllables, for example, could be automatically eliminated and not displayed to the user.
  • In 306, the identity of the retrieved document(s) is displayed on a display of the computing device. If a number of potential “hits” have been retrieved, a suitable algorithm can prioritize them according to the degree of match achieved, and a list of the best-matched hits can be displayed. The process ends at 308.
  • FIG. 5 illustrates a word sequence, a corresponding syllabic sequence, and an example of a syllabic search sequence, in accordance with various embodiments of the invention. In the example illustrated in FIG. 5, an exemplary word sequence 402 is a phrase or stanza from a song lyric: “What the innocent feel, they never care to reveal”.
  • A corresponding syllabic sequence 404 is shown by the numerical sequence: 1, 1, 3, 1, 1, 2, 1, 1, 2. Each number represents the syllable count for the corresponding word in word sequence 402. For example, the word “What” has 1 syllable; the word “the” has 1 syllable; the word “innocent” has 3 syllables; and so on.
  • An example of a user search string or search sequence 406 is the sequence: what, ?, ???, ?, they, ??, ?, to, ??, wherein each “?” represents the number of syllables perceived by the user. If the user were unable to understand any word of the song phrase, the user could input just a syllable sequence, e.g. ?, ?, ???, ?, ?, ??, ?, ?, ??.
  • Any suitable user interface can be used to interpret the user search string and to convert it into a digital format that is suitable for conducting a comparison search on the document database.
  • By comparing sequences of syllable counts, possibly interspersed with known words, a relatively small number of numerical digits in a sequence, or possibly a single-digit total, can be used to locate a matching sequence without the expenditure of large amounts of processing power and/or time that would be required to compare strings of alphanumeric characters.
  • Moreover, the inventive subject matter is tolerant of missing words, groups of words, or of all words in a user's search string, provided that every missing word is replaced with a corresponding syllable count in the same order as the word appears within the phrase or document. If a conventional search engine were used on only easily understood words, such as “Mississippi”, “computer”, “music”, it would return an enormous number of potential “hits”, whereas the inventive subject matter can narrow the number of hits considerably if syllable counts are substituted for unknown words.
  • The inventive subject matter also is tolerant of misspelled words in the user search string. A conventional search engine can retrieve a large quantity of false hits if a search string contains a misspelled word, whereas the inventive subject matter need not give significant weight to misspelled words and can give proportionally greater weight to sequences of syllable counts. In addition, the user can substitute a syllable count for any word whose spelling the user is uncertain about.
  • In addition, the inventive subject matter could be used as a supplement or background to a conventional search engine, in order to improve the quality of its searches.
  • It should be understood that the operations shown in FIGS. 3A, 3B, and 4 are merely representative and not exclusive, and that many other different alternative operations could be implemented using the concepts taught by the inventive subject matter.
  • The operations described above with respect to the methods illustrated in FIGS. 3A, 3B, and 4 can be performed in a different order from those described herein. Also, it will be understood that although the methods are described as having an “end”, they could be continuously performed.
  • Implementing a syllabic search engine is carried out by suitable instructions in one or more computer programs that are stored in and executed by one or more devices 2, 26, and 28 in FIG. 1. One of ordinary skill in the art is capable of writing suitable instructions to implement the objectives and features of the inventive subject matter as described herein.
  • The inventive subject matter provides a syllabic search function for locating desired documents within computing devices and/or computer networks. Various embodiments have been illustrated and described herein. According to one embodiment, an improved search engine, for a computing device or computer network, utilizes a search string comprising, in addition to known words, numbers representing a syllable count for each unknown or uncertain word. One or more pattern-matching algorithms are utilized to search a document database for a document that matches the input search string.
  • The document database is constructed by analyzing a number of documents, utilizing document-analyzing algorithms. In one embodiment, each database record comprises a document that has been analyzed into one or more groups of word sequences. Each word sequence comprises an ordered list of words in the word sequence, as well as a corresponding ordered list of the syllable count for each word in the word sequence. A syllabic search engine can be implemented in different ways, such as in a software application, in an operating system, through network software, or in a custom software module. Improved computers and computer networks for providing a syllabic search function have also been described.
  • Search engines built in accordance with the inventive subject matter can effectively and quickly locate documents which otherwise could not be located, and they can significantly increase the commercial value of computer software, computer systems, and/or computer networks in which they are featured.
  • Other embodiments will be readily apparent to those of ordinary skill in the art. The elements, architecture, and sequence of operations can all be varied to suit particular data security requirements.
  • The various elements depicted in the drawings are merely representational and are not drawn to scale. Certain proportions thereof may be exaggerated, while others may be minimized. The drawings are intended to illustrate various implementations of the inventive subject matter, which can be understood and appropriately carried out by those of ordinary skill in the art.
  • Having described and illustrated the principles of the inventive subject matter with reference to illustrated embodiments, it will be recognized that the illustrated embodiments can be modified in arrangement and detail without departing from such principles. And, though the foregoing discussion has focused on particular embodiments, other configurations are contemplated. In particular, even though expressions such as “in one embodiment”, “in another embodiment”, or the like are used herein, these phrases are meant to generally reference embodiment possibilities, and they are not intended to limit the inventive subject matter to particular embodiment configurations. As used herein, these terms may reference the same or different embodiments that are combinable into other embodiments. Consequently, in view of the wide variety of permutations to the embodiments described herein, this detailed description is intended to be illustrative only, and it should not be taken as limiting the scope of the inventive subject matter.
  • Although specific embodiments have been illustrated and described herein, it will be appreciated by those of ordinary skill in the art that any arrangement or process that is calculated to achieve the same purpose may be substituted for the specific embodiments shown. This application is intended to cover any adaptations or variations of the inventive subject matter. Therefore, it is manifestly intended that embodiments of this inventive subject matter be limited only by the claims and the equivalents thereof.

Claims (18)

1. A method comprising:
analyzing a document, with a computing device, into an ordered sequence of words, each word having an associated syllable count; and
storing the ordered sequence of words and the associated syllable counts in a document database.
2. The method recited in claim 1 wherein, in analyzing, a syllable dictionary is utilized to provide a syllable count for each word.
3. The method recited in claim 1 wherein, in analyzing, the document is parsed into a number of phrases.
4. The method recited in claim 1 wherein, in storing, the database comprises a plurality of records, each comprising an ordered listing of words and an ordered syllable count listing.
5. The method recited in claim 4 wherein, in storing, each database record comprises a work from the group comprising a literary work, a song lyric, a dramatic work, a motion picture script, and an audiovisual script.
6. The method recited in claim 1 wherein, in storing, the ordered sequence of words and the associated syllable counts are stored as an ordered sequence of syllable counts.
7. A computing device including a memory to store a database, the computing device executing a computer program comprising the operations of:
analyzing a document into an ordered sequence of words, each word having an associated syllable count; and
storing the ordered sequence of words and the associated syllable counts in the database.
8. The computing device recited in claim 7 wherein, in analyzing, a syllable dictionary is utilized to provide a syllable count for each word.
9. The computing device recited in claim 7 wherein, in analyzing, the document is parsed into a number of phrases.
10. The computing device recited in claim 7 wherein, in storing, the database comprises a plurality of records, each comprising an ordered listing of words and an ordered syllable count listing.
11. The computing device recited in claim 10 wherein, in storing, each database record comprises a work from the group comprising a literary work, a song lyric, a dramatic work, a motion picture script, and an audiovisual script.
12. The computing device recited in claim 7 wherein, in storing, the ordered sequence of words and the associated syllable counts are stored as an ordered sequence of syllable counts.
13. An article comprising a non-transitory machine-accessible medium having associated instructions, wherein the instructions, when accessed, result in a machine performing:
analyzing a document into an ordered sequence of words, each word having an associated syllable count; and
storing the ordered sequence of words and the associated syllable counts in a database.
14. The article of claim 13 wherein, in analyzing, a syllable dictionary is utilized to provide a syllable count for each word.
15. The article of claim 13 wherein, in analyzing, the document is parsed into a number of phrases.
16. The article recited in claim 13 wherein, in storing, the database comprises a plurality of records, each comprising an ordered listing of words and an ordered syllable count listing.
17. The article recited in claim 16 wherein, in storing, each database record comprises a work from the group comprising a literary work, a song lyric, a dramatic work, a motion picture script, and an audiovisual script.
18. The article recited in claim 13 wherein, in storing, the ordered sequence of words and the associated syllable counts are stored as an ordered sequence of syllable counts.
US12/904,821 2001-12-12 2010-10-14 Syllabic search engines and related methods Abandoned US20110029545A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US12/904,821 US20110029545A1 (en) 2001-12-12 2010-10-14 Syllabic search engines and related methods

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/020,483 US7836049B2 (en) 2001-12-12 2001-12-12 Syllabic search engines and related methods
US12/904,821 US20110029545A1 (en) 2001-12-12 2010-10-14 Syllabic search engines and related methods

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US10/020,483 Division US7836049B2 (en) 2001-12-12 2001-12-12 Syllabic search engines and related methods

Publications (1)

Publication Number Publication Date
US20110029545A1 true US20110029545A1 (en) 2011-02-03

Family

ID=21798857

Family Applications (2)

Application Number Title Priority Date Filing Date
US10/020,483 Expired - Fee Related US7836049B2 (en) 2001-12-12 2001-12-12 Syllabic search engines and related methods
US12/904,821 Abandoned US20110029545A1 (en) 2001-12-12 2010-10-14 Syllabic search engines and related methods

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US10/020,483 Expired - Fee Related US7836049B2 (en) 2001-12-12 2001-12-12 Syllabic search engines and related methods

Country Status (1)

Country Link
US (2) US7836049B2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100306260A1 (en) * 2009-05-29 2010-12-02 Xerox Corporation Number sequences detection systems and methods
US10211970B2 (en) 2017-03-31 2019-02-19 Intel Corporation Millimeter wave CMOS engines for waveguide fabrics

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2003250669A1 (en) * 2002-07-23 2004-02-09 Research In Motion Limted Systems and methods of building and using custom word lists
US8069159B2 (en) * 2004-09-07 2011-11-29 Robert O Stuart More efficient search algorithm (MESA) using prioritized search sequencing
US7792808B2 (en) * 2004-09-07 2010-09-07 Stuart Robert O More efficient search algorithm (MESA) using virtual search parameters
KR20090001625A (en) * 2007-05-04 2009-01-09 주식회사 메디슨 System and method for providing patient information
US20100153425A1 (en) * 2008-12-12 2010-06-17 Yury Tulchinsky Method for Counting Syllables in Readability Software
US8768852B2 (en) * 2009-01-13 2014-07-01 Amazon Technologies, Inc. Determining phrases related to other phrases
US8706643B1 (en) 2009-01-13 2014-04-22 Amazon Technologies, Inc. Generating and suggesting phrases
US8706644B1 (en) 2009-01-13 2014-04-22 Amazon Technologies, Inc. Mining phrases for association with a user
US8423349B1 (en) * 2009-01-13 2013-04-16 Amazon Technologies, Inc. Filtering phrases for an identifier
US9569770B1 (en) 2009-01-13 2017-02-14 Amazon Technologies, Inc. Generating constructed phrases
US9298700B1 (en) 2009-07-28 2016-03-29 Amazon Technologies, Inc. Determining similar phrases
US10007712B1 (en) 2009-08-20 2018-06-26 Amazon Technologies, Inc. Enforcing user-specified rules
US8799658B1 (en) 2010-03-02 2014-08-05 Amazon Technologies, Inc. Sharing media items with pass phrases
US11423029B1 (en) 2010-11-09 2022-08-23 Google Llc Index-side stem-based variant generation
US8375042B1 (en) 2010-11-09 2013-02-12 Google Inc. Index-side synonym generation
US10235680B2 (en) * 2010-11-12 2019-03-19 Maritz Holdings Inc. System and method for populating a database with user input
US9037591B1 (en) * 2012-04-30 2015-05-19 Google Inc. Storing term substitution information in an index

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5596754A (en) * 1992-10-29 1997-01-21 Digital Equipment Corporation Method for performing private lock management
US5765152A (en) * 1995-10-13 1998-06-09 Trustees Of Dartmouth College System and method for managing copyrighted electronic media
US5832478A (en) * 1997-03-13 1998-11-03 The United States Of America As Represented By The National Security Agency Method of searching an on-line dictionary using syllables and syllable count
US5991756A (en) * 1997-11-03 1999-11-23 Yahoo, Inc. Information retrieval from hierarchical compound documents
US20020072907A1 (en) * 2000-10-19 2002-06-13 Case Eliot M. System and method for converting text-to-voice
US6408266B1 (en) * 1997-04-01 2002-06-18 Yeong Kaung Oon Didactic and content oriented word processing method with incrementally changed belief system
US6502136B1 (en) * 1994-03-24 2002-12-31 Hitachi, Ltd. Exclusive control method with each node controlling issue of an exclusive use request to a shared resource, a computer system therefor and a computer system with a circuit for detecting writing of an event flag into a shared main storage
US6629092B1 (en) * 1999-10-13 2003-09-30 Andrew Berke Search engine
US6687689B1 (en) * 2000-06-16 2004-02-03 Nusuara Technologies Sdn. Bhd. System and methods for document retrieval using natural language-based queries
US6689946B2 (en) * 2000-04-25 2004-02-10 Yamaha Corporation Aid for composing words of song
US6697901B1 (en) * 2000-10-24 2004-02-24 Oracle International Corporation Using secondary resource masters in conjunction with a primary resource master for managing resources that are accessible to a plurality of entities
US6862568B2 (en) * 2000-10-19 2005-03-01 Qwest Communications International, Inc. System and method for converting text-to-voice
US20070061348A1 (en) * 2001-04-19 2007-03-15 International Business Machines Corporation Method and system for identifying relationships between text documents and structured variables pertaining to the text documents

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5596754A (en) * 1992-10-29 1997-01-21 Digital Equipment Corporation Method for performing private lock management
US6502136B1 (en) * 1994-03-24 2002-12-31 Hitachi, Ltd. Exclusive control method with each node controlling issue of an exclusive use request to a shared resource, a computer system therefor and a computer system with a circuit for detecting writing of an event flag into a shared main storage
US5765152A (en) * 1995-10-13 1998-06-09 Trustees Of Dartmouth College System and method for managing copyrighted electronic media
US5832478A (en) * 1997-03-13 1998-11-03 The United States Of America As Represented By The National Security Agency Method of searching an on-line dictionary using syllables and syllable count
US6408266B1 (en) * 1997-04-01 2002-06-18 Yeong Kaung Oon Didactic and content oriented word processing method with incrementally changed belief system
US5991756A (en) * 1997-11-03 1999-11-23 Yahoo, Inc. Information retrieval from hierarchical compound documents
US6629092B1 (en) * 1999-10-13 2003-09-30 Andrew Berke Search engine
US6689946B2 (en) * 2000-04-25 2004-02-10 Yamaha Corporation Aid for composing words of song
US6687689B1 (en) * 2000-06-16 2004-02-03 Nusuara Technologies Sdn. Bhd. System and methods for document retrieval using natural language-based queries
US20020072907A1 (en) * 2000-10-19 2002-06-13 Case Eliot M. System and method for converting text-to-voice
US6862568B2 (en) * 2000-10-19 2005-03-01 Qwest Communications International, Inc. System and method for converting text-to-voice
US6697901B1 (en) * 2000-10-24 2004-02-24 Oracle International Corporation Using secondary resource masters in conjunction with a primary resource master for managing resources that are accessible to a plurality of entities
US20070061348A1 (en) * 2001-04-19 2007-03-15 International Business Machines Corporation Method and system for identifying relationships between text documents and structured variables pertaining to the text documents

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100306260A1 (en) * 2009-05-29 2010-12-02 Xerox Corporation Number sequences detection systems and methods
US9135249B2 (en) * 2009-05-29 2015-09-15 Xerox Corporation Number sequences detection systems and methods
US10211970B2 (en) 2017-03-31 2019-02-19 Intel Corporation Millimeter wave CMOS engines for waveguide fabrics

Also Published As

Publication number Publication date
US7836049B2 (en) 2010-11-16
US20030120846A1 (en) 2003-06-26

Similar Documents

Publication Publication Date Title
US20110029545A1 (en) Syllabic search engines and related methods
US7921116B2 (en) Highly meaningful multimedia metadata creation and associations
US8073877B2 (en) Scalable semi-structured named entity detection
TWI496012B (en) Modular system and method for managing chinese, japanese, and korean linguistic data in electronic form
US10552467B2 (en) System and method for language sensitive contextual searching
US20100017390A1 (en) Apparatus, method and program product for presenting next search keyword
EP2519896A2 (en) Search suggestion clustering and presentation
WO2005124599A2 (en) Content search in complex language, such as japanese
AU2014285073B9 (en) Method and system for simplifying implicit rhetorical relation prediction in large scale annotated corpus
Alexander et al. Metaphor, popular science, and semantic tagging: Distant reading with the Historical Thesaurus of English
JP3545824B2 (en) Data retrieval device
CN101689198A (en) Phonetic search using normalized string
Dias et al. The impact of semantic annotation techniques on content-based video lecture recommendation
US8046361B2 (en) System and method for classifying tags of content using a hyperlinked corpus of classified web pages
Varol et al. Detecting near-duplicate text documents with a hybrid approach
Anders et al. Online vs. CD-ROM--The Impact of CD-ROM Databases upon a Large Online Searching Program.
Pan et al. Dmdd: A large-scale dataset for dataset mentions detection
CN112802454A (en) Method and device for recommending awakening words, terminal equipment and storage medium
Grishman Information extraction and speech recognition
Baldwin et al. Restoring punctuation and casing in English text
Kumar et al. A transformer based encodings for detection of semantically equivalent questions in cQA
Eichmann et al. A cluster-based approach to broadcast news
Agbogun et al. On the Development of Machine Learning Algorithms for Information Extraction of Structured Academic Data from Unstructured Web Documents
Corazza et al. Computation of upper-bounds for stochastic context-free languages
Porcaro Information extraction from user-generated content in the classical music domain

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION