WO2004030341A1 - Fingerprint extraction - Google Patents
Fingerprint extraction Download PDFInfo
- Publication number
- WO2004030341A1 WO2004030341A1 PCT/IB2003/003658 IB0303658W WO2004030341A1 WO 2004030341 A1 WO2004030341 A1 WO 2004030341A1 IB 0303658 W IB0303658 W IB 0303658W WO 2004030341 A1 WO2004030341 A1 WO 2004030341A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sequence
- auto
- property
- samples
- signal
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H1/00—Details of electrophonic musical instruments
- G10H1/0033—Recording/reproducing or transmission of music for electrophonic musical instruments
- G10H1/0041—Recording/reproducing or transmission of music for electrophonic musical instruments in coded form
- G10H1/0058—Transmission between separate instruments or between individual components of a musical system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N1/00—Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
- H04N1/32—Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N1/00—Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
- H04N1/32—Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
- H04N1/32101—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
- H04N1/32106—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title separate from the image data, e.g. in a different computer file
- H04N1/32122—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title separate from the image data, e.g. in a different computer file in a separate device, e.g. in a memory or on a display separate from image data
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2250/00—Aspects of algorithms or signal processing methods without intrinsic musical character, yet specifically adapted for or used in electrophonic musical processing
- G10H2250/131—Mathematical functions for musical analysis, processing, synthesis or composition
- G10H2250/135—Autocorrelation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2250/00—Aspects of algorithms or signal processing methods without intrinsic musical character, yet specifically adapted for or used in electrophonic musical processing
- G10H2250/131—Mathematical functions for musical analysis, processing, synthesis or composition
- G10H2250/161—Logarithmic functions, scaling or conversion, e.g. to reflect human auditory perception of loudness or frequency
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2250/00—Aspects of algorithms or signal processing methods without intrinsic musical character, yet specifically adapted for or used in electrophonic musical processing
- G10H2250/131—Mathematical functions for musical analysis, processing, synthesis or composition
- G10H2250/215—Transforms, i.e. mathematical transforms into domains appropriate for musical signal processing, coding or compression
- G10H2250/235—Fourier transform; Discrete Fourier Transform [DFT]; Fast Fourier Transform [FFT]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/018—Audio watermarking, i.e. embedding inaudible data in the audio signal
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N2201/00—Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
- H04N2201/32—Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
- H04N2201/3201—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
- H04N2201/3225—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title of data relating to an image, a page or a document
- H04N2201/3233—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title of data relating to an image, a page or a document of authentication information, e.g. digital signature, watermark
- H04N2201/3235—Checking or certification of the authentication information, e.g. by comparison with data stored independently
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N2201/00—Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
- H04N2201/32—Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
- H04N2201/3201—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
- H04N2201/3225—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title of data relating to an image, a page or a document
- H04N2201/3233—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title of data relating to an image, a page or a document of authentication information, e.g. digital signature, watermark
- H04N2201/3236—Details of authentication information generation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N2201/00—Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
- H04N2201/32—Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
- H04N2201/3201—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
- H04N2201/3274—Storage or retrieval of prestored additional information
Definitions
- the invention relates to a method and arrangement for extracting a fingerprint from a media signal.
- a fingerprint also often referred to as signature or hash, is a sequence of bits that is derived from multimedia content, e.g. an audio song, an image, a video clip, etc.
- Multimedia fingerprints are used, inter alia, in the field of authentication where it is desired to verify whether received content is original or detect whether the content has been tampered with. Fingerprints are also used to identify media content.
- a service that is likely to become very popular in the near future is audio identification.
- a fingerprint being derived from an unknown piece of music is sent to a database where the title, artist and other metadata is looked up and returned to the consumer.
- FIG. 1 A known method of extracting a fingerprint from a media signal is disclosed in Applicant's International Patent Application WO 02/065782.
- FIG. 1 A schematic diagram of this prior-art method is shown in Fig. 1.
- the media signal here an audio song
- a spectral representation of each frame is obtained by performing a Fast Fourier Transform (102).
- the energy of the audio signal in 33 logarithmically spaced sub-bands is subsequently computed (103).
- the bands lie in the range of 300-2000Hz which is perceptually the most relevant range.
- the 33 energy levels constitute a sequence of perceptual property samples of the respective audio signal frame.
- a simple 2-dimensional filter (104) is applied to the spectrogram prior to obtaining 32 differential property samples.
- the sequence is subsequently converted into a bit string by an appropriate thresholding operation (105). More particularly, a sub-band in a particular frame is assigned a bit ' 1' if the energy difference with its neighboring sub-band is larger than the energy difference with its neighboring sub-band in the previous frame. Otherwise, the fingerprint bit is '0'.
- the known method produces a string of 32 bits for each audio frame ( —0.4 sec).
- the frames are preferably overlapping (e.g.
- the prior-art fingerprint extraction method has turned out to be very robust against almost all commonly used audio processing steps such as MP3 encoding, sample rate conversion, D/A and A D conversion, equalization. However, it is not very robust against speed changes. It is quite common for radio stations to speed up audio by a few percent. They supposedly do this for two reasons. First, the duration of songs is then shorter and therefore it enables them to broadcast more commercials. Secondly, the beat of the song is faster and listeners seem to prefer this. The speed changes typically lie between zero and four percent.
- the method according to the invention comprises the steps of deriving from said media signal a sequence of samples of a given perceptual property of the signal; subjecting the sequence of property samples to an auto-correlation function to obtain a sequence of auto-correlation values; comparing said auto-correlation values with respective thresholds; and representing the results of said comparisons by respective bits of the fingerprint.
- the method according to the invention differs from the prior-art method in that the fingerprint bits are not derived from the perceptual property of the signal as such, but from the auto-correlation of said property.
- the invention is based on the recognition that a speed change of an audio signal causes energy levels in sub-bands to be shifted from one sub- band to another, and exploits the insight that the auto-correlation function is shift invariant.
- the auto-correlation function is well-known in the continuous (time) domain.
- the desired auto-correlation is approximated by correlating a sub-sequence of property samples with the complete sequence of property samples.
- the auto-correlation function is preferably computed from a statistically significant number of property samples, which is larger than the desired number of fingerprint bits. Down-sampling of the computed auto-correlation function is provided to obtain the desired number of auto-correlation values.
- Fig. 1 shows schematically a prior-art arrangement for extracting a fingerprint from an audio signal.
- Fig. 2 shows schematically an arrangement for extracting a fingerprint from an audio signal according to the invention.
- Speed changes of an audio signal cause misalignment in both the temporal and frequency domain.
- an audio excerpt subjected to a speed change of, say, 2% causes the 250 th fingerprint of this excerpt to be extracted at the position of the 255 finge ⁇ rint of the original excerpt.
- the fingerprints are constructed in such a way that they possess correlation along the time-axis. Therefore, the BER (bit error rate) between the original excerpt and the same excerpt with a speed change does not increase dramatically due to the temporal misalignment.
- a 2% speedup will result in a scaling of the frequency axis of the spectrum that is obtained with the Fourier Transform. For example, a tone of 500Hz then results in a tone of 510Hz and a tone of 1000Hz results in a tone of 1020Hz.
- the energy in logarithmically spaced bands is determined. Since the bands are logarithmically spaced, the speed change results in a shift of energy from one band to the next. The more energy that shifts from one band to the next, the greater the probability that the extracted fingerprint bits are erroneous. This is due to the fact that the fingerprint bits are determined by energy differences of neighboring bands.
- the brute force approach consists of storing fingerprints extracted at multiple speeds in the database, or querying the database with fingerprints that are extracted at multiple speeds.
- the disadvantage of this method is that the search speed and/or storage requirements increase by a factor N, where N is the number of different speeds that is necessary for a certain application.
- Fig. 2 shows an arrangement for extracting a fingerprint from an audio signal according to the invention.
- the same reference numerals are used for functions that are identical with or similar to the steps that have already been discussed with reference to Fig. 1. More particularly, the audio signal is divided into overlapping frames (101) and the spectrum of each frame is computed (102).
- An auto-correlation step (202) is the fundamental step to achieve the better speed-change resilience.
- a speed change results in a shift of the computed energy vector.
- Auto-correlation has the property that it is shift-invariant.
- the energy of 512 sub-bands is computed (201) instead of 33.
- the bands are still logarithmic and still lie in the range of 300 to 2000Hz. Thus the bands have a smaller width.
- the auto-correlation is approximated by correlating a sub-sequence of energies with the complete sequence. More specifically, the auto-correlation p[x] is calculated from the sub-band energy samples E(j) as follows:
- M denotes the length of the sub-sequence
- K denotes the position where the sub-sequence starts in the complete sequence.
- Typical settings for M and K are 64 and 96, respectively.
- the resulting auto-correlation values are optionally low-pass filtered (203).
- the 448 auto-correlation values are down-sampled to 33 values in a down-sampler (204).
- the resulting fingerprint is a 32-bit string for each frame.
- each image is now divided into a larger number of blocks, and a sub-set of the blocks (a "super-block") is correlated with the whole image for a number of positions of said super-block.
- the obtained sequence of auto-correlation values is invariant to shifts of the video image.
- the sequence is optionally low-pass filtered and subsequently down-sampled.
- Finge ⁇ rints are bit strings extracted from a media signal (e.g. an audio or video clip) to identify said media signal.
- a media signal e.g. an audio or video clip
- they are derived from a perceptual property of the signal, for example, the spectral energy distribution of an audio fragment or the luminance distribution of a video image.
- a method and arrangement for extracting a finge ⁇ rint is here disclosed which is robust with respect to shifts of the perceptual property. Such shifts occur, inter alia, when the finge ⁇ rint is derived from a logarithmically mapped spectral energy distribution of an audio signal and said audio signal is subjected to speed changes.
- the finge ⁇ rint is not derived from the perceptual property as such, but from its auto-correlation function.
Abstract
Description
Claims
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2004539277A JP2006501498A (en) | 2002-09-30 | 2003-08-11 | Fingerprint extraction |
DE60326743T DE60326743D1 (en) | 2002-09-30 | 2003-08-11 | FINGERPRINT EXTRACTION |
AU2003259400A AU2003259400A1 (en) | 2002-09-30 | 2003-08-11 | Fingerprint extraction |
EP03798257A EP1550297B1 (en) | 2002-09-30 | 2003-08-11 | Fingerprint extraction |
US10/529,360 US20060041753A1 (en) | 2002-09-30 | 2003-08-11 | Fingerprint extraction |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP02079072 | 2002-09-30 | ||
EP02079072.1 | 2002-09-30 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2004030341A1 true WO2004030341A1 (en) | 2004-04-08 |
Family
ID=32039182
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/IB2003/003658 WO2004030341A1 (en) | 2002-09-30 | 2003-08-11 | Fingerprint extraction |
Country Status (9)
Country | Link |
---|---|
US (1) | US20060041753A1 (en) |
EP (1) | EP1550297B1 (en) |
JP (1) | JP2006501498A (en) |
KR (1) | KR20050046815A (en) |
CN (1) | CN1685703A (en) |
AT (1) | ATE426297T1 (en) |
AU (1) | AU2003259400A1 (en) |
DE (1) | DE60326743D1 (en) |
WO (1) | WO2004030341A1 (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2846227A3 (en) * | 2013-09-06 | 2015-04-29 | Immersion Corporation | Systems and methods for generating haptic effects associated with transitions in audio signals |
US9576445B2 (en) | 2013-09-06 | 2017-02-21 | Immersion Corp. | Systems and methods for generating haptic effects associated with an envelope in audio signals |
US9619980B2 (en) | 2013-09-06 | 2017-04-11 | Immersion Corporation | Systems and methods for generating haptic effects associated with audio signals |
US9652945B2 (en) | 2013-09-06 | 2017-05-16 | Immersion Corporation | Method and system for providing haptic effects based on information complementary to multimedia content |
US10572896B2 (en) | 2004-05-27 | 2020-02-25 | Anonymous Media Research LLC | Media usage monitoring and measurement system and method |
Families Citing this family (50)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7362775B1 (en) * | 1996-07-02 | 2008-04-22 | Wistaria Trading, Inc. | Exchange mechanisms for digital information packages with bandwidth securitization, multichannel digital watermarks, and key management |
US5613004A (en) | 1995-06-07 | 1997-03-18 | The Dice Company | Steganographic method and device |
US7664263B2 (en) | 1998-03-24 | 2010-02-16 | Moskowitz Scott A | Method for combining transfer functions with predetermined key creation |
US6205249B1 (en) * | 1998-04-02 | 2001-03-20 | Scott A. Moskowitz | Multiple transform utilization and applications for secure digital watermarking |
US7159116B2 (en) | 1999-12-07 | 2007-01-02 | Blue Spike, Inc. | Systems, methods and devices for trusted transactions |
US7457962B2 (en) | 1996-07-02 | 2008-11-25 | Wistaria Trading, Inc | Optimization methods for the insertion, protection, and detection of digital watermarks in digitized data |
US7095874B2 (en) * | 1996-07-02 | 2006-08-22 | Wistaria Trading, Inc. | Optimization methods for the insertion, protection, and detection of digital watermarks in digitized data |
US5889868A (en) * | 1996-07-02 | 1999-03-30 | The Dice Company | Optimization methods for the insertion, protection, and detection of digital watermarks in digitized data |
US7177429B2 (en) | 2000-12-07 | 2007-02-13 | Blue Spike, Inc. | System and methods for permitting open access to data objects and for securing data within the data objects |
US7346472B1 (en) * | 2000-09-07 | 2008-03-18 | Blue Spike, Inc. | Method and device for monitoring and analyzing signals |
US7730317B2 (en) | 1996-12-20 | 2010-06-01 | Wistaria Trading, Inc. | Linear predictive coding implementation of digital watermarks |
US7664264B2 (en) | 1999-03-24 | 2010-02-16 | Blue Spike, Inc. | Utilizing data reduction in steganographic and cryptographic systems |
US7475246B1 (en) | 1999-08-04 | 2009-01-06 | Blue Spike, Inc. | Secure personal content server |
EP2136306B1 (en) | 2000-08-23 | 2013-03-20 | Gracenote, Inc. | Method and system for obtaining information |
US7127615B2 (en) | 2000-09-20 | 2006-10-24 | Blue Spike, Inc. | Security based on subliminal and supraliminal channels for data objects |
DE60228202D1 (en) * | 2001-02-12 | 2008-09-25 | Gracenote Inc | METHOD FOR GENERATING AN IDENTIFICATION HASH FROM THE CONTENTS OF A MULTIMEDIA FILE |
US7020304B2 (en) * | 2002-01-22 | 2006-03-28 | Digimarc Corporation | Digital watermarking and fingerprinting including synchronization, layering, version control, and compressed embedding |
KR20040086350A (en) | 2002-02-05 | 2004-10-08 | 코닌클리케 필립스 일렉트로닉스 엔.브이. | Efficient storage of fingerprints |
US7287275B2 (en) | 2002-04-17 | 2007-10-23 | Moskowitz Scott A | Methods, systems and devices for packet watermarking and efficient provisioning of bandwidth |
CN1708758A (en) * | 2002-11-01 | 2005-12-14 | 皇家飞利浦电子股份有限公司 | Improved audio data fingerprint searching |
AU2003274545A1 (en) * | 2002-11-12 | 2004-06-03 | Koninklijke Philips Electronics N.V. | Fingerprinting multimedia contents |
US20070106405A1 (en) * | 2005-08-19 | 2007-05-10 | Gracenote, Inc. | Method and system to provide reference data for identification of digital content |
WO2007053112A1 (en) * | 2005-11-07 | 2007-05-10 | Agency For Science, Technology And Research | Repeat clip identification in video data |
US20080274687A1 (en) | 2007-05-02 | 2008-11-06 | Roberts Dale T | Dynamic mixed media package |
US9154942B2 (en) | 2008-11-26 | 2015-10-06 | Free Stream Media Corp. | Zero configuration communication between a browser and a networked media device |
US9519772B2 (en) | 2008-11-26 | 2016-12-13 | Free Stream Media Corp. | Relevancy improvement through targeting of information based on data gathered from a networked device associated with a security sandbox of a client device |
US10977693B2 (en) | 2008-11-26 | 2021-04-13 | Free Stream Media Corp. | Association of content identifier of audio-visual data with additional data through capture infrastructure |
US10334324B2 (en) | 2008-11-26 | 2019-06-25 | Free Stream Media Corp. | Relevant advertisement generation based on a user operating a client device communicatively coupled with a networked media device |
US10419541B2 (en) | 2008-11-26 | 2019-09-17 | Free Stream Media Corp. | Remotely control devices over a network without authentication or registration |
US8180891B1 (en) | 2008-11-26 | 2012-05-15 | Free Stream Media Corp. | Discovery, access control, and communication with networked services from within a security sandbox |
US10880340B2 (en) | 2008-11-26 | 2020-12-29 | Free Stream Media Corp. | Relevancy improvement through targeting of information based on data gathered from a networked device associated with a security sandbox of a client device |
US9986279B2 (en) | 2008-11-26 | 2018-05-29 | Free Stream Media Corp. | Discovery, access control, and communication with networked services |
US9961388B2 (en) | 2008-11-26 | 2018-05-01 | David Harrison | Exposure of public internet protocol addresses in an advertising exchange server to improve relevancy of advertisements |
US10631068B2 (en) | 2008-11-26 | 2020-04-21 | Free Stream Media Corp. | Content exposure attribution based on renderings of related content across multiple devices |
US10567823B2 (en) | 2008-11-26 | 2020-02-18 | Free Stream Media Corp. | Relevant advertisement generation based on a user operating a client device communicatively coupled with a networked media device |
WO2010135623A1 (en) * | 2009-05-21 | 2010-11-25 | Digimarc Corporation | Robust signatures derived from local nonlinear filters |
KR101884483B1 (en) | 2010-07-21 | 2018-08-01 | 디-박스 테크놀러지스 인코포레이트 | Media recognition and synchronisation to a motion signal |
US10515523B2 (en) | 2010-07-21 | 2019-12-24 | D-Box Technologies Inc. | Media recognition and synchronization to a motion signal |
CN102783034B (en) * | 2011-02-01 | 2014-12-17 | 华为技术有限公司 | Method and apparatus for providing signal processing coefficients |
US9196028B2 (en) | 2011-09-23 | 2015-11-24 | Digimarc Corporation | Context-based smartphone sensor logic |
US8995708B2 (en) * | 2011-09-08 | 2015-03-31 | Samsung Electronics Co., Ltd. | Apparatus and method for robust low-complexity video fingerprinting |
CN103179430A (en) * | 2011-12-20 | 2013-06-26 | 中国电信股份有限公司 | Method, device and server for audio and video content transcoding on basis of cloud computing |
US8989376B2 (en) * | 2012-03-29 | 2015-03-24 | Alcatel Lucent | Method and apparatus for authenticating video content |
US8886635B2 (en) * | 2012-05-23 | 2014-11-11 | Enswers Co., Ltd. | Apparatus and method for recognizing content using audio signal |
CN103455514A (en) * | 2012-06-01 | 2013-12-18 | 腾讯科技(深圳)有限公司 | Updating method and updating device for audio file |
US9153239B1 (en) * | 2013-03-14 | 2015-10-06 | Google Inc. | Differentiating between near identical versions of a song |
CN103501449A (en) * | 2013-10-08 | 2014-01-08 | 十分(北京)信息科技有限公司 | Method and device for recommending video source associated with television program |
NL2012567B1 (en) * | 2014-04-04 | 2016-03-08 | Teletrax B V | Method and device for generating improved fingerprints. |
US20160005410A1 (en) * | 2014-07-07 | 2016-01-07 | Serguei Parilov | System, apparatus, and method for audio fingerprinting and database searching for audio identification |
CN110322886A (en) * | 2018-03-29 | 2019-10-11 | 北京字节跳动网络技术有限公司 | A kind of audio-frequency fingerprint extracting method and device |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5436653A (en) * | 1992-04-30 | 1995-07-25 | The Arbitron Company | Method and system for recognition of broadcast segments |
US6266429B1 (en) * | 1998-09-23 | 2001-07-24 | Philips Electronics North America Corporation | Method for confirming the integrity of an image transmitted with a loss |
WO2002065782A1 (en) * | 2001-02-12 | 2002-08-22 | Koninklijke Philips Electronics N.V. | Generating and matching hashes of multimedia content |
Family Cites Families (65)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4677466A (en) * | 1985-07-29 | 1987-06-30 | A. C. Nielsen Company | Broadcast program identification method and apparatus |
US5019899A (en) * | 1988-11-01 | 1991-05-28 | Control Data Corporation | Electronic data encoding and recognition system |
JP2637816B2 (en) * | 1989-02-13 | 1997-08-06 | パイオニア株式会社 | Information playback device |
JPH05509409A (en) * | 1990-06-21 | 1993-12-22 | レイノルズ ソフトウエア,インコーポレイティド | Wave analysis/event recognition method and device |
US5703795A (en) * | 1992-06-22 | 1997-12-30 | Mankovitz; Roy J. | Apparatus and methods for accessing information relating to radio and television programs |
US7171016B1 (en) * | 1993-11-18 | 2007-01-30 | Digimarc Corporation | Method for monitoring internet dissemination of image, video and/or audio files |
US6546112B1 (en) * | 1993-11-18 | 2003-04-08 | Digimarc Corporation | Security document with steganographically-encoded authentication data |
US5822436A (en) * | 1996-04-25 | 1998-10-13 | Digimarc Corporation | Photographic products and methods employing embedded information |
US6408082B1 (en) * | 1996-04-25 | 2002-06-18 | Digimarc Corporation | Watermark detection using a fourier mellin transform |
US5499294A (en) * | 1993-11-24 | 1996-03-12 | The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration | Digital camera with apparatus for authentication of images produced from an image file |
US5790793A (en) * | 1995-04-04 | 1998-08-04 | Higley; Thomas | Method and system to create, transmit, receive and process information, including an address to further information |
US5616876A (en) * | 1995-04-19 | 1997-04-01 | Microsoft Corporation | System and methods for selecting music on the basis of subjective content |
US5751672A (en) * | 1995-07-26 | 1998-05-12 | Sony Corporation | Compact disc changer utilizing disc database |
US6829368B2 (en) * | 2000-01-26 | 2004-12-07 | Digimarc Corporation | Establishing and interacting with on-line media collections using identifiers in media signals |
US6505160B1 (en) * | 1995-07-27 | 2003-01-07 | Digimarc Corporation | Connected audio and other media objects |
JPH0991434A (en) * | 1995-09-28 | 1997-04-04 | Hamamatsu Photonics Kk | Human body collation device |
US5767893A (en) * | 1995-10-11 | 1998-06-16 | International Business Machines Corporation | Method and apparatus for content based downloading of video programs |
US5893910A (en) * | 1996-01-04 | 1999-04-13 | Softguard Enterprises Inc. | Method and apparatus for establishing the legitimacy of use of a block of digitally represented information |
US6034925A (en) * | 1996-12-02 | 2000-03-07 | Thomson Consumer Electronics, Inc. | Accessing control method for identifying a recording medium in a jukebox |
US5925843A (en) * | 1997-02-12 | 1999-07-20 | Virtual Music Entertainment, Inc. | Song identification and synchronization |
US5987525A (en) * | 1997-04-15 | 1999-11-16 | Cddb, Inc. | Network delivery of interactive entertainment synchronized to playback of audio recordings |
US5960081A (en) * | 1997-06-05 | 1999-09-28 | Cray Research, Inc. | Embedding a digital signature in a video sequence |
US6076104A (en) * | 1997-09-04 | 2000-06-13 | Netscape Communications Corp. | Video data integration system using image data and associated hypertext links |
US6076111A (en) * | 1997-10-24 | 2000-06-13 | Pictra, Inc. | Methods and apparatuses for transferring data between data processing systems which transfer a representation of the data before transferring the data |
US6195693B1 (en) * | 1997-11-18 | 2001-02-27 | International Business Machines Corporation | Method and system for network delivery of content associated with physical audio media |
US6201176B1 (en) * | 1998-05-07 | 2001-03-13 | Canon Kabushiki Kaisha | System and method for querying a music database |
US6226618B1 (en) * | 1998-08-13 | 2001-05-01 | International Business Machines Corporation | Electronic content delivery system |
US8332478B2 (en) * | 1998-10-01 | 2012-12-11 | Digimarc Corporation | Context sensitive connected content |
US6665417B1 (en) * | 1998-12-02 | 2003-12-16 | Hitachi, Ltd. | Method of judging digital watermark information |
US6748533B1 (en) * | 1998-12-23 | 2004-06-08 | Kent Ridge Digital Labs | Method and apparatus for protecting the legitimacy of an article |
US6952774B1 (en) * | 1999-05-22 | 2005-10-04 | Microsoft Corporation | Audio watermarking with dual watermarks |
GB2351405B (en) * | 1999-06-21 | 2003-09-24 | Motorola Ltd | Watermarked digital images |
US7174293B2 (en) * | 1999-09-21 | 2007-02-06 | Iceberg Industries Llc | Audio identification system and method |
US6941275B1 (en) * | 1999-10-07 | 2005-09-06 | Remi Swierczek | Music identification system |
US6737957B1 (en) * | 2000-02-16 | 2004-05-18 | Verance Corporation | Remote control signaling using audio watermarks |
JP2001275115A (en) * | 2000-03-23 | 2001-10-05 | Nec Corp | Electronic watermark data insertion device and detector |
US6963975B1 (en) * | 2000-08-11 | 2005-11-08 | Microsoft Corporation | System and method for audio fingerprinting |
US6990453B2 (en) * | 2000-07-31 | 2006-01-24 | Landmark Digital Services Llc | System and methods for recognizing sound and music signals in high noise and distortion |
JP2002049631A (en) * | 2000-08-01 | 2002-02-15 | Sony Corp | Information providing device, method and recording medium |
EP2136306B1 (en) * | 2000-08-23 | 2013-03-20 | Gracenote, Inc. | Method and system for obtaining information |
US6674876B1 (en) * | 2000-09-14 | 2004-01-06 | Digimarc Corporation | Watermarking in the time-frequency domain |
US6748360B2 (en) * | 2000-11-03 | 2004-06-08 | International Business Machines Corporation | System for selling a product utilizing audio content identification |
US6983289B2 (en) * | 2000-12-05 | 2006-01-03 | Digital Networks North America, Inc. | Automatic identification of DVD title using internet technologies and fuzzy matching techniques |
KR100375822B1 (en) * | 2000-12-18 | 2003-03-15 | 한국전자통신연구원 | Watermark Embedding/Detecting Apparatus and Method for Digital Audio |
US7958359B2 (en) * | 2001-04-30 | 2011-06-07 | Digimarc Corporation | Access control systems |
US7024018B2 (en) * | 2001-05-11 | 2006-04-04 | Verance Corporation | Watermark position modulation |
DE10133333C1 (en) * | 2001-07-10 | 2002-12-05 | Fraunhofer Ges Forschung | Producing fingerprint of audio signal involves setting first predefined fingerprint mode from number of modes and computing a fingerprint in accordance with set predefined mode |
US6968337B2 (en) * | 2001-07-10 | 2005-11-22 | Audible Magic Corporation | Method and apparatus for identifying an unknown work |
EP1410380B1 (en) * | 2001-07-20 | 2010-04-28 | Gracenote, Inc. | Automatic identification of sound recordings |
US7877438B2 (en) * | 2001-07-20 | 2011-01-25 | Audible Magic Corporation | Method and apparatus for identifying new media content |
JP4398242B2 (en) * | 2001-07-31 | 2010-01-13 | グレースノート インコーポレイテッド | Multi-stage identification method for recording |
US6941003B2 (en) * | 2001-08-07 | 2005-09-06 | Lockheed Martin Corporation | Method of fast fingerprint search space partitioning and prescreening |
US7523312B2 (en) * | 2001-11-16 | 2009-04-21 | Koninklijke Philips Electronics N.V. | Fingerprint database updating method, client and server |
KR20040086350A (en) * | 2002-02-05 | 2004-10-08 | 코닌클리케 필립스 일렉트로닉스 엔.브이. | Efficient storage of fingerprints |
US6983275B2 (en) * | 2002-04-16 | 2006-01-03 | International Business Machines Corporation | Optimizing database query by generating, determining the type of derived predicate based on monotonicity of the column generating expression for each remaining inequality predicate in the list of unexamined predicates |
US6782116B1 (en) * | 2002-11-04 | 2004-08-24 | Mediasec Technologies, Gmbh | Apparatus and methods for improving detection of watermarks in content that has undergone a lossy transformation |
US7082394B2 (en) * | 2002-06-25 | 2006-07-25 | Microsoft Corporation | Noise-robust feature extraction using multi-layer principal component analysis |
US7036024B2 (en) * | 2002-07-09 | 2006-04-25 | Kaleidescape, Inc. | Detecting collusion among multiple recipients of fingerprinted information |
US7110338B2 (en) * | 2002-08-06 | 2006-09-19 | Matsushita Electric Industrial Co., Ltd. | Apparatus and method for fingerprinting digital media |
US7152021B2 (en) * | 2002-08-15 | 2006-12-19 | Digimarc Corporation | Computing distortion of media signals embedded data with repetitive structure and log-polar mapping |
AU2003274545A1 (en) * | 2002-11-12 | 2004-06-03 | Koninklijke Philips Electronics N.V. | Fingerprinting multimedia contents |
KR20050113614A (en) * | 2003-02-26 | 2005-12-02 | 코닌클리케 필립스 일렉트로닉스 엔.브이. | Handling of digital silence in audio fingerprinting |
EP1457889A1 (en) * | 2003-03-13 | 2004-09-15 | Koninklijke Philips Electronics N.V. | Improved fingerprint matching method and system |
CN1882984A (en) * | 2003-11-18 | 2006-12-20 | 皇家飞利浦电子股份有限公司 | Matching data objects by matching derived fingerprints |
US20070106405A1 (en) * | 2005-08-19 | 2007-05-10 | Gracenote, Inc. | Method and system to provide reference data for identification of digital content |
-
2003
- 2003-08-11 AT AT03798257T patent/ATE426297T1/en not_active IP Right Cessation
- 2003-08-11 KR KR1020057005509A patent/KR20050046815A/en not_active Application Discontinuation
- 2003-08-11 CN CNA038233924A patent/CN1685703A/en active Pending
- 2003-08-11 WO PCT/IB2003/003658 patent/WO2004030341A1/en active Application Filing
- 2003-08-11 EP EP03798257A patent/EP1550297B1/en not_active Expired - Lifetime
- 2003-08-11 US US10/529,360 patent/US20060041753A1/en not_active Abandoned
- 2003-08-11 AU AU2003259400A patent/AU2003259400A1/en not_active Abandoned
- 2003-08-11 JP JP2004539277A patent/JP2006501498A/en not_active Withdrawn
- 2003-08-11 DE DE60326743T patent/DE60326743D1/en not_active Expired - Lifetime
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5436653A (en) * | 1992-04-30 | 1995-07-25 | The Arbitron Company | Method and system for recognition of broadcast segments |
US5612729A (en) * | 1992-04-30 | 1997-03-18 | The Arbitron Company | Method and system for producing a signature characterizing an audio broadcast signal |
US5621454A (en) * | 1992-04-30 | 1997-04-15 | The Arbitron Company | Method and system for recognition of broadcast segments |
US6266429B1 (en) * | 1998-09-23 | 2001-07-24 | Philips Electronics North America Corporation | Method for confirming the integrity of an image transmitted with a loss |
WO2002065782A1 (en) * | 2001-02-12 | 2002-08-22 | Koninklijke Philips Electronics N.V. | Generating and matching hashes of multimedia content |
Cited By (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10572896B2 (en) | 2004-05-27 | 2020-02-25 | Anonymous Media Research LLC | Media usage monitoring and measurement system and method |
US10963911B2 (en) | 2004-05-27 | 2021-03-30 | Anonymous Media Research LLC | Media usage monitoring and measurement system and method |
US10719848B2 (en) | 2004-05-27 | 2020-07-21 | Anonymous Media Research LLC | Media usage monitoring and measurement system and method |
US10719849B2 (en) | 2004-05-27 | 2020-07-21 | Anonymous Media Research LLC | Media usage monitoring and measurement system and method |
US10140823B2 (en) | 2013-09-06 | 2018-11-27 | Immersion Corporation | Method and system for providing haptic effects based on information complementary to multimedia content |
US9928701B2 (en) | 2013-09-06 | 2018-03-27 | Immersion Corporation | Method and system for providing haptic effects based on information complementary to multimedia content |
US9934660B2 (en) | 2013-09-06 | 2018-04-03 | Immersion Corporation | Systems and methods for generating haptic effects associated with an envelope in audio signals |
US9947188B2 (en) | 2013-09-06 | 2018-04-17 | Immersion Corporation | Systems and methods for generating haptic effects associated with audio signals |
EP2846227A3 (en) * | 2013-09-06 | 2015-04-29 | Immersion Corporation | Systems and methods for generating haptic effects associated with transitions in audio signals |
US10276004B2 (en) | 2013-09-06 | 2019-04-30 | Immersion Corporation | Systems and methods for generating haptic effects associated with transitions in audio signals |
US10388122B2 (en) | 2013-09-06 | 2019-08-20 | Immerson Corporation | Systems and methods for generating haptic effects associated with audio signals |
US10395490B2 (en) | 2013-09-06 | 2019-08-27 | Immersion Corporation | Method and system for providing haptic effects based on information complementary to multimedia content |
US10395488B2 (en) | 2013-09-06 | 2019-08-27 | Immersion Corporation | Systems and methods for generating haptic effects associated with an envelope in audio signals |
US9711014B2 (en) | 2013-09-06 | 2017-07-18 | Immersion Corporation | Systems and methods for generating haptic effects associated with transitions in audio signals |
US9652945B2 (en) | 2013-09-06 | 2017-05-16 | Immersion Corporation | Method and system for providing haptic effects based on information complementary to multimedia content |
US9619980B2 (en) | 2013-09-06 | 2017-04-11 | Immersion Corporation | Systems and methods for generating haptic effects associated with audio signals |
US9576445B2 (en) | 2013-09-06 | 2017-02-21 | Immersion Corp. | Systems and methods for generating haptic effects associated with an envelope in audio signals |
Also Published As
Publication number | Publication date |
---|---|
US20060041753A1 (en) | 2006-02-23 |
EP1550297B1 (en) | 2009-03-18 |
CN1685703A (en) | 2005-10-19 |
AU2003259400A1 (en) | 2004-04-19 |
EP1550297A1 (en) | 2005-07-06 |
ATE426297T1 (en) | 2009-04-15 |
JP2006501498A (en) | 2006-01-12 |
KR20050046815A (en) | 2005-05-18 |
DE60326743D1 (en) | 2009-04-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1550297B1 (en) | Fingerprint extraction | |
US20060075237A1 (en) | Fingerprinting multimedia contents | |
US6604072B2 (en) | Feature-based audio content identification | |
US7477739B2 (en) | Efficient storage of fingerprints | |
US20060229878A1 (en) | Waveform recognition method and apparatus | |
US20060013451A1 (en) | Audio data fingerprint searching | |
Fenet et al. | A Scalable Audio Fingerprint Method with Robustness to Pitch-Shifting. | |
Haitsma et al. | Speed-change resistant audio fingerprinting using auto-correlation | |
JP2004519015A (en) | Generating and matching hashes of multimedia content | |
WO2003003628A1 (en) | System for monitoring broadcast audio content | |
KR20040087759A (en) | Method for inserting and detecting watermark by a quantization of a characteristic value of a signal | |
EP1497935B1 (en) | Feature-based audio content identification | |
George et al. | Scalable and robust audio fingerprinting method tolerable to time-stretching | |
Kekre et al. | A review of audio fingerprinting and comparison of algorithms | |
Ribbrock et al. | A full-text retrieval approach to content-based audio identification | |
Htun | Analytical approach to MFCC based space-saving audio fingerprinting system | |
Htun | Compact and Robust MFCC-based Space-Saving Audio Fingerprint Extraction for Efficient Music Identification on FM Broadcast Monitoring. | |
Htun et al. | Broadcast Monitoring System using MFCC-based Audio Fingerprinting | |
Steinebach et al. | Re-synchronizing audio watermarking after nonlinear time stretching | |
Kalker et al. | Robust Identification of Audio Using Watermarking and Fingerprinting | |
ROUSSOPOULOS et al. | Mathematical Characteristics for the Automated Recognition of Musical Recordings | |
Camarena-Ibarrola et al. | Robust Audio-Fingerprinting With Spectral Entropy Signatures | |
Lykartsis et al. | ASSESSMENT OF FEATURE EXTRACTION METHODS IN AUDIO FINGERPRINTING |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2003798257 Country of ref document: EP |
|
ENP | Entry into the national phase |
Ref document number: 2006041753 Country of ref document: US Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 10529360 Country of ref document: US |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2004539277 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020057005509 Country of ref document: KR Ref document number: 20038233924 Country of ref document: CN |
|
WWP | Wipo information: published in national office |
Ref document number: 1020057005509 Country of ref document: KR |
|
WWP | Wipo information: published in national office |
Ref document number: 2003798257 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 10529360 Country of ref document: US |