Click here to Skip to main content
15,884,472 members
Articles / Desktop Programming / WPF

Duplicate songs detector via audio fingerprinting

Rate me:
Please Sign up or sign in to vote.
4.96/5 (337 votes)
23 Jun 2020MIT44 min read 1.3M   20.4K   533  
Explains sound fingerprinting algorithm, with a practical example of detecting duplicate files on the user's local drive.
The aim of this article is to show an efficient algorithm of signal processing which will allow one to have a competent system of sound fingerprinting and signal recognition. I'll try to come with some explanations of the article's algorithm, and also speak about how it can be implemented using the C# programming language. Additionally, I'll try to cover topics of digital signal processing that are used in the algorithm, thus you'll be able to get a clearer image of the entire system. And as a proof of concept, I'll show you how to develop a simple WPF MVVM application.
// Sound Fingerprinting framework
// git://github.com/AddictedCS/soundfingerprinting.git
// Code license: CPOL v.1.02
// ciumac.sergiu@gmail.com
using System;
using System.Collections.Generic;
using System.Globalization;
using System.IO;

namespace Soundfingerprinting.Hashing
{
    /// <summary>
    ///   Class for reading local permutations from file
    /// </summary>
    public class LocalPermutations : IPermutations
    {
        /// <summary>
        ///   Path to permutations
        /// </summary>
        private readonly string _pathToPerms;

        /// <summary>
        ///   Separator between 2 consecutive indexes
        /// </summary>
        private readonly string _separator;

        /// <summary>
        ///   Permutations
        /// </summary>
        private int[][] _perms;

        /// <summary>
        ///   Local file permutation object
        /// </summary>
        /// <param name = "pathToPermutations">Path to file with permutations</param>
        /// <param name = "separator">Separator between 2 consecutive permutations</param>
        public LocalPermutations(string pathToPermutations, string separator)
        {
            _pathToPerms = pathToPermutations;
            _separator = separator;
        }

        #region IPermutations Members

        /// <summary>
        ///   Get permutations
        /// </summary>
        /// <returns>Permutations read from file</returns>
        public int[][] GetPermutations()
        {
            if (_perms != null)
                return _perms;
            List<int[]> result = new List<int[]>();
            using (StreamReader reader = new StreamReader(_pathToPerms))
            {
                while (reader.Peek() != -1)
                {
                    string line = reader.ReadLine();
                    if (line != null)
                    {
                        string[] ints = line.Split(new[] {_separator}, StringSplitOptions.RemoveEmptyEntries);
                        int[] permutation = new int[ints.Length];
                        int i = 0;
                        foreach (string item in ints)
                        {
                            permutation[i++] = Convert.ToInt32(item, CultureInfo.InvariantCulture);
                        }
                        result.Add(permutation);
                    }
                }
            }
            _perms = result.ToArray();
            return _perms;
        }

        #endregion
    }
}

By viewing downloads associated with this article you agree to the Terms of Service and the article's licence.

If a file you wish to view isn't highlighted, and is a text file (not binary), please let us know and we'll add colourisation support for it.

License

This article, along with any associated source code and files, is licensed under The MIT License


Written By
Software Developer
Moldova (Republic of) Moldova (Republic of)
Interested in computer science, math, research, and everything that relates to innovation. Fan of agnostic programming, don't mind developing under any platform/framework if it explores interesting topics. In search of a better programming paradigm.

Comments and Discussions