Click here to Skip to main content
15,879,326 members
Articles / Programming Languages / C#

Import Excel File to DataSet

Rate me:
Please Sign up or sign in to vote.
4.69/5 (23 votes)
29 May 2012CPOL 343.5K   12.5K   104   41
Allows for inputting Excel tables into a DataSet.

Introduction

I just wanted to put out two ways to import data from Excel into a DataSet.

  1. Preferred - Importing from an XLS file using the OLE Jet engine (simple).
    1. UPDATED: I'm now setting the IMEX flag to 0. This may cause worse performance. Check out  http://www.connectionstrings.com/excel for more information.
    2. UPDATED: I switched it to use ACE driver if it has an XLSX extension. You will need to ensure ACE is installed on the server you deploy to.
      1. http://www.microsoft.com/en-us/download/details.aspx?id=13255 
    3. UPDATED: Please refer to these links for good information about connecting to xls and xlsx files. 
      1. http://www.connectionstrings.com/excel
      2. http://www.connectionstrings.com/excel-2007  
  2. Importing from an Excel XML file. (The XML format that Excel uses, not just any XML file.) 
    1. Note: This is a long-winded custom solution. Should work, but might require tweaks.
    2. This works well if you're sure the data will be valid, or if you don't require it to do type-detection (flag for this on procedure). 
    3. Exporting to an Excel XML file can be found here

Using the code

Download the file for specifics, but here's a summary:

XLS Import 

C#
public static DataSet ImportExcelXLS(string FileName, bool hasHeaders) {
    string HDR = hasHeaders ? "Yes" : "No";
    string strConn;
    if (FileName.Substring(FileName.LastIndexOf('.')).ToLower() == ".xlsx")
        strConn = "Provider=Microsoft.ACE.OLEDB.12.0;Data Source=" + FileName + ";Extended Properties=\"Excel 12.0;HDR=" + HDR + ";IMEX=0\"";
    else
        strConn = "Provider=Microsoft.Jet.OLEDB.4.0;Data Source=" + FileName + ";Extended Properties=\"Excel 8.0;HDR=" + HDR + ";IMEX=0\"";

    DataSet output = new DataSet();

    using (OleDbConnection conn = new OleDbConnection(strConn)) {
        conn.Open();

        DataTable schemaTable = conn.GetOleDbSchemaTable(
            OleDbSchemaGuid.Tables, new object[] { null, null, null, "TABLE" });

        foreach (DataRow schemaRow in schemaTable.Rows) {
            string sheet = schemaRow["TABLE_NAME"].ToString();

            if (!sheet.EndsWith("_")) {
                try {
                    OleDbCommand cmd = new OleDbCommand("SELECT * FROM [" + sheet + "]", conn);
                    cmd.CommandType = CommandType.Text;

                    DataTable outputTable = new DataTable(sheet);
                    output.Tables.Add(outputTable);
                    new OleDbDataAdapter(cmd).Fill(outputTable);
                } catch (Exception ex) {
                    throw new Exception(ex.Message + string.Format("Sheet:{0}.File:F{1}", sheet, FileName), ex);
                }
            }
        }
    }
    return output;
} 

Excel XML Import (Summary)

C#
public static DataSet ImportExcelXML(Stream inputFileStream, 
                      bool hasHeaders, bool autoDetectColumnType) {
    XmlDocument doc = new XmlDocument();
    doc.Load(new XmlTextReader(inputFileStream));
    XmlNamespaceManager nsmgr = new XmlNamespaceManager(doc.NameTable);

    nsmgr.AddNamespace("o", "urn:schemas-microsoft-com:office:office");
    nsmgr.AddNamespace("x", "urn:schemas-microsoft-com:office:excel");
    nsmgr.AddNamespace("ss", "urn:schemas-microsoft-com:office:spreadsheet");

    DataSet ds = new DataSet();

    foreach (XmlNode node in 
      doc.DocumentElement.SelectNodes("//ss:Worksheet", nsmgr)) {
        DataTable dt = new DataTable(node.Attributes["ss:Name"].Value);
        ds.Tables.Add(dt);
        XmlNodeList rows = node.SelectNodes("ss:Table/ss:Row", nsmgr);
        if (rows.Count > 0) {

            //*************************
            //Add Columns To Table from header row
            //*************************
            List<ColumnType> columns = new List<ColumnType>();
            int startIndex = 0;
            if (hasHeaders) {
                foreach (XmlNode data in rows[0].SelectNodes("ss:Cell/ss:Data", nsmgr)) {
                    columns.Add(new ColumnType(typeof(string)));//default to text
                    dt.Columns.Add(data.InnerText, typeof(string));
                }
                startIndex++;
            }
            //*************************
            //Update Data-Types of columns if Auto-Detecting
            //*************************
            if (autoDetectColumnType && rows.Count > 0) {
                XmlNodeList cells = rows[startIndex].SelectNodes("ss:Cell", nsmgr);
                int actualCellIndex = 0;
                for (int cellIndex = 0; cellIndex < cells.Count; cellIndex++) {
                    XmlNode cell = cells[cellIndex];
                    if (cell.Attributes["ss:Index"] != null)
                        actualCellIndex = 
                          int.Parse(cell.Attributes["ss:Index"].Value) - 1;

                    ColumnType autoDetectType = 
                      getType(cell.SelectSingleNode("ss:Data", nsmgr));

                    if (actualCellIndex >= dt.Columns.Count) {
                        dt.Columns.Add("Column" + 
                          actualCellIndex.ToString(), autoDetectType.type);
                        columns.Add(autoDetectType);
                    } else {
                        dt.Columns[actualCellIndex].DataType = autoDetectType.type;
                        columns[actualCellIndex] = autoDetectType;
                    }

                    actualCellIndex++;
                }
            }
            //*************************
            //Load Data
            //*************************
            for (int i = startIndex; i < rows.Count; i++) {
                DataRow row = dt.NewRow();
                XmlNodeList cells = rows[i].SelectNodes("ss:Cell", nsmgr);
                int actualCellIndex = 0;
                for (int cellIndex = 0; cellIndex < cells.Count; cellIndex++) {
                    XmlNode cell = cells[cellIndex];
                    if (cell.Attributes["ss:Index"] != null)
                        actualCellIndex = int.Parse(cell.Attributes["ss:Index"].Value) - 1;

                    XmlNode data = cell.SelectSingleNode("ss:Data", nsmgr);

                    if (actualCellIndex >= dt.Columns.Count) {
                        for (int i = dt.Columns.Count; i < actualCellIndex; i++) {
                            dt.Columns.Add("Column" + 
                                       actualCellIndex.ToString(), typeof(string));
                            columns.Add(getDefaultType());
                        }
                        ColumnType autoDetectType = 
                           getType(cell.SelectSingleNode("ss:Data", nsmgr));
                        dt.Columns.Add("Column" + actualCellIndex.ToString(), 
                                       typeof(string));
                        columns.Add(autoDetectType);
                    }
                    if (data != null)
                        row[actualCellIndex] = data.InnerText;

                    actualCellIndex++;
                }

                dt.Rows.Add(row);
            }
        }
    }
    return ds;
}
    //*************************
    //Format of file, in case you're wondering
    //*************************

    //<?xml version="1.0"?>
    //<?mso-application progid="Excel.Sheet"?>
    //<Workbook>
    // <Worksheet ss:Name="Sheet1">
    //  <Table>
    //   <Row>
    //    <Cell><Data ss:Type="String">Item Number</Data></Cell>
    //    <Cell><Data ss:Type="String">Description</Data></Cell>
    //    <Cell ss:StyleID="s21"><Data ss:Type="String">Item Barcode</Data></Cell>
    //   </Row>
    // </Worksheet>
    //</Workbook>

License

This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


Written By
Software Developer
United States United States
likes boardgames, computer games, and enjoys his .net programming job.

Comments and Discussions

 
Questionimport excel to gridcrontrol of devspress (importar excel a un gridcontrol de devespreess) Pin
Michel Abreu Pardo6-Jun-18 4:09
Michel Abreu Pardo6-Jun-18 4:09 
QuestionExcel source file still in use after ImportExcelXLS(...) done Pin
Member 115226558-Dec-15 3:40
Member 115226558-Dec-15 3:40 
AnswerRe: Excel source file still in use after ImportExcelXLS(...) done Pin
ColinBashBash8-Dec-15 4:11
ColinBashBash8-Dec-15 4:11 
Questionmicrosoft.ace.oledb.12.0' provider is not registered on the local machine Pin
Code10812-Nov-14 13:14
Code10812-Nov-14 13:14 
QuestionLarge files Pin
Ming Fei Lam (Avery)16-Oct-14 2:47
Ming Fei Lam (Avery)16-Oct-14 2:47 
SuggestionRe: Large files Pin
ColinBashBash16-Oct-14 4:34
ColinBashBash16-Oct-14 4:34 
QuestionHttpPostedFile ? Pin
Member 245846716-Sep-14 22:19
Member 245846716-Sep-14 22:19 
QuestionRe: HttpPostedFile ? Pin
ColinBashBash17-Sep-14 4:09
ColinBashBash17-Sep-14 4:09 
QuestionShowing the read data from excel in grid Pin
Arjun Menon U.K25-May-13 6:12
Arjun Menon U.K25-May-13 6:12 
QuestionVoted 5 Pin
antew17-Dec-12 1:38
antew17-Dec-12 1:38 
GeneralMy vote of 5 Pin
parslej15-Oct-12 6:17
parslej15-Oct-12 6:17 
QuestionHttpPostedFile Pin
luffyripper29-Aug-12 3:46
luffyripper29-Aug-12 3:46 
AnswerRe: HttpPostedFile Pin
ColinBashBash29-Aug-12 3:50
ColinBashBash29-Aug-12 3:50 
GeneralRe: HttpPostedFile Pin
luffyripper29-Aug-12 5:43
luffyripper29-Aug-12 5:43 
GeneralMy 5! Pin
_Amy10-Aug-12 3:11
professional_Amy10-Aug-12 3:11 
GeneralWorking very well! Pin
Pavan Gayakwad6-Aug-12 20:50
Pavan Gayakwad6-Aug-12 20:50 
GeneralMy Vote of 5 Pin
Srinivasan Janakiraman20-Jun-12 19:33
Srinivasan Janakiraman20-Jun-12 19:33 
GeneralMy vote of 5 Pin
kaedei29-May-12 15:47
kaedei29-May-12 15:47 
QuestionData is not filled in enough into Dataset Pin
botngot8328-May-12 20:37
botngot8328-May-12 20:37 
AnswerRe: Data is not filled in enough into Dataset Pin
ColinBashBash29-May-12 3:39
ColinBashBash29-May-12 3:39 
GeneralRe: Data is not filled in enough into Dataset Pin
botngot8329-May-12 22:00
botngot8329-May-12 22:00 
GeneralRe: Data is not filled in enough into Dataset Pin
botngot8331-May-12 17:18
botngot8331-May-12 17:18 
Is there any updates for this issue, Colin?I search around and see nothing Frown | :(
GeneralRe: Data is not filled in enough into Dataset Pin
Bruce Goodman10-Jun-12 22:32
Bruce Goodman10-Jun-12 22:32 
Bug'inputFileStream.ReadTimeout' threw an exception of type 'System.InvalidOperationException' Pin
Member 35080023-Apr-12 21:59
Member 35080023-Apr-12 21:59 
GeneralRe: 'inputFileStream.ReadTimeout' threw an exception of type 'System.InvalidOperationException' Pin
ColinBashBash4-Apr-12 3:54
ColinBashBash4-Apr-12 3:54 

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Praise Praise    Rant Rant    Admin Admin   

Use Ctrl+Left/Right to switch messages, Ctrl+Up/Down to switch threads, Ctrl+Shift+Left/Right to switch pages.