Index was outside the bounds of the array.

Question

I am developing a project and I get this error:

"Index was outside the bounds of the array."

The project works for PDF files whose content is in English, but whenever I open a PDF file whose content is in Arabic, the error is raised.

The error is raised at the following line:

C#

page = PdfTextExtractor.GetTextFromPage(r, i, Strategy);

my code:

C#

using System;
using System.Collections.Generic;
using System.ComponentModel;
using System.Data;
using System.IO;
using System.Text;
using iTextSharp.text;
using iTextSharp.text.pdf;
using iTextSharp.text.pdf.parser;
using System.Drawing;
using System.Linq;
using System.Threading.Tasks;
using System.Windows.Forms;

namespace test
{
    public partial class Form1 : Form
    {
        string filename;
        string path;
        public Form1()
        {
            InitializeComponent();
        }

        private void button1_Click(object sender, EventArgs e)
        {
            OpenFileDialog openFileDialog = new OpenFileDialog();
            openFileDialog.CheckFileExists = true;
            openFileDialog.AddExtension = true;
            openFileDialog.Filter = "PDF files (*.pdf)|*.pdf";
            DialogResult result = openFileDialog.ShowDialog();
            if (result == DialogResult.OK)
            {
                //data1 = openFileDialog.FileNames.Select(x => new FileInfo(x)).ToArray();
                filename = Path.GetFileName(openFileDialog.FileName);
                path = Path.GetDirectoryName(openFileDialog.FileName);
                textBox1.Text = path + "\\" + filename;

            }
        }

        private void button3_Click(object sender, EventArgs e)
        {
            string s = Form1.ExtractTextFromPdf(textBox1.Text);
            // Reverse the extracted characters before displaying them.
            string reverseValue = new string(s.Reverse().ToArray());
            richTextBox1.Text = reverseValue;
        }



        public static string ExtractTextFromPdf(string filename)
        {
            using (PdfReader r = new PdfReader(filename))
            {
                StringBuilder text = new StringBuilder();
                // iTextSharp page numbers start at 1.
                for (int i = 1; i <= r.NumberOfPages; i++)
                {
                    // Use a new strategy for each page; a reused instance keeps
                    // the text it collected from earlier pages.
                    ITextExtractionStrategy strategy = new LocationTextExtractionStrategy();
                    // The exception is raised on this call for the Arabic PDFs.
                    string page = PdfTextExtractor.GetTextFromPage(r, i, strategy);
                    // Drop the line breaks so the text comes back as one run.
                    foreach (string line in page.Split('\n'))
                    {
                        text.Append(line);
                    }
                }
                return text.ToString();
            }
        }

    }
}

Please help me.

Thank you.

Posted 27-Aug-15 3:21am

Krishna Veni

Comments

Simon_Whale 27-Aug-15 9:32am

Have you made sure that r, i and Strategy are not null?
Also, what is the PdfTextExtractor.GetTextFromPage method? Is this something that you have created, or is it a 3rd-party API?
What is the exact error message that you are getting?

F-ES Sitecore 27-Aug-15 10:05am

It might be because GetTextFromPage uses a 0-based index.

for (int i = 0; i < r.NumberOfPages; i++)

CHill60 27-Aug-15 10:32am

That's what I thought too, but it looks like page numbering *does* begin with 1 :(

CHill60 27-Aug-15 10:35am

If you change LocationTextExtractionStrategy to SimpleTextExtractionStrategy do you get the same error? I can't see where you're defining the text location
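
A minimal sketch of that suggestion, assuming iTextSharp 5.x. The class and method names (PdfTextSketch, ExtractWithSimpleStrategy) are made up for illustration and are not from the question. It creates a new SimpleTextExtractionStrategy for every page. Whether this avoids the exception for the Arabic files is not confirmed anywhere in this thread.

```csharp
using System.Text;
using iTextSharp.text.pdf;
using iTextSharp.text.pdf.parser;

public static class PdfTextSketch
{
    // Hypothetical helper, not part of the original post.
    public static string ExtractWithSimpleStrategy(string pdfPath)
    {
        var text = new StringBuilder();
        using (var reader = new PdfReader(pdfPath))
        {
            // iTextSharp page numbers start at 1.
            for (int page = 1; page <= reader.NumberOfPages; page++)
            {
                // A fresh strategy per page, so text collected for
                // earlier pages is not carried over.
                ITextExtractionStrategy strategy = new SimpleTextExtractionStrategy();
                text.AppendLine(PdfTextExtractor.GetTextFromPage(reader, page, strategy));
            }
        }
        return text.ToString();
    }
}
```

If the same file still fails with this strategy, the problem is probably not in how the text location is handled.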

Herman<T>.Instance 27-Aug-15 11:08am

This is your third question about the same code, and you expand the code each time. Whenever the next problem comes up, you are back here. Please Google some questions first. Some of your problems, like this one, are too easy to tackle. Why don't you debug the loop and see whether i or r is the problem?

And please accept the solutions given to you in the other questions.
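
One possible way to narrow the loop down, sketched under assumptions: the class PageLoopDiagnostics and its ExtractPages method are hypothetical names, and the per-page call is passed in as a delegate so the loop can be checked without a PDF. It catches the exception per page and records which page numbers failed, instead of letting the first failure stop the whole extraction.

```csharp
using System;
using System.Collections.Generic;
using System.Text;

public static class PageLoopDiagnostics
{
    // Hypothetical helper: runs extractPage for pages 1..pageCount and
    // records the pages that throw IndexOutOfRangeException.
    public static string ExtractPages(int pageCount, Func<int, string> extractPage, List<int> failedPages)
    {
        var text = new StringBuilder();
        for (int page = 1; page <= pageCount; page++)
        {
            try
            {
                text.Append(extractPage(page));
            }
            catch (IndexOutOfRangeException ex)
            {
                // Remember the failing page and keep going.
                failedPages.Add(page);
                Console.Error.WriteLine("Page " + page + " failed: " + ex.Message);
            }
        }
        return text.ToString();
    }
}
```

With the code from the question, extractPage could be i => PdfTextExtractor.GetTextFromPage(r, i, new LocationTextExtractionStrategy()) and pageCount could be r.NumberOfPages. If every page of the Arabic file ends up in failedPages, the loop index is not the problem.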

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)