Working With Microsoft.mshtml.dll and SHDocVw.dll
How to work with Microsoft.mshtml.dll and SHDocVw.dll.
Introduction
This article contains a discussion on Microsoft.mshtml.dll and SHDocVw.dll.
Background
I was working on BHO for IE in C#. Initially I did not know any thing about BHO and IE extensions. I read articles about the architecture and related information from MSDN. I also read about the architecture of IE from MSDN.
But I got my best understanding of BHO from CodeProject from this article by Pavel Zolnikov. I have listed here some problems that I faced and the solutions to those problems.
Microsoft.mshtml.dll and SHDocVW.dll
I was working on the BHO toolbar and I was trying to access an Object
element (<object>
tag), Image
element (<img>
tag), and Embed
element (<embed>
tag). I had no idea how to get these elements.
I had two solutions:
- Use JavaScript to get an element.
- Retrieve the Object directly from the page [like from the IE Developer ToolBar]
I used both of them because in many cases the second approach failed to detect the Object tag.
Injection JavaScript in Web Pages
We can inject our own JavaScript in any web page in IE.
//Explorer is Object of SHDocVw.WebBrowserClass
HTMLDocument htmlDoc = (HTMLDocument)this.Explorer.IWebBrowser_Document;
//inject Script
htmlDoc।parentWindow.execScript("alert('hello world !!')", "javascript");
Handling Web Document Events
You can directly handle events of documents from your BHO as follows:
((mshtml.HTMLDocumentEvents2_Event)htmlDoc).onmousedown +=
newHTMLDocumentEvents2_onmousedownEventHandler(MyToolBar_onmousedown);
//Function is as follows:
//This will be raised when mousedown event is fired
//....when user try to click..(downs the mouse button)
void MyToolBar_onmousedown(IHTMLEventObj pEvtObj)
{
MessageBox.Show(pEvtObj.srcElement.tagName);
}
There was a problem with this method. Implementing the above code stopped the actual events of the browser. Events on the web browser were not fired. In other words, events of the web browser were locked. This was an annoying problem, but I found the solution for this problem here by Rick Strahl.
namespace MyToolBar
{
public delegate void DOMEvent(mshtml.IHTMLEventObj e);
public classDOMEventHandler
{
public DOMEvent Handler;
DispHTMLDocument Document;
public DOMEventHandler(DispHTMLDocument doc)
{
this.Document = doc;
}
[DispId(0)]
public voidCall()
{
Handler(Document.parentWindow.@event);
}
}
}
The above class can be used as:
//Explorer is Object of SHDocVw.WebBrowserClass
HTMLDocument htmlDoc =(HTMLDocument)this.Explorer.IWebBrowser_Document;
DispHTMLDocument doc = (DispHTMLDocument ) htmlDoc
DOMEventHandler onmousedownhandler = newDOMEventHandler (doc);
onmousedownhandler.Handler += new DOMEvent(Mouse_Down);
doc.onmousedown = onmousedownhandler;
//mouse Down Can be like this
public void Mouse_Down(mshtml.IHTMLEventObje)
{
MessageBox.Show(e.srcElement.tagName);
}
Collecting All the Images From a Web Page
I was trying to access all the images from a web page. It can be done in one of the following ways.
Using the Document.Images Collection
//Explorer is Object of SHDocVw.WebBrowserClass
HTMLDocument htmlDoc = (HTMLDocument)this.Explorer.IWebBrowser_Document;
//get all the images of document
IHTMLElementCollection imgs = htmlDoc.images;
foreach (HTMLImgClass imgTag in imgs)
{
MessageBox.Show(imgTag.src);
}
Using getElementsByTagName
HTMLDocument htmlDoc = (HTMLDocument)this.Explorer.IWebBrowser_Document;
//get all the images of document
IHTMLElementCollection imgs = htmlDoc.getElementsByTagName("img");
foreach (HTMLImgClass imgTag in imgs)
{
MessageBox.Show(imgTag.src);
}
Accessing Content From Frames and IFrames
The biggest problem was accessing content from Frames and IFrames. I tried a lot to access content from Frames and IFrames. But it was always giving me an IFrameElement
or an Access Denied Exception.
IHTMLElementCollection frames =
(IHTMLElementCollection) htmlDoc।getElementsByTagName("frame");
if (frames != null)
{
foreach (IHTMLElement frminframes)
{
(HTMLFrameElement)frm).contentWindow.execScript(
'alert('Hello From Frame')',"javascript");
}
}
Using the above code, I was able to access the frame but it was giving me an Access Denied Exception. I was trying to attach an event with a Frame/IFrame Document but there were same problems. Finally I got a solution from here. I implemented a solution as follows:
IHTMLElementCollection elcol = htmlDoc.getElementsByTagName("iframe");
foreach (IHTMLElement iel inelcol)
{
HTMLFrameElement frm = (HTMLFrameElement )iel;
DispHTMLDocument doc =(DispHTMLDocument)((SHDocVw.IWebBrowser2)frm).Document;
DOMEventHandler onmousedownhandler = new DOMEventHandler (doc);
onmousedownhandler।Handler += new DOMEvent(Mouse_Down);
doc.onmousedown = onmousedownhandler;
}
Get All the Attributes of the Tag / Element
I was required to get all the attributes of a tag. Attributes like height, width, src, etc. I was able to get all the attribute as follows:
IHTMLElement element;
IHTMLDOMNode nd = (IHTMLDOMNode)element;
IHTMLAttributeCollection attribs = (IHTMLAttributeCollection )nd.attributes;
try
{
foreach (IHTMLDOMAttribute2 att in attribs)
{
if (((IHTMLDOMAttribute)att).specified)
{
MessageBox.Show(att.value);
}
}
}catch{}