|
|
Dear All,
I have a list of edges like:
(0,0),(2,0)
(2,3),(3,4)
(2,0),(2,2)
(0,0),(1,5)
(1,5),(1,10)
(3,4),(3,10)
(2,3),(1,5)
(3,4),(4,0)
(1,10),(3,10)
If you draw these edges on a Cartesian coordinate system, you can visually see three polygons.
In mapping applications, a polygon is defined by a continuous, closed set of data points. So, the required output is a list like this:
(0,0),(2,0),(2,3),(1,5),(0,0)
(2,0),(4,0),(3,4),(2,3),(2,0)
(2,3),(3,4),(3,10),(1,10),(1,5),(2,3)
Can you please guide me on how to do this?
Simplifying: I need Voronoi polygons around my points. A Voronoi algorithm gave me the list of edges, but I am unable to get the polygons from those edges.
Hope someone can guide me!
|
|
|
|
|
Do you know a priori how many sides each polygon has?
Also, is this an ordered list?
|
|
|
|
|
We don't know the number of sides; each polygon has a different number of sides.
What is meant by an ordered list in this case? The list is totally random, just a list of all the edges, no ordering at all.
Thanks for looking into this!
Any ideas now?
|
|
|
|
|
furqan_sindhu wrote: What is meant by an ordered list in this case? The list is totally random, just a list of all the edges, no ordering at all.
This is what I was trying to understand when I asked about an ordered list. If all the points are simply in a random list, how can you possibly reconstruct the polygons based only on a list of random points? The problem is that polygons can take on complicated shapes, so you need to know something about number of sides and/or which points should belong to which polygons. At the very minimum, the points should be ordered according to which polygon they belong to. Is there no way of doing this? Can't you create a polygon class that contains a list of points, for example?
|
|
|
|
|
You might want to check "Algorithms" by Robert Sedgewick, published by Addison-Wesley. He has several algorithms, including "Convex Hull" and "Voronoi", as solutions to these problems.
Dave.
|
|
|
|
|
I'm not familiar with the book or the algorithms, but I don't think he's trying to construct the convex hull. I think he's got a randomized list of polygon edges for separate polygons and he's trying to reconstruct the component polygons from the list. A convex hull can certainly be determined given the points, but I'm not convinced that's what he wants to do.
|
|
|
|
|
My reply was more directed to his statement that he didn't know what to do with his Voronoi edges. In addition, his stated edges do NOT form three polygons. There are two missing edges: one between (2,2) and (2,3), and the other between (2,0) and (4,0), both of which he filled in in the polygon descriptions. He is forming a convex hull with enclosed polygons. He also probably wants to split up the polygons into triangles, but that is another topic.
Dave.
|
|
|
|
|
So the list isn't even complete? Hmmmm. Methinks he needs to put a little more forethought into this one. It helps to have the problem well-defined first!
|
|
|
|
|
Oh, I had a look at your message a little late, "Member 4194593".
Thanks for your reply.
I think I accidentally missed those edges. My apologies!
I don't want to construct triangles, rather polygons. I will have a look at those algorithms!
|
|
|
|
|
I have thought about the problem these last few days. I Googled for "Convex Hull" and read all of the related articles. Usually only points are given, not edges. If edges are given, wouldn't at least those edges be required in the final solution? I don't think that eliminating one of these edges would be a solution. If you are given a triangle, then it is a triangular area; you can't make it a square by combining it with some other area. People don't take kindly to "re-districting" or "gerrymandering".
Fortunately, it has been proven that you only need 4 colors to tint your map when you get it done.
I would be interested in the exact phrasing of the problem as given, along with your final solution. I love algorithms and I frequent these halls daily.
I am serious about the book "Algorithms" - there is such an abundance of simple, piecewise-efficient solutions to this type of problem described in this book, and Sedgewick explains it all so clearly.
Dave.
|
|
|
|
|
Well,
I don't know anything at all; I just have a list of edges and that's all.
I have been working on this, and up till now I have come up with some code that gives me all possible polygons from those edges.
"All possible" means that it also returns polygons that contain multiple smaller polygons. I am trying to come up with something to eliminate these bigger polygons.
Anyway, I think I will get it done, inshaALLAH.
|
|
|
|
|
I think you have some errors in the list of edges. For instance, your polygons include an edge from (2,0) to (2,3), but there is no such edge in your list of edges. Anyway, I think what you're suggesting here is that you've got a list of edges and vertices which define a mesh (often referred to as a "boundary representation") and you want to find the polygons in that mesh. I'd suggest that you look at the Winged Edge representation: turn your edges into a winged edge rep and get your polys from that. Essentially, you'd have to keep a list of vertices with the edges around them in clockwise order, then traverse each vertex and trace out each poly between adjacent edges. You have to be careful about skipping over polys that have already been listed, which means you have to keep track of the polygons each edge separates. It'd be a bit ugly, but I think that's pretty much what's required. It would also mean that the area surrounding the entire mesh will be represented as a "polygon". You can recognize that polygon by looking at the rightmost vertex, getting the first edge from it that is CCW from vertical, and then flagging the polygon clockwise from that edge as the "exterior polygon".
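Here is a rough, untested sketch of that face-tracing idea in Python, just to make it concrete. All function and variable names are mine, and it assumes the edges form a connected planar graph with no duplicate edges:

import math
from collections import defaultdict

def extract_polygons(edges):
    # For every vertex, collect its neighbours and sort them by angle
    # (counter-clockwise) around that vertex.
    nbrs = defaultdict(list)
    for a, b in edges:
        nbrs[a].append(b)
        nbrs[b].append(a)
    for v in nbrs:
        nbrs[v].sort(key=lambda w: math.atan2(w[1] - v[1], w[0] - v[0]))

    # Each undirected edge yields two directed half-edges. Trace each
    # face exactly once by always leaving a vertex on the edge that is
    # clockwise-adjacent to the one we arrived on (i.e. the predecessor
    # of the arrival edge in the vertex's CCW ring).
    visited, faces = set(), []
    for a, b in edges:
        for start in ((a, b), (b, a)):
            if start in visited:
                continue
            face, (u, v) = [], start
            while (u, v) not in visited:
                visited.add((u, v))
                face.append(u)
                ring = nbrs[v]
                u, v = v, ring[ring.index(u) - 1]
            faces.append(face)
    return faces

With this orientation convention the bounded polygons come out counter-clockwise and the unbounded "exterior polygon" comes out clockwise, so you can identify and drop the one face with negative signed area.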
If you're looking for a Voronoi diagram that keeps a record of all the polygons, I've got one that I've been thinking about putting up on CodeProject. There's a Fortune's algorithm implementation already out there on CodeProject, but it has some bugs and pretty much produces a dump of edges like you're talking about, without an indication of the polygons each edge goes with. Mine gives a full winged edge representation of the diagram, which includes all the polygons, edges and vertices, all hooked together intelligently. If that's what you're looking for, perhaps I'll put my Fortune algorithm up earlier rather than later.
|
|
|
|
|
I have n random variables. Each random variable is normally distributed with mean 0 and variance 1. In addition, I have a function I can call that returns a normally distributed random value with mean 0 and variance 1. For each pair of variables, I have their correlation. I would like to generate a list of n random values (one for each of the n variables) such that the given correlations apply. What algorithm should I use to do this?
Thanks
Bob
|
|
|
|
|
Do you mean a single random variable with a fixed autocorrelation structure, or are you talking about multiple random variables with a given correlation matrix?
In any event, there are two main ways of doing this; either by Cholesky decomposition or eigenvector decomposition. Cholesky decomposition is probably more conceptually simple.
For normal variables, the basic idea is to find a matrix U such that U'U = C (where U' is the transpose of U) and C is the given correlation matrix. You can then generate correlated random numbers, N' from random numbers N by multiplying by U. That is:
N' = NU
So, how to find U? Like I said, either by Cholesky decomposition or eigenvector decomposition. Until I know what you really want to do, I'll give you the case for pairs of r.v.'s.
For pairs of normal variables, you can generate two sequences of normal r.v.'s, say X1 and X2. Given correlation, p, the new sequence, Y is calculated as:
Y = p*X1 + sqrt(1-p^2) * X2
This will give you a sequence Y that has correlation p with sequence X1. If you want a fixed correlation structure between more than two sequences, we'll have to get into matrix calculations.
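If it helps, here is a quick numpy sketch of that pairwise formula; the 0.7 is just an example correlation:

import numpy as np

p = 0.7                                  # desired correlation (example)
x1 = np.random.standard_normal(100000)
x2 = np.random.standard_normal(100000)
y = p * x1 + np.sqrt(1.0 - p**2) * x2    # still mean 0, variance 1
print(np.corrcoef(x1, y)[0, 1])          # prints something close to 0.7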
|
|
|
|
|
Zeppelin,
Thanks for the response. I am talking about multiple random variables with a given correlation matrix. I am familiar with Cholesky decomposition but not eigenvector decomposition.
Bob
|
|
|
|
|
Cholesky decomposition and eigenvector decomposition will give you the same result for this problem. Since Cholesky is easier to implement, I'd suggest using that method.
The method is just a matrix form of the calculation I described above.
We want U'U = C where C is the correlation matrix. So, assemble your correlation matrix and apply Cholesky decomposition to it. This gives you U. Generate, say, 5 series of random variates of 100 observations. Assemble those 5 column matrices into a large matrix so that the dimension of N is 100x5. This means the correlation matrix has to be 5x5 and contains your known/desired cross-correlations. The 5 correlated series are simply N*U. All done!
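In numpy it is only a few lines. The 5x5 equicorrelated matrix below (all off-diagonal correlations 0.5) is made up just so the example runs; substitute your own C:

import numpy as np

C = np.full((5, 5), 0.5)          # example correlation matrix
np.fill_diagonal(C, 1.0)

L = np.linalg.cholesky(C)         # lower triangular, L @ L.T == C
U = L.T                           # upper triangular, so U.T @ U == C
N = np.random.standard_normal((100, 5))   # 5 independent series
correlated = N @ U                # 5 series with correlation matrix ~ C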
Eigenvector decomposition proceeds in a similar fashion but is more complicated to implement. Given the correlation matrix, C, you calculate its eigenvectors and eigenvalues (standard linear algebra). If the eigenvectors are E_i and the eigenvalues are L_i, where i is a subscript, then we calculate a matrix V such that:
V = E*diag(sqrt(L_i))
where E is the matrix whose columns are the eigenvectors E_i, and diag(sqrt(L_i)) is a diagonal matrix with sqrt(L_i) on the diagonal. Because of its construction, matrix V has the property that VV' = C, C being the correlation matrix. Then, as above, we have U'U = C where, conveniently (ha!), U = V'.
Nifty, but a roundabout route, used mainly for matrices that have problems being decomposed via Cholesky. Now our correlated random numbers, R_c, are just:
R_c = R*U where R is the matrix of original random numbers.
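Again as a numpy sketch (np.linalg.eigh handles the symmetric eigendecomposition; C is the same example matrix as in the Cholesky sketch):

import numpy as np

C = np.full((5, 5), 0.5)          # example correlation matrix
np.fill_diagonal(C, 1.0)

w, E = np.linalg.eigh(C)          # C == E @ np.diag(w) @ E.T
V = E @ np.diag(np.sqrt(w))       # V @ V.T == C
U = V.T                           # so U.T @ U == C, as above
R = np.random.standard_normal((100, 5))
R_c = R @ U                       # correlated random numbers

Note this U is not triangular like the Cholesky factor, but any U with U'U = C will do.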
|
|
|
|
|
Zeppelin,
When you write U'U = C, is U' the transpose of U? I am thinking that U' should be lower triangular and U should be upper triangular. In addition, U' should have all 1s on its main diagonal. Is this right?
Thanks
Bob
|
|
|
|
|
Yes, sorry. The prime indicates the transpose. U should be upper triangular if C is positive definite, correct. Thus U' will be lower triangular.
Certainly C should have all ones on its main diagonal (and be symmetric) but U may not necessarily have all ones, no.
A quick calculation using the correlation matrix C:
1.0000 0.6000 0.3000
0.6000 1.0000 0.5000
0.3000 0.5000 1.0000
gives U as:
1.0000 0.6000 0.3000
0.0000 0.8000 0.4000
0.0000 0.0000 0.8660
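For what it's worth, that factor is easy to reproduce with any linear algebra package, e.g. in numpy (which returns the lower factor, hence the transpose):

import numpy as np

C = np.array([[1.0, 0.6, 0.3],
              [0.6, 1.0, 0.5],
              [0.3, 0.5, 1.0]])
U = np.linalg.cholesky(C).T       # upper triangular, U.T @ U == C
print(np.round(U, 4))             # reproduces the matrix above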
|
|
|
|
|
Zeppelin,
Your answer makes a lot of sense to me. However, I need to ask a different question. Suppose that I have n random numbers that were generated independently with mean 0 and variance 1. However, I want these numbers to be correlated by a given correlation matrix (this is the same as my original question). In addition, I have two vectors u and v. I want the ith number generated to have mean u[i] and variance v[i]. Can I do this by doing the following?
1) Use your above algorithm to generate n correlated random numbers
2) Multiply the ith number by v[i] to give it the correct variance
3) Then add u[i] to the ith number to give it the correct mean.
If I do all this, will I get the distribution of numbers that I seek?
Thanks
Bob
|
|
|
|
|
Yes, this will work. To be perfectly clear, this method will work if the random numbers you are generating are being drawn from a normal/Gaussian distribution with mean 0 and variance 1. Bear in mind that, as a result of the law of large numbers, you will probably require a large number of samples to get close to the mean and variance that you want. I would suggest a minimum of 1000 samples and go up from there. Also, if you want the variance to be, say, 0.3, then you have to multiply (a normal variable with mean 0 and variance 1) by the square root of 0.3.
To summarize, if U is your Cholesky factor, and X is distributed as a normal variable with mean zero and covariance matrix I (the identity) then:
U*X is distributed as normal with mean zero and covariance matrix C.
Also, if you want a normal variable Z with mean m and covariance matrix C then Z = m + U*X where X is normal with mean 0 and variance-covariance matrix I (the identity). To change the variance of a given column of X, simply multiply that column by the square root of the variance you want to use. Since these are linear transformations, they will also preserve your correlation structure.
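Putting all three steps together in a numpy sketch; the values of C, m and v here are just examples:

import numpy as np

C = np.array([[1.0, 0.6, 0.3],    # desired correlation matrix
              [0.6, 1.0, 0.5],
              [0.3, 0.5, 1.0]])
m = np.array([1.0, -2.0, 0.5])    # desired means
v = np.array([0.3, 1.0, 4.0])     # desired variances

U = np.linalg.cholesky(C).T       # U.T @ U == C
X = np.random.standard_normal((100000, 3))
Z = (X @ U) * np.sqrt(v) + m      # scale each column by sqrt(variance), then shift by the mean

Since the scaling and shifting are linear, the correlation structure of the columns of Z is still (approximately) C.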
modified on Friday, November 7, 2008 2:58 AM
|
|
|
|
|
For many varied reasons, I decided to write my own balanced binary tree library. One feature of my library that I haven't seen in other freely available libraries is "clusters", and a "recluster" function.
The problem that I am attacking is that as elements are randomly added to the tree, the tree rotations that automatically happen to keep the tree balanced tend to spread the edge pointers all over the place. The end result is that in a large tree, a typical traversal can require reading nodes from all over physical memory (or disk pages): a very inefficient use of cache. By clustering the several levels of nodes near each subtree root together in physical memory, tree traversals can be sped up significantly, because nodes that always get visited together stay near one another within physical memory (or on the same disk page). For example, in a properly clustered tree, with clusters large enough to hold 4 levels of records (1+2+4+8 = 15 records), a tree of height 16 (>45000 records) can be traversed in only 4 cluster reads instead of the 16 reads that might be required of a non-clustered tree.
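Just to make that arithmetic concrete, a throwaway sketch (Python for brevity; this is not code from my library):

import math

def reads_per_search(tree_height, levels_per_cluster=1):
    # One read pulls a whole cluster (levels_per_cluster levels of the
    # search path) into cache, so a root-to-leaf search costs
    # ceil(height / levels_per_cluster) reads.
    return math.ceil(tree_height / levels_per_cluster)

print(reads_per_search(16))      # 16: unclustered, one node per read
print(reads_per_search(16, 4))   # 4: clusters holding 4 levels (15 nodes)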
What I don't know is a good way to measure how "out-of-cluster" a tree is. I'd like to be able to determine when a tree needs to be reclustered. I'm sure there is good theory out there for this, and the idea doesn't seem to be that difficult, but I haven't run into a solution yet. I've spent a little bit of time googling for a method, and probably a couple of hours thinking about it. Nothing yet.
Any ideas or helpful links?
David
---------
Empirical studies indicate that 20% of the people drink 80% of the beer. With C++ developers, the rule is that 80% of the developers understand at most 20% of the language. It is not the same 20% for different people, so don't count on them to understand each other's code.
http://yosefk.com/c++fqa/picture.html#fqa-6.6
---------
modified on Wednesday, November 5, 2008 11:33 AM
|
|
|
|
|
Have you ever considered B+ trees as implemented by "The C Database Tool Chest" from Mix Software? Mix is still on the web - Google "Mix Software". Note that the term "database" here is, in fact, just a B+ tree. The database is a tree of memory or file blocks which contain multiple keys and records.
I got this in '85 and have used it ever since. I got the source software (I don't know if this is still available). It is not Open Source, but any programs that are released can be used without any royalty; the only restrictions are that it must be linked into an object (no DLLs), and neither the source nor the library can be released, only the executable with the embedded code. With the source, you can make any modifications you want.
I modified it (originally 16-bit, compiled with Borland C) to build with VS2008, so it is now 32-bit and supports >4GB files. It was a major rewrite to unwind the INT usage, which is now 32 bits, whereas under the 16-bit compiler an INT was 16 bits. There were many conflicts with structure layout, and old 16-bit databases couldn't be read with the 32-bit version, nor the other way around. I built my version so that I could declare which size of INT I wanted, so I had only one converter to work with. I created two converters (16-bit and 32-bit) which could read the appropriate database (in ascending order) and output a flat file with the data it contained, and then read in the flat file and write the other bit-sized database.
Dave.
|
|
|
|
|
Member 4194593 wrote: Have you ever considered B+ trees as implemented by "The C Database Tool Chest", Mix Software.
No, I hadn't; but I am familiar with Mix, and have used both their Power C compiler and their C utilities toolchest.
I'll have to back up and reconsider whether this would be a good approach to my end goal (which was NOT to rewrite the balanced binary tree algorithms).
I originally rolled my own balanced binary tree routines because the ones I found on the web supported neither duplicate keys nor multi-threading. The clusters were an afterthought (actually, they came about from a "Hey, I can use the tree structure instead of yet another roll-my-own exotic structure, IF I can speed it up sufficiently; perhaps reclustering would do it" moment).
Thanks for the thoughts.
David
---------
Empirical studies indicate that 20% of the people drink 80% of the beer. With C++ developers, the rule is that 80% of the developers understand at most 20% of the language. It is not the same 20% for different people, so don't count on them to understand each other's code.
http://yosefk.com/c++fqa/picture.html#fqa-6.6
---------
|
|
|
|
|
David,
Good that you like David. You be David, I'll be Dave. Good name.
The blocks in the B+ tree do not exactly correspond to the clusters you use, but they do tend to group records together. The interesting thing is that the tree is self-leveling. All blocks are identified by a block number. All of the full records are on the bottom level of the tree, linked horizontally by their block numbers, and each block has an array of offsets to the records it contains; the records thus constitute an "array", making it easy to go from one record to another. Above the "leaf" nodes at the bottom is an interior node, or a layer of interior nodes, which contain only the keys (not the additional data); each key is associated with the block number containing the child nodes below. The first block (block 0) is a header block containing linking info (first leaf block, last leaf block, i.e. first record and last record) and the current root node.
Whenever a node (either a leaf node or an interior node) fills up (not when it is found to be full when a new record needs to be added, but whenever it becomes full due to the addition of a record), it attempts to offload some records to the prior or following same-level block; if that is not possible, the block is split, with half of the records in one block and the second half in the other, and the two blocks are linked. This also creates a new record for the parent block, pointing to the block that just split, which may in turn cause the parent to split, on up the tree until perhaps the root block itself splits, creating a new root and updating the pointer in the header block.
The blocks are allocated in memory from a buffer pool which is maintained with an LRU list. When the buffer pool is full and a new block is needed, the LRU block is written (if dirty) to the database file, randomly by block number, or just discarded and reused if not dirty. Blocks can thus be either in-memory or on-file. Blocks are not written to file unless they hit LRU status and a new block is needed, or at the end, when the tree is closed. Multiple trees can be open at one time (256 of them, sharing the buffer pool).
I built an application that needed to create a massive tree, then discard it and process other records comprised of the record numbers only. This caused the tree to be written out, then discarded. I wrote a new function to purge a tree without writing, by just modifying the header record to indicate that it was empty, writing that out and closing the tree, then returning all buffer pool records to the pool, then discarding the file at the application level. Saved a massive amount of time.
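In skeleton form, the leaf-split step looks something like this. This is a rough Python sketch of the scheme described above, not the Mix code, and all the names are mine:

class Block(object):
    # One B+ tree block.
    def __init__(self, is_leaf):
        self.is_leaf = is_leaf
        self.keys = []           # ordered search keys
        self.children = []       # interior blocks: child block numbers
        self.records = []        # leaf blocks: the full records
        self.next_leaf = None    # leaves linked horizontally

def split_leaf(leaf):
    # When a leaf fills (and can't offload to a neighbour), keep the
    # lower half, move the upper half into a new block, link the two,
    # and hand back a (separator key, new block) pair for insertion
    # into the parent, which may split in turn, up to the root.
    mid = len(leaf.keys) // 2
    new = Block(is_leaf=True)
    new.keys, leaf.keys = leaf.keys[mid:], leaf.keys[:mid]
    new.records, leaf.records = leaf.records[mid:], leaf.records[:mid]
    new.next_leaf, leaf.next_leaf = leaf.next_leaf, new
    return new.keys[0], new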
Dave.
|
|
|
|
|