Operations Research Competitions: ListWeaver

Tuesday, 26 June 2012

ListWeaver - WINNER

The winning code is shown below:

using System; 
using System.Collections.Generic; 
using System.Linq; 

namespace Orcomp 
{ 
    public class ListWeaver<T>
    { 
        public ListWeaver() 
        { 
            Yarns = new List<IList<T>>(); 
        } 

        public List<IList<T>> Yarns { get; set; }

        public bool CanWeave(IList<T> yarn)
        { 
            if (yarn == null)
                return false;

            List<IList<T>> yarns = Yarns.ToList(); 
            yarns.Add(yarn); 

            if (Walk(yarns) != null)
                return true;
            else 
                return false;
        } 

        public List<T> Weave() 
        { 
            return Walk(Yarns); 
        } 

        private List<T> Walk(List<IList<T>> yarns)
        { 
            var edges = new Dictionary<T, HashSet<T>>(); 
            foreach (var yarn in yarns)
            { 
                for (int i = 1; i < yarn.Count; i++)
                { 
                    if (!edges.ContainsKey(yarn[i])) 
                        edges[yarn[i]] = new HashSet<T>(); 
                    edges[yarn[i]].Add(yarn[i - 1]); 
                } 
            } 

            var path = new List<T>(edges.Count); 
            var seen = new HashSet<T>(); 

            var valid = yarns.Where(yarn => yarn.Count > 0).Select(yarn => yarn.Last()).All(node => Walk(node, path, edges, seen));
            if (valid) 
                return path; 
            else 
                return null;
        } 

        private bool Walk(T root, List<T> path, Dictionary<T, HashSet<T>> edges, HashSet<T> seen_all)
        { 
            var seen_now = new HashSet<T>(); 
            var to_visit = new LinkedList<KeyValuePair<T, bool>>(); 
            to_visit.AddLast(new KeyValuePair<T, bool>(root, false)); 

            while (to_visit.Count > 0)
            { 
                var last = to_visit.Last.Value; 
                var node = last.Key; 
                var drop = last.Value; 
                to_visit.RemoveLast(); 

                if (drop) 
                { 
                    path.Add(node); 
                    seen_now.Remove(node); 
                    continue; 
                } 

                if (seen_now.Contains(node)) 
                    return false;

                if (seen_all.Contains(node)) 
                    continue; 

                seen_all.Add(node); 
                seen_now.Add(node); 

                to_visit.AddLast(new KeyValuePair<T, bool>(node, true)); 
                if (edges.ContainsKey(node)) 
                    foreach (var edge in edges[node].Reverse())
                        to_visit.AddLast(new KeyValuePair<T, bool>(edge, false)); 
            } 

            return true; 
        } 
    } 
}

Feedback and Improvement:

The winning code runs in O(V+E) time, where V is the number of distinct elements and E is the number of adjacent pairs in the lists, and it makes very good use of HashSets and Dictionaries. For example, the time it takes to solve all benchmark tests is the same whether the lists are reversed or not, and the last two highly connected graphs solve efficiently in the same amount of time. Well done!

There is not much I can think of to improve in the code. However, I would like to introduce another approach and start a new competition for anyone who can solve this problem efficiently using an adjacency matrix representation of the graph, stored as a sparse matrix.

Some preliminary work suggests that a solution to the ListWeaver problem exists if the adjacency matrix can be transformed into an upper triangular matrix (for example, by applying the same reordering to its rows and columns).
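As a rough illustration of this observation (a sketch under assumptions, not the competition's reference approach): if the elements are numbered in the order of a valid weave, every "comes before" edge points from a lower index to a higher one, so the adjacency matrix arranged in that order has entries only above the diagonal. The names `TriangularSketch`, `BuildAdjacency` and `IsStrictlyUpperTriangular` are hypothetical, the sample yarns are made up for the example, and a dense `bool[,]` stands in for a real sparse matrix.

```csharp
using System;
using System.Collections.Generic;

public static class TriangularSketch
{
    // Builds a dense adjacency matrix whose rows and columns follow `order`.
    // a[i, j] is true when order[i] directly precedes order[j] in some yarn.
    // Assumption: `order` contains every element that appears in the yarns.
    public static bool[,] BuildAdjacency(IList<string> order, IEnumerable<IList<string>> yarns)
    {
        var index = new Dictionary<string, int>();
        for (int i = 0; i < order.Count; i++)
            index[order[i]] = i;

        var a = new bool[order.Count, order.Count];
        foreach (var yarn in yarns)
            for (int k = 1; k < yarn.Count; k++)
                a[index[yarn[k - 1]], index[yarn[k]]] = true;
        return a;
    }

    // True when no entry lies on or below the diagonal.
    public static bool IsStrictlyUpperTriangular(bool[,] a)
    {
        int n = a.GetLength(0);
        for (int i = 0; i < n; i++)
            for (int j = 0; j <= i; j++)
                if (a[i, j]) return false;
        return true;
    }

    public static void Main()
    {
        var yarns = new List<IList<string>>
        {
            new List<string> { "Orange", "Purple" },
            new List<string> { "Green", "Purple" },
        };
        var validOrder = new List<string> { "Green", "Orange", "Purple" };
        var invalidOrder = new List<string> { "Purple", "Green", "Orange" };

        Console.WriteLine(IsStrictlyUpperTriangular(BuildAdjacency(validOrder, yarns)));   // True
        Console.WriteLine(IsStrictlyUpperTriangular(BuildAdjacency(invalidOrder, yarns))); // False
    }
}
```

The sketch only checks a given ordering. Finding a reordering that makes the matrix triangular is the actual challenge, and doing so efficiently on a sparse representation is left open here.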

The following Wikipedia page is a good starting point:
- Wikipedia - Adjacency matrix

You are allowed to use third-party open-source libraries to represent the sparse matrices, if required.

22 comments:

Renze, 27 June 2012 at 02:50
Hello,

Please have a look at the following test case (I can probably make it smaller if I have a closer look):

[TestMethod]
public void CanWeave_CollectionOfLists_ReturnsCorrectSequenceRdW()
{
    // Arrange
    var lw = new ListWeaver<string>();
    var list1 = new List<string> { "Blue" };
    var list2 = new List<string> { "Bronze", "Yellow", "Purple" };
    var list3 = new List<string> { "Black", "Orange" };
    var list4 = new List<string> { "Red", "Blue", "Gold" };
    var list5 = new List<string> { "Silver", "Blue" };
    var list6 = new List<string> { "Bronze", "Blue" };

    // Act
    lw.Yarns.Add(list1);
    lw.Yarns.Add(list2);
    lw.Yarns.Add(list3);
    lw.Yarns.Add(list4);
    lw.Yarns.Add(list5);
    lw.Yarns.Add(list6);
    var result = lw.Weave();

    var correct = new List<string> { "Bronze", "Red", "Silver", "Blue", "Yellow", "Purple", "Black", "Orange", "Gold" };

    // Assert (element-by-element comparison of the two lists)
    CollectionAssert.AreEqual(correct, result);
}

The competition winner returns
{ "Red", "Silver", "Bronze", "Blue", "Yellow", "Purple", "Black", "Orange", "Gold" }

Blue is in the same location. I think Bronze should come first in the result (instead of Red), because it precedes Blue in list6 and is introduced in list2 (Red is introduced in list4).

If the competition winner is correct, can you please explain why?

Cheers,
Renze.

Renze, 27 June 2012 at 05:45
Hello,

I found a simpler test case for which the competition winner produces a result I did not expect (or rather, I wrote a program that generated a simpler test case).

Input:
Purple
Green
Orange,Purple
Green,Purple

The competition winner produces Orange,Green,Purple.

I think it should produce Green, Orange, Purple. Can you explain what is correct and why?

Thank you,
renze.

evgeny777, 27 June 2012 at 07:06
Yes, my test gives the same result: "Green, Orange, Purple". If we treat the input data as "Green goes before Orange unless the reverse is clearly stated somewhere in the lower lists", then there is no other option.

On the other hand, we have an ambiguity here. For example, we can say that "Green comes before Orange, because it is in the higher priority list than Orange", and it looks like we're right. But we can also say that "Although Green appears in a higher priority list than Orange, Green is connected to Purple *AFTER* Orange is connected to Purple, so Green should come after Orange in the result set".

However, it looks like this ambiguity is not resolved by the problem statement. Following "vertex priority" or "edge priority", we arrive at completely different algorithms. The fact that the test cases are hidden only worsens the situation.

I propose that Ben revise the problem statement, especially because he has just posted a variation of the ListWeaver contest.

orc, 27 June 2012 at 08:12
evgeny777 is correct in his assessment: "Green comes before Orange, because it is in the higher priority list than Orange"

The answer should be "Green, Orange, Purple".
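One possible reading of this rule can be sketched as follows. It is a minimal sketch under assumptions, not the competition's reference solution or any contestant's fix: each element gets a priority equal to the position of its first appearance (scanning the lists from top to bottom), elements are emitted in priority order, and before an element is emitted its not-yet-emitted predecessors are emitted first, again in priority order. The names `PriorityWeaveSketch`, `Weave` and `Visit` are hypothetical, and the recursion is kept for brevity even though very long chains would need an explicit stack.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

public static class PriorityWeaveSketch
{
    // Returns the woven list, or null when the yarns contradict each other (a cycle).
    public static List<string> Weave(IList<IList<string>> yarns)
    {
        // priority[x] = rank of x's first appearance, scanning yarns top to bottom.
        var priority = new Dictionary<string, int>();
        // preds[x] = elements that directly precede x in at least one yarn.
        var preds = new Dictionary<string, HashSet<string>>();

        foreach (var yarn in yarns)
        {
            for (int i = 0; i < yarn.Count; i++)
            {
                if (!priority.ContainsKey(yarn[i]))
                {
                    priority.Add(yarn[i], priority.Count);
                    preds.Add(yarn[i], new HashSet<string>());
                }
                if (i > 0)
                    preds[yarn[i]].Add(yarn[i - 1]);
            }
        }

        var result = new List<string>(priority.Count);
        var state = new Dictionary<string, int>(); // 1 = in progress, 2 = emitted

        foreach (var node in priority.Keys.OrderBy(k => priority[k]))
            if (!Visit(node, priority, preds, state, result))
                return null;
        return result;
    }

    private static bool Visit(string node, Dictionary<string, int> priority,
        Dictionary<string, HashSet<string>> preds, Dictionary<string, int> state, List<string> result)
    {
        int s;
        if (state.TryGetValue(node, out s))
            return s == 2; // meeting an in-progress node again means a cycle

        state[node] = 1;
        foreach (var p in preds[node].OrderBy(x => priority[x]))
            if (!Visit(p, priority, preds, state, result))
                return false;

        state[node] = 2;
        result.Add(node);
        return true;
    }

    public static void Main()
    {
        var yarns = new List<IList<string>>
        {
            new List<string> { "Purple" },
            new List<string> { "Green" },
            new List<string> { "Orange", "Purple" },
            new List<string> { "Green", "Purple" },
        };
        Console.WriteLine(string.Join(", ", Weave(yarns))); // Green, Orange, Purple
    }
}
```

Under the same assumptions, the sketch also yields the order Renze expected for the larger six-list test case above (Bronze, Red, Silver, Blue, Yellow, Purple, Black, Orange, Gold).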

Please note that the winner was selected based on the unit tests that were made available with the benchmarks. The competition statement did ask people to submit edge-case tests. Some people did provide extra tests, which were also added to the competition unit tests. At the time the selection was made, the winning code passed all unit tests successfully.

evgeny777, 27 June 2012 at 12:53
If so, then it is quite strange to select a winner based on unit tests which are kept secret. The first thing that should always be checked is algorithm *correctness*; code length and execution time should come after that. And in your previous post you actually state that the winning algorithm is *NOT* correct, because in some cases it produces wrong results. I do not want to dispute the results and take away somebody's victory, but if you continue selecting the winner according to your hidden unit tests and nothing else, you might soon find nobody taking part in your contests.

bawr, 27 June 2012 at 14:09
The winning code - it's mine, by the way - passed all the tests that were available at the time the contest closed. This is *including* Ben's tests (which had already been made public at that time), and any *additional* tests submitted by other contestants.

Now, I can see why it's not quite correct in the edge cases you provided (there's a simple fix, by the way, perhaps Ben wants to offer a prize for that to someone?), but again - these additional cases were provided *after* the code has been selected as the winning entry.

If this had happened earlier, I would not have won - or rather, I would have had one more chance to fix my code, since I only used two out of three available entries.

Best regards and best of luck on future entries!