[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [sup-devel] email threading - tree vs. graph
On Sun, 21 Feb 2010 14:29:47 -0500, "W. Trevor King" <wking@drexel.edu> wrote:
> On Sun, Feb 21, 2010 at 09:52:28AM -0800, Nicolas Pouillard wrote:
> > On Sun, 21 Feb 2010 08:42:49 -0500, "W. Trevor King" <wking@drexel.edu> wrote:
> > > On Sun, Feb 21, 2010 at 08:38:35AM +0200, Tero Tilus wrote:
> > [...]
> > > > Have a pointer to code?
> > >
> > > My code is currently stuffed into an in-transition BE project, but it
> > > should be easy to separate. Grab the whole repo with Bazaar:
> > > bzr branch http://www.physics.drexel.edu/~wking/code/bzr/be.mailing-list
> > > Graphing module is libbe/util/graph.py. My very minimal browser is
> > > misc/mailbox-tools/mailgraph.py. Set up the BE version file with
> > > cd be.mailing-list
> > > make libbe/_version.py
> > > and run the browser with
> > > misc/mailbox-tools/mailgraph.py *.mbox
> > > Press 'h' for help.
> >
> > I've tried your program on a 100 messages mbox and got this:
> >
> > missing Message-ID: <HIDDEN>
> > ...
>
> You probably had a bunch of emails in you mbox with
> In-Reply-To: <HIDDEN>
> But no message(s) with
> Message-ID: <HIDDEN>
>
> If mailgraph.py can't find the parent message, it prints that warning
> and continues, so you can probably just ignore it.
>
> > Traceback (most recent call last):
> > ...
> > libbe.util.graph.CyclicGraph: 3 of 100 elements not reachable from tips
>
> You have a cyclic reference in your mbox somewhere. I've added some
> really inefficient code to actually *find* cycles (rather than just
> deducing their existence) and print useful error messages. Pull my
> current repo and try:
>
> $ misc/mailbox-tools/mailgraph.py --check-for-cycle *.mbox
>
> Then you'll have to go through the mbox (or a copy) by hand and break
> the cycle. The check only finds one cycle at a time, so you may need
> to iterate...
I get:
...
libbe.util.graph.CyclicGraphError: cycle detected:
Sun, 21 Fe Reinier Lamers [darcs-users] [patch167] Reintroduce UTF-8 tagging...
Sun, 21 Fe Reinier Lamers [darcs-users] [patch167] Reintroduce UTF-8 tagging...
Actually the mentioned mail have Message-ID equals to In-Reply-To.
While I'm reporting this issue and thus won't get any such messages, it would be
"nice" to have a more robust behavior in case of cycles. In particular these
auto-cycles can be just ignored.
--
Nicolas Pouillard
http://nicolaspouillard.fr
_______________________________________________
Sup-devel mailing list
Sup-devel@rubyforge.org
http://rubyforge.org/mailman/listinfo/sup-devel