Hi Alex,
With the team at Bordeaux, we are working on an implementation to query histories (the tool is named VPraxis). I implement a importer for Smalltalk. For now, I can load histories for a package, for a metacello configuration, from a repository.
A first part is build in Pharo (building snapshots from Monticello), and a second part in build with our java software and build the history file. We can export xml, sql or swipl, which allows us to build prolog queries.
Here (http://code.google.com/p/harmony/wiki/VPraxis) is a tutorial for the java VPraxis. For Smalltalk, I am working on it.
I generated the current Mondrian history in a swipl format, available here: http://dl.dropbox.com/u/7739334/MondrianHistorySwipl.zip An important point is that we do not into account the multiple branches (it is a current work :) ). So, when there are multiple branches, for now, we check were is the merge and we "ignore the multiple branches".
Now, for your question, we create a script: http://dl.dropbox.com/u/7739334/mondrianScript.txt Using swi-prolog (http://www.swi-prolog.org/) , you can call, in this script, the method moreN(L, 10).
For the package Mondrian-Layout, it returns: L = [id_MOAbstractGraphLayout, id_MOCircleLayout, id_MOSugiyamaLayout]. Which are the three classes that have changed more that 10 times.
For now, I am working on packaging all the things together. But if you have some other requests, do not hesitate to ask me :)
Cheers, Jannik
On Nov 24, 2011, at 14:05 , Alexandre Bergel wrote:
Having a reification in Moose of 100 versions of Mondrian for example :-)
Just answering the question 'Which classes and methods of Mondrian have changed more than 10 times since the day Mondrian was born?' cannot be easily done without a lot of memory
Alexandre
On 24 Nov 2011, at 03:27, Francois Stephany wrote:
I'm wondering: how big is a dataset > 500MB ? I've no idea how big it is. Alex, what is your use case (in practice!) for more than 500MB?
On 23/11/11 18:25, Igor Stasenko wrote:
It is problematic, and requires different memory management than we currently have. I think if you need really big data sets, then use gemstone, which is developed to deal with that specifically.
-- _,.;:~^~:;._,.;:~^~:;._,.;:~^~:;._,.;:~^~:;._,.;: Alexandre Bergel http://www.bergel.eu ^~:;._,.;:~^~:;._,.;:~^~:;._,.;:~^~:;._,.;:~^~:;.
--- Jannik Laval