Hello Sergei and all,
I started preparing a proposal for the project "Statistically optimize mysql-test runs by running less tests". I have a small draft that seems complete without entering any technical details. I am also taking a look at research on the field. Any suggestions on papers are appreciated.

Here is the proposal: draft of the proposal. All suggestions and comments are welcome, and encouraged.

How does it look? : )

Regards
Pablo


On Thu, Mar 6, 2014 at 5:53 PM, Sergei Golubchik <serg@mariadb.org> wrote:
Hi, Pablo!

On Mar 06, Pablo Estrada wrote:
>
> Although I am interested in software development, I am more interested
> in data analysis. That's why I switched from computer science to
> scientific computing, and that's why the project that interests me the
> most is the one called "Statistically optimize mysql-test runs by
> running less tests". I am very excited by the idea of using data to
> try to test more smartly.

Yes. You might also briefly review scientific publications in this area.
I've found a few (it was a couple of years ago), but their approach
was different from what is suggested here.

Anyway, my preliminary experiments shows that it's totally possible to
find 90% of test failures in only 10% of the time.

> If it was not possible to participate in the latter project, I would
> also be interested to work on adding support for "OR REPLACE, CREATE
> IF NOT EXISTS and DROP IF EXISTS".

This is mostly a straightforward coding task, not much to explain here.

> Depending on your feedback, I can get to work on a proposal for either
> one of the two projects. I'd be very happy to be able to participate
> in GSoC with MariaDB.

I'd recommend you to submit proposal for the first one.
It seems like a better match for you, and also it seems like you will
have less competition there. I'd expect more proposals for the second
task.

Regards,
Sergei