Unix Technical Forum

Google's Summer of Code ...

This is a discussion on Google's Summer of Code ... within the pgsql Hackers forums, part of the PostgreSQL category; --> Funny how things come full circle... in previous threads someone mentioned a few low hanging fruit items, we should ...


Go Back   Unix Technical Forum > Database Server Software > PostgreSQL > pgsql Hackers

FAQ Members List Calendar Search Today's Posts Mark Forums Read

Reply

 

LinkBack Thread Tools Display Modes
  #11 (permalink)  
Old 04-11-2008, 05:10 AM
Robert Treat
 
Posts: n/a
Default Re: Google's Summer of Code ...

Funny how things come full circle... in previous threads someone mentioned a
few low hanging fruit items, we should start there, and maybe get one of the
more hard-core hackers to go through the TODO list and mark up some other
items.

My take on the mentor thing was that the students could probably post on
-hackers if they ran into anything too tricky, though they'll need a poc for
scheduling and timelines and getting them up to speed on development
environment and such.

Incidentally I havent seen any objections, if there are none should we go
ahead and whip up an email to google? Do we want to run this through the
foundation? ISTM we would, though I don't know the full extent of what
ramifications that would have.


Robert Treat

On Wednesday 01 June 2005 15:11, Jonah H. Harris wrote:
> Not sure about the current 'students'. I need to take a look and see
> what kind of somewhat easy stuff is on the TODO (I haven't seen it in
> awhile).
>
> I like your suggestion for the mailing list.
>
> Marc G. Fournier wrote:
> > One concern that was raised off of this list was the steep learning
> > curve to start with, to get to the point of beign able to do anything ...
> >
> > Do we have any 'students' that are already up to speed, enough so that
> > they'd be able to accomplish something significant over a 2-3 month
> > period?
> >
> > On Wed, 1 Jun 2005, Marc G. Fournier wrote:
> >> Are there any restrictions on 'mentor groups'? For instance, could
> >> we create a 'mentors' mailing list, closed subscriptions, that
> >> contained those willing to actively mentor, so that those being
> >> mentor'd have multiple ppl's knowledge's to feed upon?
> >>
> >> On Wed, 1 Jun 2005, Jonah H. Harris wrote:
> >>> My participation would be limited to the actual number of
> >>> newcomers. I feel comfortable helping 2 or 3 people; any more than
> >>> that would impact my clients and consulting time. Although, it
> >>> would never hurt to have additional mentors .
> >>>
> >>> Robert Treat wrote:
> >>>> We have a whole TODO list of potential items for people to work on,
> >>>> and we have several member of this list that work for universities
> >>>> in some capacity, so istm we ought to give it a shot. I would
> >>>> think this is something that the foundation would be good to be
> >>>> involved in as well, as it would certainly help it all appear
> >>>> official. We just need to tie up the loose ends on who would be
> >>>> available to do the mentoring, which might need to be different
> >>>> people depending on the work involved, unless you think you could
> >>>> handle all comers Jonah? Robert Treat
> >>>>
> >>>> On Wednesday 01 June 2005 10:53, Jonah H. Harris wrote:
> >>>>> Marc,
> >>>>>
> >>>>> If you wanted to submit PostgreSQL to Google for this, I'd be
> >>>>> willing to
> >>>>> work with any potentials. Should someone choose to work on
> >>>>> PostgreSQL
> >>>>> projects, the money can go to the group.
> >>>>>
> >>>>> If anyone sees a problem with submitting under the PostgreSQL Global
> >>>>> Development Group, I can do it under my company.
> >>>>>
> >>>>> Thoughts?
> >>>>>
> >>>>> Marc G. Fournier wrote:
> >>>>>> In case nobody has seen this:
> >>>>>>
> >>>>>> http://code.google.com/summerofcode.html
> >>>>>>
> >>>>>> Might be something to help fun, say, the GiST stuff? Just need to
> >>>>>> find a student that could be mentor'd/directed/brought up to speed?
> >>>>>>
> >>>>>>
> >>>>>> ----
> >>>>>> Marc G. Fournier Hub.Org Networking Services
> >>>>>> (http://www.hub.org)
> >>>>>> Email: scrappy@hub.org Yahoo!: yscrappy ICQ:
> >>>>>> 7615664
> >>>>>>
> >>>>>> ---------------------------(end of
> >>>>>> broadcast)---------------------------
> >>>>>> TIP 5: Have you checked our extensive FAQ?
> >>>>>>
> >>>>>> http://www.postgresql.org/docs/faq
> >>>>>
> >>>>> ---------------------------(end of
> >>>>> broadcast)---------------------------
> >>>>> TIP 4: Don't 'kill -9' the postmaster
> >>
> >> ----
> >> Marc G. Fournier Hub.Org Networking Services
> >> (http://www.hub.org)
> >> Email: scrappy@hub.org Yahoo!: yscrappy ICQ:
> >> 7615664
> >>
> >> ---------------------------(end of broadcast)---------------------------
> >> TIP 9: the planner will ignore your desire to choose an index scan if
> >> your
> >> joining column's datatypes do not match

> >
> > ----
> > Marc G. Fournier Hub.Org Networking Services
> > (http://www.hub.org)
> > Email: scrappy@hub.org Yahoo!: yscrappy ICQ:
> > 7615664
> >
> > ---------------------------(end of broadcast)---------------------------
> > TIP 9: the planner will ignore your desire to choose an index scan if
> > your
> > joining column's datatypes do not match


--
Robert Treat
Build A Brighter Lamp :: Linux Apache {middleware} PostgreSQL

---------------------------(end of broadcast)---------------------------
TIP 3: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to majordomo@postgresql.org so that your
message can get through to the mailing list cleanly

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #12 (permalink)  
Old 04-11-2008, 05:11 AM
Marc G. Fournier
 
Posts: n/a
Default Re: Google's Summer of Code ...


'k, since there appears to be interest, I've tried to read a bit more up
on it, specifically the FAQs ... seems simple enough, but someone else may
have found a link that I haven't ...

Basically, developer (ie. Ramy Hassan) would go to:

http://code.google.com/soc_application.html

Fill out this form and put 'Other' for Sponsor, and PostgreSQL Global
Development Group in the blank space provided ... I can't find any
specific "sponsor application form" on Google, but, again, may be missing
something ...

I've sent a note off to Google asking for more information on becoming a
mentor, will post once I hear back more ...

While waiting, for any students that are interested, can I suggest going
to the above URL, look at what is requested in preparation ...



On Thu, 2 Jun 2005, Oleg Bartunov wrote:

> On Wed, 1 Jun 2005, Marc G. Fournier wrote:
>
>>
>> One concern that was raised off of this list was the steep learning curve
>> to start with, to get to the point of beign able to do anything ...
>>
>> Do we have any 'students' that are already up to speed, enough so that
>> they'd be able to accomplish something significant over a 2-3 month period?

>
> I have a student who is already working on developing custom access method
> to very large astronomical catalogues. Also, Ramy Hassan
> (rhassan@cs.purdue.edu)
> from Purdue University is working on SP-GiST and I think his work would be
> well suited for Google program.
>
>
>>
>> On Wed, 1 Jun 2005, Marc G. Fournier wrote:
>>
>>>
>>> Are there any restrictions on 'mentor groups'? For instance, could we
>>> create a 'mentors' mailing list, closed subscriptions, that contained
>>> those willing to actively mentor, so that those being mentor'd have
>>> multiple ppl's knowledge's to feed upon?
>>>
>>> On Wed, 1 Jun 2005, Jonah H. Harris wrote:
>>>
>>>> My participation would be limited to the actual number of newcomers. I
>>>> feel comfortable helping 2 or 3 people; any more than that would impact
>>>> my clients and consulting time. Although, it would never hurt to have
>>>> additional mentors .
>>>>
>>>>
>>>> Robert Treat wrote:
>>>>
>>>>> We have a whole TODO list of potential items for people to work on, and
>>>>> we have several member of this list that work for universities in some
>>>>> capacity, so istm we ought to give it a shot. I would think this is
>>>>> something that the foundation would be good to be involved in as well,
>>>>> as it would certainly help it all appear official. We just need to tie
>>>>> up the loose ends on who would be available to do the mentoring, which
>>>>> might need to be different people depending on the work involved, unless
>>>>> you think you could handle all comers Jonah? Robert Treat
>>>>>
>>>>> On Wednesday 01 June 2005 10:53, Jonah H. Harris wrote:
>>>>>
>>>>>> Marc,
>>>>>>
>>>>>> If you wanted to submit PostgreSQL to Google for this, I'd be willing
>>>>>> to
>>>>>> work with any potentials. Should someone choose to work on PostgreSQL
>>>>>> projects, the money can go to the group.
>>>>>>
>>>>>> If anyone sees a problem with submitting under the PostgreSQL Global
>>>>>> Development Group, I can do it under my company.
>>>>>>
>>>>>> Thoughts?
>>>>>>
>>>>>> Marc G. Fournier wrote:
>>>>>>
>>>>>>> In case nobody has seen this:
>>>>>>>
>>>>>>> http://code.google.com/summerofcode.html
>>>>>>>
>>>>>>> Might be something to help fun, say, the GiST stuff? Just need to
>>>>>>> find a student that could be mentor'd/directed/brought up to speed?
>>>>>>>
>>>>>>>
>>>>>>> ----
>>>>>>> Marc G. Fournier Hub.Org Networking Services
>>>>>>> (http://www.hub.org)
>>>>>>> Email: scrappy@hub.org Yahoo!: yscrappy ICQ:
>>>>>>> 7615664
>>>>>>>
>>>>>>> ---------------------------(end of
>>>>>>> broadcast)---------------------------
>>>>>>> TIP 5: Have you checked our extensive FAQ?
>>>>>>>
>>>>>>> http://www.postgresql.org/docs/faq
>>>>>>>
>>>>>> ---------------------------(end of
>>>>>> broadcast)---------------------------
>>>>>> TIP 4: Don't 'kill -9' the postmaster
>>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>
>>> ----
>>> Marc G. Fournier Hub.Org Networking Services
>>> (http://www.hub.org)
>>> Email: scrappy@hub.org Yahoo!: yscrappy ICQ:
>>> 7615664
>>>
>>> ---------------------------(end of broadcast)---------------------------
>>> TIP 9: the planner will ignore your desire to choose an index scan if your
>>> joining column's datatypes do not match
>>>
>>>
>>>

>>
>> ----
>> Marc G. Fournier Hub.Org Networking Services (http://www.hub.org)
>> Email: scrappy@hub.org Yahoo!: yscrappy ICQ: 7615664
>>
>> ---------------------------(end of broadcast)---------------------------
>> TIP 9: the planner will ignore your desire to choose an index scan if your
>> joining column's datatypes do not match
>>

>
> Regards,
> Oleg
> __________________________________________________ ___________
> Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
> Sternberg Astronomical Institute, Moscow University (Russia)
> Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
> phone: +007(095)939-16-83, +007(095)939-23-83
>


----
Marc G. Fournier Hub.Org Networking Services (http://www.hub.org)
Email: scrappy@hub.org Yahoo!: yscrappy ICQ: 7615664

---------------------------(end of broadcast)---------------------------
TIP 1: subscribe and unsubscribe commands go to majordomo@postgresql.org

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #13 (permalink)  
Old 04-11-2008, 05:11 AM
Simon Riggs
 
Posts: n/a
Default Re: Google's Summer of Code ...

On Wed, 2005-06-01 at 15:50 -0500, Meredith L. Patterson wrote:
> Marc G. Fournier wrote:
> > Do we have any 'students' that are already up to speed, enough so that
> > they'd be able to accomplish something significant over a 2-3 month period?

>
> Well, I suppose now might be a good time to de-lurk. Hi, my name's
> Meredith, I'm a PhD student at the University of Iowa, I've been reading
> pgsql-hackers for a few months now, and I'm planning on submitting a
> Postgres-related project to the Summer of Code program.
>
> My research area is data mining


Sounds interesting...

> I've been reluctant to mention this in the past, mainly because I don't
> see it as enormously useful to Postgres users as a whole; your average
> user doesn't know what a support vector machine is, and while I have a
> laundry list of use cases for this kind of search capability, the
> Postgres end of it is more useful as support for the system as a whole
> rather than a standalone Postgres feature.


Is it possible that you could put sufficient of the application into
PostgreSQL to genericise some features? Stonebraker's Third Wave was
*all* about putting data intensive operations closer to where the data
is stored/accessed. That definitely applies to data mining, so you are
in the right project.

What would you do AFTER you've built in ALL of the above? Why not think
of ways to lay the ground work for all of the things you'd like to do in
the future.

What can you do for other data mining hackers? How can you lay the
groundwork for others to assist your developments?

> Phew. Thanks for reading. I know there are a lot of TODO items that are
> high on the priority list, and I see that others here already know
> students who are working on projects more closely related to those
> things, so I understand entirely if y'all would prefer to work with
> someone who's adding more directly useful functionality to Postgres. I'm
> very glad to see the PGDG getting involved with the Summer of Code
> project, and either way, I look forward to having enough free time to
> start tackling various TODO items myself.


The project needs free-thinkers, so don't feel that you have to stick to
the TODO. Just make sure you have a very clear benefit case.

Best Regards, Simon Riggs


---------------------------(end of broadcast)---------------------------
TIP 3: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to majordomo@postgresql.org so that your
message can get through to the mailing list cleanly

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #14 (permalink)  
Old 04-11-2008, 05:11 AM
Marc G. Fournier
 
Posts: n/a
Default Re: Google's Summer of Code ...

On Wed, 1 Jun 2005, Robert Treat wrote:

> Funny how things come full circle... in previous threads someone mentioned a
> few low hanging fruit items, we should start there, and maybe get one of the
> more hard-core hackers to go through the TODO list and mark up some other
> items.
>
> My take on the mentor thing was that the students could probably post on
> -hackers if they ran into anything too tricky, though they'll need a poc
> for scheduling and timelines and getting them up to speed on development
> environment and such.
>
> Incidentally I havent seen any objections, if there are none should we
> go ahead and whip up an email to google? Do we want to run this through
> the foundation? ISTM we would, though I don't know the full extent of
> what ramifications that would have.


No, the Foundation wasn't formed for this purpose, and doing something
like this through there would end up taking longer then the deadline to
pass through even if it was ...

As for the mentoring program ... my read on it is that the developer has
to make a proposal for what they wish to accomplish, that proposal goes to
the mentoring organization who then approves or rejects the proposal ...
the proposal, again, from my take, can't be simply tackling a few random
issues, but is more meant to deal with large projects that would take
someone full time to get done ... ie. the GiST rewrite or something of
similar scale ...

----
Marc G. Fournier Hub.Org Networking Services (http://www.hub.org)
Email: scrappy@hub.org Yahoo!: yscrappy ICQ: 7615664

---------------------------(end of broadcast)---------------------------
TIP 3: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to majordomo@postgresql.org so that your
message can get through to the mailing list cleanly

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #15 (permalink)  
Old 04-11-2008, 05:11 AM
Meredith L. Patterson
 
Posts: n/a
Default Re: Google's Summer of Code ...

Simon Riggs wrote:
> Is it possible that you could put sufficient of the application into
> PostgreSQL to genericise some features? Stonebraker's Third Wave was
> *all* about putting data intensive operations closer to where the data
> is stored/accessed.


And just like that, a lightbulb goes off in my head.

I'd been reluctant to push the training step inside the engine, because
I couldn't come up with a good way of doing it, but now it seems so
obvious. A ranking support vector machine takes as input a series of
partial orders -- think of it as several "buckets" into which data items
are thrown. Or, if you will, a list of lists of unique identifiers. And
that would be *easy* to pass as part of a query string. I'm envisioning
a syntax like:

ORDER BY SVM linear KEY foo ((1, 2, 3), (4, 5), (6, 7, 8), (9))

So this would be a partial ordering where each number is the key (PK is
column 'foo') of some tuple in a table, and 1, 2, 3 > 4, 5 > 6, 7, 8 > 9
in terms of the user's preference.

Use that (much more human-readable than I had originally envisioned)
input to learn the actual ranking function inside the database, apply
that ranking to the results, and boom -- an ORDER BY clause extrapolated
directly from a partial ranking, with no pesky outside-the-database
learning step.

(Nonlinear kernels have some additional parameters, and tuning them can
be something of a black art, but the syntax can be extended to let
people specify them. Default values would also be necessary.)

I'll continue to think on this, but already this approach strikes me as
a lot more useful to the average user. Thanks, Simon!

Cheers,
Meredith

---------------------------(end of broadcast)---------------------------
TIP 6: Have you searched our list archives?

http://archives.postgresql.org

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #16 (permalink)  
Old 04-11-2008, 05:11 AM
Jonah H. Harris
 
Posts: n/a
Default Re: Google's Summer of Code ...

So, has anyone gone ahead and contacted Google yet?



---------------------------(end of broadcast)---------------------------
TIP 2: you can get off all lists at once with the unregister command
(send "unregister YourEmailAddressHere" to majordomo@postgresql.org)

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #17 (permalink)  
Old 04-11-2008, 05:12 AM
Marc G. Fournier
 
Posts: n/a
Default Re: Google's Summer of Code ...


Yes, been working on this since last night ...

On Thu, 2 Jun 2005, Jonah H. Harris wrote:

> So, has anyone gone ahead and contacted Google yet?
>
>
>
> ---------------------------(end of broadcast)---------------------------
> TIP 2: you can get off all lists at once with the unregister command
> (send "unregister YourEmailAddressHere" to majordomo@postgresql.org)
>
>
>


----
Marc G. Fournier Hub.Org Networking Services (http://www.hub.org)
Email: scrappy@hub.org Yahoo!: yscrappy ICQ: 7615664

---------------------------(end of broadcast)---------------------------
TIP 1: subscribe and unsubscribe commands go to majordomo@postgresql.org

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #18 (permalink)  
Old 04-11-2008, 05:12 AM
Jonah H. Harris
 
Posts: n/a
Default Re: Google's Summer of Code ...

Cool. Thanks Marc.

Marc G. Fournier wrote:

>
> Yes, been working on this since last night ...
>
> On Thu, 2 Jun 2005, Jonah H. Harris wrote:
>
>> So, has anyone gone ahead and contacted Google yet?
>>
>>
>>
>> ---------------------------(end of broadcast)---------------------------
>> TIP 2: you can get off all lists at once with the unregister command
>> (send "unregister YourEmailAddressHere" to majordomo@postgresql.org)
>>
>>
>>

>
> ----
> Marc G. Fournier Hub.Org Networking Services
> (http://www.hub.org)
> Email: scrappy@hub.org Yahoo!: yscrappy ICQ:
> 7615664




---------------------------(end of broadcast)---------------------------
TIP 3: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to majordomo@postgresql.org so that your
message can get through to the mailing list cleanly

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #19 (permalink)  
Old 04-11-2008, 05:12 AM
Vishal Kashyap @ [SaiHertz]
 
Posts: n/a
Default Re: Google's Summer of Code ...

Dear all ,

>
> Incidentally I havent seen any objections, if there are none should we go
> ahead and whip up an email to google? Do we want to run this through the



We must go straight away for it.

I wonder why bigies like Bruce , Tom , Josh have not responded to this yet.

To my opinion the best way to do it is make a single person
responsible for this all and let every thing be done via him from
filling the application to the end.

--
With Best Regards,
Vishal Kashyap.
Lead Software Developer,
http://saihertz.com,
http://vishalkashyap.tk

---------------------------(end of broadcast)---------------------------
TIP 8: explain analyze is your friend

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #20 (permalink)  
Old 04-11-2008, 05:12 AM
Marc G. Fournier
 
Posts: n/a
Default Re: Google's Summer of Code ...


It is already being worked on ... more once we know more ...

On Thu, 2 Jun 2005, Vishal Kashyap @ [SaiHertz] wrote:

> Dear all ,
>
>>
>> Incidentally I havent seen any objections, if there are none should we go
>> ahead and whip up an email to google? Do we want to run this through the

>
>
> We must go straight away for it.
>
> I wonder why bigies like Bruce , Tom , Josh have not responded to this yet.
>
> To my opinion the best way to do it is make a single person
> responsible for this all and let every thing be done via him from
> filling the application to the end.
>
> --
> With Best Regards,
> Vishal Kashyap.
> Lead Software Developer,
> http://saihertz.com,
> http://vishalkashyap.tk
>


----
Marc G. Fournier Hub.Org Networking Services (http://www.hub.org)
Email: scrappy@hub.org Yahoo!: yscrappy ICQ: 7615664

---------------------------(end of broadcast)---------------------------
TIP 1: subscribe and unsubscribe commands go to majordomo@postgresql.org

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
Reply


Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On
Forum Jump


All times are GMT. The time now is 04:25 AM.


Powered by vBulletin® Version 3.6.5
Copyright ©2000 - 2008, Jelsoft Enterprises Ltd.
Search Engine Optimization by vBSEO 3.2.0
www.UnixAdminTalk.com