After a long summer that included an internship at MSR Redmond/Bing, attending SIGCOMM in Toronto, and a trip to Bangladesh, I'm back in Berkeley. Time to work!
This summer I worked with Dave Maltz and Lijiang Fang at MSR/Bing on a datacenter-related problem.
I attended my second SIGCOMM and had the privilege of giving my first talk at the flagship networking conference. I presented Orchestra, and the talk was very well attended even though it was the last one of the day at 6 PM. I'd like to thank everyone for showing up and also for the lively … Continue Reading ››
A technical report describing the key concepts behind Spark is available online. The abstract follows:
We present Resilient Distributed Datasets (RDDs), a distributed memory abstraction that allows programmers to perform in-memory computations on large clusters while retaining the fault tolerance of data flow models like MapReduce. RDDs are motivated by two types of applications … Continue Reading ››
We have been working on the Spark cluster computing framework for the last couple of years. It has always been open source under the BSD license on GitHub. But yesterday, during the AMPLab summer retreat, Matei announced the official launch of the Spark website (spark-project.org) and mailing lists, along with the 0.2 release … Continue Reading ››
An extended, updated, and emended version of our ViNEYard paper in INFOCOM'09 has been accepted for publication in IEEE/ACM Transactions on Networking after multiple rounds of reviews spanning a year. Since there is normally a long queue for actually getting an accepted ToN paper printed, it's hard to tell when ours will officially be out there. … Continue Reading ››
Today I presented Orchestra for the first time in front of a crowd outside our lab. Taghrid Samak kindly invited me to LBNL's Computing Sciences Seminar after we caught up over lunch last week for the first time in a year. She is currently a postdoctoral fellow with the Advanced Computing for Science group.
Overall, the talk went very … Continue Reading ››
Update: The camera-ready version of the paper should be available on the publications page very soon!
Our paper "Managing Data Transfers in Computer Clusters with Orchestra" has been accepted at SIGCOMM'2011. This is joint work with Matei, Justin, and professors Mike Jordan and Ion Stoica. The project started as part of … Continue Reading ››
I'm going to spend this summer in stealth mode at Microsoft Research, Cambridge, working with Christos Gkantsidis and Hitesh Ballani on a super-secret project. Hopefully, we'll have some cool results on a hot topic.
This will be my first time in England/UK as well. Looking forward to the English weather that I've heard so much about! … Continue Reading ››
Our paper, "PolyViNE: Policy-based Virtual Network Embedding Across Multiple Domains," is set to appear at the VISA'2010 workshop (co-located with SIGCOMM'2010) in New Delhi. I worked on it during my last few months in Waterloo (circa Winter/Spring 2009), and it has been lying around ever since because everyone had been busy. Finally, it's going to wake up … Continue Reading ››
An initial overview of our ongoing work on Spark, an iterative and interactive framework for cluster computing, has been accepted at HotCloud'10. I joined the project last February, while Matei has been working on it since last Fall. I will upload the paper to the publications page once … Continue Reading ››