CV

  • UCLA Computer Science Department 4531D Boelter Hall
  • Los Angeles
  • CA
  • 90095-1596
I am an Assistant Professor in the Computer Science Dept. at UCLA. My current research focuses on large-scale data analytics, distributed systems, Internet-scale query processing and optimization, and declarative language design and implementation.
Education
Work Experience
Redmond, WA
Principal Scientist

Principal Architect of the REEF project.

Sunnyvale, CA
Research Scientist

Research appointment with a focus on cloud computing technologies.

Research Intern

Focused on the development of optimization tools and algorithms for the Apache Pig runtime.

Berkeley, CA
Research Intern

Participated in the design and development of a declarative language and runtime for expressing and executing network protocols.

San Mateo, CA
Software Engineer

Member of the Server Technologies group. I developed and maintained extensions to the SQL compiler in Oracle 9i.

Sybase Inc.
Emeryville, CA
Database Administrator

Maintained and tested the Sybase database server and application products.

Research Projects
Los Angeles
Declarative Machine Learning (DeML)

The DeML project explores a different approach to the development of ML tools; inspired by the principle of declarative data management. The DeML system that we are building enables the authoring and execution of ML tools in a high-level declarative language that is reminiscent to Datalog, which gets automatically translated down to a distributed runtime built on REEF.

Redmond, WA
REEF: Retainable Evaluator Execution Framework

Principal Architect on project REEF: a framework for developing large-scale data processing runtimes on Apache YARN (a.k.a., Hadoop version 2).

Sunnyvale, CA
Scalable Operators (ScalOps)

Developed a domain-specific language for developing machine learning algorithms on Apache Hadoop.

Berkeley, CA
P2 Project

Design and implementation of the P2 distributed query processing engine. Lead for query optimization and low-level networking components.

Berkeley Orders of Magnitude (BOOM)

An exploration of how a data-centric programming methodology can make distributed computing platforms accessible for developer innovation.

Sunnyvale, CA
Apache Pig

Developed a declarative optimization framework for the Apache Pig runtime planner.

Berkeley, CA
Titanium Project

The Titanium project aimed to develop an explicitly parallel dialect of Java to support high-performance scientific computing on large-scale multiprocessors, including massively parallel supercomputers and distributed-memory clusters with one or more processors per node. I focused on the design and development of distributed locking protocols for the language runtime.

Stanford, CA
P2P Sociology Project

This project looked at the sociological aspects (for example, trust, privacy, economics) of P2P networks. The project aimed to address some of the technical issues in P2P networks by treating each peer as an adaptive, rule-based agent that mimics distilled and simplified human rules of behavior.

Publications
Sunnyvale, CA
Scaling Datalog for Machine Learning on Big Data

Y. Bu, V. R. Borkar, M. J. Carey, J. Rosen, N. Polyzotis, T. Condie, M. Weimer, and R. Ramakrishnan. Tech. Report Corr, vol. abs/1203.0160, 2012.

Los Angeles
TRIDENT: VISIONING A SHARED INFRASTRUCTURE FOR DATA RESEARCH AT SCALE

CHAITAN BARU, MICHAEL CAREY, TYSON CONDIE, VAGELIS HRISTIDIS, DAVID LIFKA, RICH WOLSKI, SREERANGA RAJAN, ARNAB ROY, Data Science Symposium

Pregelix: Big(ger) graph analytics on a dataflow engine

Yingyi Bu, Vinayak Borkar, Jianfeng Jia, Michael J. Carey, and Tyson Condie. 2014. Pregelix: Big(ger) graph analytics on a dataflow engine. Proc. VLDB Endow. 8, 2 (October 2014), 161-172. DOI=http://dx.doi.org/10.14778/2735471.273547

REEF: Retainable Evaluator Execution Framework

Markus Weimer, Yingda Chen, Byung-Gon Chun, Tyson Condie, Carlo Curino, Chris Douglas, Yunseong Lee, Tony Majestro, Dahlia Malkhi, Sergiy Matusevych, Brandon Myers, Shravan Narayanamurthy, Raghu Ramakrishnan, Sriram Rao, Russel Sears, Beysim Sezgin, and Julia Wang. 2015. REEF: Retainable Evaluator Execution Framework. In Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data (SIGMOD ’15). ACM, New York, NY, USA, 1343-1355. DOI=http://dx.doi.org/10.1145/2723372.2742793

Big Data Analytics with Datalog Queries on Spark

Alexander Shkapsky, Mohan Yang, Matteo Interlandi, Hsuan Chiu, Tyson Condie, and Carlo Zaniolo. 2016. Big Data Analytics with Datalog Queries on Spark. In Proceedings of the 2016 International Conference on Management of Data (SIGMOD ’16). ACM, New York, NY, USA, 1135-1149. DOI: http://dx.doi.org/10.1145/2882903.2915229

Titian: data provenance support in Spark

Matteo Interlandi, Kshitij Shah, Sai Deep Tetali, Muhammad Ali Gulzar, Seunghyun Yoo, Miryung Kim, Todd Millstein, and Tyson Condie. 2015. Titian: data provenance support in Spark. Proc. VLDB Endow. 9, 3 (November 2015), 216-227. DOI=http://dx.doi.org/10.14778/2850583.2850595

Bigdebug: Debugging primitives for interactive big data processing in spark

Muhammad Ali Gulzar, Matteo Interlandi, Seunghyun Yoo, Sai Deep Tetali, Tyson Condie, Todd Millstein, and Miryung Kim. 2016. BigDebug: debugging primitives for interactive big data processing in spark. In Proceedings of the 38th International Conference on Software Engineering(ICSE ’16). ACM, New York, NY, USA, 784-795. DOI=http://dx.doi.org/10.1145/2884781.2884813

Sunnyvale, CA
Machine learning in scalops, a higher order cloud computing language

Markus Weimer, Tyson Condie and Raghu Ramakrishnan. In NIPS BigLearn.

Online Aggregation for large MapReduce jobs

Niketan Pansare, Vinayak Borkar, Chris Jermaine, and Tyson Condie. In PVLDB, vol. 4, iss. 11, pp. 1135-1145, 2011.

Berkeley, CA
BOOM: Data-Centric Programming in the Datacenter

Peter Alvaro, Tyson Condie, Neil Conway, Khaled Elmeleegy, Joseph M. Hellerstein, and Russell Sears. In Proceedings of the European Conference on Computer Systems (EuroSys).

MapReduce Online

Tyson Condie, Neil Conway, Peter Alvaro, Joseph M. Hellerstein, Khaled Elmeleegy, and Russell Sears. In Proceedings of the 7th USENIX Symposium on Networked Systems Design and Implementation (NSDI).

I Do Declare: Consensus in a Logic Language

Peter Alvaro, Tyson Condie, Neil Conway, Joseph M. Hellerstein, and Russell Sears. In Proceedings of the SOSP Workshop on Networking Meets Databases (NetDB).

Evita Raced: Metacompilation for Declarative Networks

Tyson Condie, David Chu, Joseph M. Hellerstein, and Petros Maniatis. In 34th International Conference on Very Large Data Bases (VLDB).

Public Health for the Internet (PHI)

Joseph M. Hellerstein, Tyson Condie, Minos Garofalakis, Boon Thau Loo, Petros Maniatis, Timothy Roscoe, and Nina Taft. In CIDR.

ROFL: Routing On Flat Labels

Matthew Caesar, Tyson Condie, Jayanthkumar Kannan, Karthik Lakshminarayanan, Ion Stoica, Scott Shenker. In ACM SIGCOMM International Conference on Computer Communication.

Declarative Networking: Language, Execution and Optimization

Boon Thau Loo, Tyson Condie, Minos Garofalakis, David A. Gay, Joseph M. Hellerstein, Petros Maniatis, Raghu Ramakrishnan, Timothy Roscoe and Ion Stoica. In ACM SIGMOD International Conference on Management of Data.

Induced Churn as Shelter from Routing-Table Poisoning

Tyson Condie, Varun Kacholia, Sriram Sankararaman, Joseph M. Hellerstein and Petros Maniatis. In Network and Distributed System Security (NDSS).

Implementing Declarative Overlays

Boon Thau Loo, Tyson Condie, Joseph M. Hellerstein, Petros Maniatis, Timothy Roscoe, and Ion Stoica. In 20th ACM Symposium on Operating Systems Principles (SOSP).

Finally, a use for componentized transport protocols

Tyson Condie, Joseph M. Hellerstein, Petros Maniatis, Sean Rhea, and Timothy Roscoe. In HotNets IV.

Stanford, CA
LSH Forest: self-tuning indexes for similarity search

Mayank Bawa, Tyson Condie, and Prasanna Ganesan. In 14th International Conference on World Wide Web (WWW).

Non-Cooperation in Competitive P2P Networks

Beverly Yang, Tyson Condie, Sepandar Kamvar and Hector Garcia-Molina. In Distributed Computing Systems.

Adaptive Peer-To-Peer Topologies

Tyson Condie, Sepandar Kamvar and Hector Garcia-Molina. In International Conference on Peer-to-Peer Computing.

Simulating a File-Sharing P2P Network

Mario Schlosser, Tyson Condie and Sepandar Kamvar. In Workshop on Semantics of P2P and Grid Computing.