{"id":2224,"date":"2016-03-11T16:03:24","date_gmt":"2016-03-12T00:03:24","guid":{"rendered":"http:\/\/softwaredaily.wpengine.com\/?p=2224"},"modified":"2021-11-04T02:26:37","modified_gmt":"2021-11-04T09:26:37","slug":"filodb-evan-chan","status":"publish","type":"post","link":"https:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/","title":{"rendered":"FiloDB with Evan Chan"},"content":{"rendered":"<p><img data-attachment-id=\"2226\" data-permalink=\"https:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/evan-chan\/\" data-orig-file=\"https:\/\/i0.wp.com\/softwareengineeringdaily.com\/wp-content\/uploads\/2016\/03\/evan-chan.jpg?fit=175%2C175&amp;ssl=1\" data-orig-size=\"175,175\" data-comments-opened=\"0\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"evan-chan\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/softwareengineeringdaily.com\/wp-content\/uploads\/2016\/03\/evan-chan.jpg?fit=175%2C175&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/softwareengineeringdaily.com\/wp-content\/uploads\/2016\/03\/evan-chan.jpg?fit=175%2C175&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"alignright size-full wp-image-2226\" style=\"border-radius: 50%; border: 1px solid #000000;\" src=\"https:\/\/i0.wp.com\/softwaredaily.wpengine.com\/wp-content\/uploads\/2016\/03\/evan-chan.jpg?resize=175%2C175\" alt=\"evan chan\" width=\"175\" height=\"175\" srcset=\"https:\/\/i0.wp.com\/softwareengineeringdaily.com\/wp-content\/uploads\/2016\/03\/evan-chan.jpg?w=175&amp;ssl=1 175w, https:\/\/i0.wp.com\/softwareengineeringdaily.com\/wp-content\/uploads\/2016\/03\/evan-chan.jpg?resize=150%2C150&amp;ssl=1 150w, https:\/\/i0.wp.com\/softwareengineeringdaily.com\/wp-content\/uploads\/2016\/03\/evan-chan.jpg?resize=80%2C80&amp;ssl=1 80w\" sizes=\"(max-width: 175px) 100vw, 175px\" data-recalc-dims=\"1\" \/><\/p>\n<h3>\u201cThe world is becoming more and more interactive, and people want answers right away, so you\u2019re seeing the rise of stream processing and real-time.\u201d<\/h3>\n<p>Big data is yesterday&#8211;fast data is now. FiloDB is a reactive columnar OLAP database that is built on Cassandra and Spark. Today\u2019s guest is Evan Chan, creator of FiloDB.<\/p>\n<p>In our discussion today, we talk about the use cases of an OLAP data store. Evan explains how to tackle the problem of video analytics&#8211;if you have ever found yourself asking how a company like YouTube or Netflix or Ooyala performs analytics on millions of users watching millions of videos, this episode is for you. By combining the database features of Cassandra with the data processing power of Spark, Evan created FiloDB to help solve this type of analytics problem. Evan will also be presenting at <a href=\"http:\/\/www.oreilly.com\/pub\/cpc\/9373\">Strata + Hadoop World<\/a> in San Jose. We&#8217;re partnering with O&#8217;Reilly to support this conference &#8211; if you want to go to Strata, you can save 20% off a ticket with our code PCSED.<\/p>\n<h3>Questions<\/h3>\n<ul>\n<li>What does your quote \u201cbig data is yesterday, fast data is now\u201d mean?<\/li>\n<li>Why is it hard to solve for big data and fast data at the same time?<\/li>\n<li>Prior to your work at FiloDB, what were the options to build the type of OLAP system you were looking for?<\/li>\n<li>Why is Spark a good companion for Cassandra?<\/li>\n<li>How does FiloDB work?<\/li>\n<li>What is a parquet-style layout, and why does this benefit FiloDB?<\/li>\n<li>What technologies are used in the \u201cno lambda\u201d stack?<\/li>\n<li>What is at the frontier of this problem of big data meets fast data?<\/li>\n<\/ul>\n<h3>Links<\/h3>\n<ul>\n<li><a href=\"https:\/\/github.com\/tuplejump\/FiloDB\">FiloDB<\/a><\/li>\n<li><a href=\"https:\/\/en.wikipedia.org\/wiki\/Uncertainty_principle\">Heisenberg\u2019s Uncertainty Principle<\/a><\/li>\n<li><a href=\"https:\/\/www.mapr.com\/developercentral\/lambda-architecture\">Lambda architecture<\/a><\/li>\n<li><a href=\"http:\/\/cassandra.apache.org\/\">Cassandra<\/a><\/li>\n<li><a href=\"https:\/\/parquet.apache.org\/\">Parquet<\/a><\/li>\n<li><a href=\"http:\/\/conferences.oreilly.com\/strata\/hadoop-big-data-ca\/public\/schedule\/detail\/46818\">Evan&#8217;s talk at Strata+Hadoop World<\/a><\/li>\n<li><a href=\"http:\/\/www.alluxio.org\/\">Alluxio<\/a><\/li>\n<li><a href=\"http:\/\/www.tuplejump.com\/\">Tuplejump<\/a><\/li>\n<li><a href=\"https:\/\/velvia.github.io\/\">Evan&#8217;s page<br \/>\n<\/a><\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>\u201cThe world is becoming more and more interactive, and people want answers right away, so you\u2019re seeing the rise of stream processing and real-time.\u201d Big data is yesterday&#8211;fast data is now. FiloDB is a reactive columnar OLAP database that is built on Cassandra and Spark. Today\u2019s guest is Evan Chan, creator of FiloDB. In our<\/p>\n","protected":false},"author":2,"featured_media":2225,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_mi_skip_tracking":false,"_exactmetrics_sitenote_active":false,"_exactmetrics_sitenote_note":"","_exactmetrics_sitenote_category":0,"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_newsletter_tier_id":0,"footnotes":"","jetpack_publicize_message":"","jetpack_is_tweetstorm":false,"jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","enabled":false}}},"categories":[1363,1081,14],"tags":[71,136,88,81,66],"jetpack_publicize_connections":[],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v21.4 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>FiloDB with Evan Chan - Software Engineering Daily<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"http:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"FiloDB with Evan Chan - Software Engineering Daily\" \/>\n<meta property=\"og:description\" content=\"\u201cThe world is becoming more and more interactive, and people want answers right away, so you\u2019re seeing the rise of stream processing and real-time.\u201d Big data is yesterday&#8211;fast data is now. FiloDB is a reactive columnar OLAP database that is built on Cassandra and Spark. Today\u2019s guest is Evan Chan, creator of FiloDB. In our\" \/>\n<meta property=\"og:url\" content=\"http:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/\" \/>\n<meta property=\"og:site_name\" content=\"Software Engineering Daily\" \/>\n<meta property=\"article:published_time\" content=\"2016-03-12T00:03:24+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2021-11-04T09:26:37+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/i0.wp.com\/softwareengineeringdaily.com\/wp-content\/uploads\/2016\/03\/filodb.png?fit=720%2C347\" \/>\n\t<meta property=\"og:image:width\" content=\"720\" \/>\n\t<meta property=\"og:image:height\" content=\"347\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Pranay\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@software_daily\" \/>\n<meta name=\"twitter:site\" content=\"@software_daily\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Pranay\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"http:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/#article\",\"isPartOf\":{\"@id\":\"http:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/\"},\"author\":{\"name\":\"Pranay\",\"@id\":\"https:\/\/softwareengineeringdaily.com\/#\/schema\/person\/609dd9667c84e7f6b8406a7a1bd2fb5c\"},\"headline\":\"FiloDB with Evan Chan\",\"datePublished\":\"2016-03-12T00:03:24+00:00\",\"dateModified\":\"2021-11-04T09:26:37+00:00\",\"mainEntityOfPage\":{\"@id\":\"http:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/\"},\"wordCount\":298,\"publisher\":{\"@id\":\"https:\/\/softwareengineeringdaily.com\/#organization\"},\"keywords\":[\"Big Data\",\"Cassandra\",\"Databases\",\"Distributed Systems\",\"Spark\"],\"articleSection\":[\"All Content\",\"Data\",\"Podcast\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"http:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/\",\"url\":\"http:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/\",\"name\":\"FiloDB with Evan Chan - Software Engineering Daily\",\"isPartOf\":{\"@id\":\"https:\/\/softwareengineeringdaily.com\/#website\"},\"datePublished\":\"2016-03-12T00:03:24+00:00\",\"dateModified\":\"2021-11-04T09:26:37+00:00\",\"breadcrumb\":{\"@id\":\"http:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"http:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"http:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/softwareengineeringdaily.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"FiloDB with Evan Chan\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/softwareengineeringdaily.com\/#website\",\"url\":\"https:\/\/softwareengineeringdaily.com\/\",\"name\":\"Software Engineering Daily\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/softwareengineeringdaily.com\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/softwareengineeringdaily.com\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/softwareengineeringdaily.com\/#organization\",\"name\":\"Software Engineering Daily\",\"url\":\"https:\/\/softwareengineeringdaily.com\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/softwareengineeringdaily.com\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/i0.wp.com\/softwareengineeringdaily.com\/wp-content\/uploads\/2022\/01\/cropped-logo-new.png?fit=296%2C139&ssl=1\",\"contentUrl\":\"https:\/\/i0.wp.com\/softwareengineeringdaily.com\/wp-content\/uploads\/2022\/01\/cropped-logo-new.png?fit=296%2C139&ssl=1\",\"width\":296,\"height\":139,\"caption\":\"Software Engineering Daily\"},\"image\":{\"@id\":\"https:\/\/softwareengineeringdaily.com\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/twitter.com\/software_daily\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/softwareengineeringdaily.com\/#\/schema\/person\/609dd9667c84e7f6b8406a7a1bd2fb5c\",\"name\":\"Pranay\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/softwareengineeringdaily.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/e85f4744ae484bd83dca0d08c8ce0625?s=96&d=retro&r=pg\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/e85f4744ae484bd83dca0d08c8ce0625?s=96&d=retro&r=pg\",\"caption\":\"Pranay\"},\"url\":\"https:\/\/softwareengineeringdaily.com\/author\/pranay\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"FiloDB with Evan Chan - Software Engineering Daily","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"http:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/","og_locale":"en_US","og_type":"article","og_title":"FiloDB with Evan Chan - Software Engineering Daily","og_description":"\u201cThe world is becoming more and more interactive, and people want answers right away, so you\u2019re seeing the rise of stream processing and real-time.\u201d Big data is yesterday&#8211;fast data is now. FiloDB is a reactive columnar OLAP database that is built on Cassandra and Spark. Today\u2019s guest is Evan Chan, creator of FiloDB. In our","og_url":"http:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/","og_site_name":"Software Engineering Daily","article_published_time":"2016-03-12T00:03:24+00:00","article_modified_time":"2021-11-04T09:26:37+00:00","og_image":[{"width":720,"height":347,"url":"https:\/\/i0.wp.com\/softwareengineeringdaily.com\/wp-content\/uploads\/2016\/03\/filodb.png?fit=720%2C347","type":"image\/png"}],"author":"Pranay","twitter_card":"summary_large_image","twitter_creator":"@software_daily","twitter_site":"@software_daily","twitter_misc":{"Written by":"Pranay","Est. reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"http:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/#article","isPartOf":{"@id":"http:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/"},"author":{"name":"Pranay","@id":"https:\/\/softwareengineeringdaily.com\/#\/schema\/person\/609dd9667c84e7f6b8406a7a1bd2fb5c"},"headline":"FiloDB with Evan Chan","datePublished":"2016-03-12T00:03:24+00:00","dateModified":"2021-11-04T09:26:37+00:00","mainEntityOfPage":{"@id":"http:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/"},"wordCount":298,"publisher":{"@id":"https:\/\/softwareengineeringdaily.com\/#organization"},"keywords":["Big Data","Cassandra","Databases","Distributed Systems","Spark"],"articleSection":["All Content","Data","Podcast"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"http:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/","url":"http:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/","name":"FiloDB with Evan Chan - Software Engineering Daily","isPartOf":{"@id":"https:\/\/softwareengineeringdaily.com\/#website"},"datePublished":"2016-03-12T00:03:24+00:00","dateModified":"2021-11-04T09:26:37+00:00","breadcrumb":{"@id":"http:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["http:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/"]}]},{"@type":"BreadcrumbList","@id":"http:\/\/softwareengineeringdaily.com\/2016\/03\/11\/filodb-evan-chan\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/softwareengineeringdaily.com\/"},{"@type":"ListItem","position":2,"name":"FiloDB with Evan Chan"}]},{"@type":"WebSite","@id":"https:\/\/softwareengineeringdaily.com\/#website","url":"https:\/\/softwareengineeringdaily.com\/","name":"Software Engineering Daily","description":"","publisher":{"@id":"https:\/\/softwareengineeringdaily.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/softwareengineeringdaily.com\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/softwareengineeringdaily.com\/#organization","name":"Software Engineering Daily","url":"https:\/\/softwareengineeringdaily.com\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/softwareengineeringdaily.com\/#\/schema\/logo\/image\/","url":"https:\/\/i0.wp.com\/softwareengineeringdaily.com\/wp-content\/uploads\/2022\/01\/cropped-logo-new.png?fit=296%2C139&ssl=1","contentUrl":"https:\/\/i0.wp.com\/softwareengineeringdaily.com\/wp-content\/uploads\/2022\/01\/cropped-logo-new.png?fit=296%2C139&ssl=1","width":296,"height":139,"caption":"Software Engineering Daily"},"image":{"@id":"https:\/\/softwareengineeringdaily.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/twitter.com\/software_daily"]},{"@type":"Person","@id":"https:\/\/softwareengineeringdaily.com\/#\/schema\/person\/609dd9667c84e7f6b8406a7a1bd2fb5c","name":"Pranay","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/softwareengineeringdaily.com\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/e85f4744ae484bd83dca0d08c8ce0625?s=96&d=retro&r=pg","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/e85f4744ae484bd83dca0d08c8ce0625?s=96&d=retro&r=pg","caption":"Pranay"},"url":"https:\/\/softwareengineeringdaily.com\/author\/pranay\/"}]}},"jetpack_sharing_enabled":true,"jetpack_featured_media_url":"https:\/\/i0.wp.com\/softwareengineeringdaily.com\/wp-content\/uploads\/2016\/03\/filodb.png?fit=720%2C347&ssl=1","jetpack_shortlink":"https:\/\/wp.me\/p7GuoD-zS","_links":{"self":[{"href":"https:\/\/softwareengineeringdaily.com\/wp-json\/wp\/v2\/posts\/2224"}],"collection":[{"href":"https:\/\/softwareengineeringdaily.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/softwareengineeringdaily.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/softwareengineeringdaily.com\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/softwareengineeringdaily.com\/wp-json\/wp\/v2\/comments?post=2224"}],"version-history":[{"count":0,"href":"https:\/\/softwareengineeringdaily.com\/wp-json\/wp\/v2\/posts\/2224\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/softwareengineeringdaily.com\/wp-json\/wp\/v2\/media\/2225"}],"wp:attachment":[{"href":"https:\/\/softwareengineeringdaily.com\/wp-json\/wp\/v2\/media?parent=2224"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/softwareengineeringdaily.com\/wp-json\/wp\/v2\/categories?post=2224"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/softwareengineeringdaily.com\/wp-json\/wp\/v2\/tags?post=2224"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}