{"id":126,"date":"2015-08-06T04:58:47","date_gmt":"2015-08-06T04:58:47","guid":{"rendered":"http:\/\/softwaredaily.wpengine.com\/?p=126"},"modified":"2021-10-20T04:09:30","modified_gmt":"2021-10-20T11:09:30","slug":"kafka-with-guozhang-wang","status":"publish","type":"post","link":"https:\/\/softwareengineeringdaily.com\/2015\/08\/06\/kafka-with-guozhang-wang\/","title":{"rendered":"Apache Kafka with Guozhang Wang"},"content":{"rendered":"<h2>Apache Kafka is a publish-subscribe messaging system rethought as a distributed commit log.<\/h2>\n<h4>Kafka serves as the central repository for data streams in a distributed system.<\/h4>\n<p><img decoding=\"async\" loading=\"lazy\" class=\" aligncenter\" src=\"https:\/\/i0.wp.com\/media.licdn.com\/mpr\/mpr\/shrinknp_400_400\/p\/7\/005\/062\/092\/37a5cf0.jpg?resize=338%2C338&#038;ssl=1\" alt=\"\" width=\"338\" height=\"338\" data-recalc-dims=\"1\" \/><\/p>\n<p>Guozhang Wang is an engineer at Confluent, which\u00a0offers a stream data platform built using Kafka.<\/p>\n<h3>Questions include:<\/h3>\n<ul>\n<li>What is a central repository for data streams?<\/li>\n<li>How does Kafka improve transportation between systems?<\/li>\n<li>How does Kafka allow for richer analytical processing?<\/li>\n<li>What are the roles of topics, producers, consumers, and brokers?<\/li>\n<li>Do Spark, Storm, and Samza all use Kafka the same way?<\/li>\n<li>How does Kafka combine queueing and pub-sub into a single abstraction: the consumer group?<\/li>\n<\/ul>\n<h3>Links:<\/h3>\n<ul>\n<li><a href=\"http:\/\/www.confluent.io\/blog\/stream-data-platform-1\/\">A Practical Guide to Kafka, by Jay Kreps<\/a><\/li>\n<li><a href=\"http:\/\/kafka.apache.org\/\">Kafka Documentation<\/a><\/li>\n<li><a href=\"http:\/\/www.se-radio.net\/2015\/02\/episode-219-apache-kafka-with-jun-rao\/\">Kafka Podcast on Software Engineering Radio<\/a><\/li>\n<li><a href=\"http:\/\/allthingshadoop.com\/2013\/09\/17\/real-time-data-pipelines-and-analytics-with-apache-kafka-and-apache-samza\/\">Kafka Podcast on All Things Hadoop<\/a>\u00a0includes notes and diagrams)<\/li>\n<li><a href=\"http:\/\/radar.oreilly.com\/2014\/12\/building-apache-kafka-from-scratch.html\">Kafka Podcast on O&#8217;Reilly Data<\/a><\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Apache Kafka is a publish-subscribe messaging system rethought as a distributed commit log. Kafka serves as the central repository for data streams in a distributed system. Guozhang Wang is an engineer at Confluent, which\u00a0offers a stream data platform built using Kafka. Questions include: What is a central repository for data streams? How does Kafka improve<\/p>\n","protected":false},"author":1,"featured_media":214,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_mi_skip_tracking":false,"_exactmetrics_sitenote_active":false,"_exactmetrics_sitenote_note":"","_exactmetrics_sitenote_category":0,"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_newsletter_tier_id":0,"footnotes":"","jetpack_publicize_message":"","jetpack_is_tweetstorm":false,"jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","enabled":false}}},"categories":[1363,1081,14],"tags":[260,259,63,262,261,66,64],"jetpack_publicize_connections":[],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v21.4 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Apache Kafka with Guozhang Wang - Software Engineering Daily<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"http:\/\/softwareengineeringdaily.com\/2015\/08\/06\/kafka-with-guozhang-wang\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Apache Kafka with Guozhang Wang - Software Engineering Daily\" \/>\n<meta property=\"og:description\" content=\"Apache Kafka is a publish-subscribe messaging system rethought as a distributed commit log. Kafka serves as the central repository for data streams in a distributed system. Guozhang Wang is an engineer at Confluent, which\u00a0offers a stream data platform built using Kafka. Questions include: What is a central repository for data streams? How does Kafka improve\" \/>\n<meta property=\"og:url\" content=\"http:\/\/softwareengineeringdaily.com\/2015\/08\/06\/kafka-with-guozhang-wang\/\" \/>\n<meta property=\"og:site_name\" content=\"Software Engineering Daily\" \/>\n<meta property=\"article:published_time\" content=\"2015-08-06T04:58:47+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2021-10-20T11:09:30+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/i0.wp.com\/softwareengineeringdaily.com\/wp-content\/uploads\/2015\/08\/kafka-logo-wide.png?fit=702%2C369\" \/>\n\t<meta property=\"og:image:width\" content=\"702\" \/>\n\t<meta property=\"og:image:height\" content=\"369\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Jeff\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@software_daily\" \/>\n<meta name=\"twitter:site\" content=\"@software_daily\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Jeff\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"http:\/\/softwareengineeringdaily.com\/2015\/08\/06\/kafka-with-guozhang-wang\/#article\",\"isPartOf\":{\"@id\":\"http:\/\/softwareengineeringdaily.com\/2015\/08\/06\/kafka-with-guozhang-wang\/\"},\"author\":{\"name\":\"Jeff\",\"@id\":\"https:\/\/softwareengineeringdaily.com\/#\/schema\/person\/6365c4c1ff0b8cf742afe4279ddcc5bd\"},\"headline\":\"Apache Kafka with Guozhang Wang\",\"datePublished\":\"2015-08-06T04:58:47+00:00\",\"dateModified\":\"2021-10-20T11:09:30+00:00\",\"mainEntityOfPage\":{\"@id\":\"http:\/\/softwareengineeringdaily.com\/2015\/08\/06\/kafka-with-guozhang-wang\/\"},\"wordCount\":140,\"publisher\":{\"@id\":\"https:\/\/softwareengineeringdaily.com\/#organization\"},\"keywords\":[\"Data Engineering\",\"Distributed System\",\"Kafka\",\"Pub Sub\",\"Samza\",\"Spark\",\"Storm\"],\"articleSection\":[\"All Content\",\"Data\",\"Podcast\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"http:\/\/softwareengineeringdaily.com\/2015\/08\/06\/kafka-with-guozhang-wang\/\",\"url\":\"http:\/\/softwareengineeringdaily.com\/2015\/08\/06\/kafka-with-guozhang-wang\/\",\"name\":\"Apache Kafka with Guozhang Wang - Software Engineering Daily\",\"isPartOf\":{\"@id\":\"https:\/\/softwareengineeringdaily.com\/#website\"},\"datePublished\":\"2015-08-06T04:58:47+00:00\",\"dateModified\":\"2021-10-20T11:09:30+00:00\",\"breadcrumb\":{\"@id\":\"http:\/\/softwareengineeringdaily.com\/2015\/08\/06\/kafka-with-guozhang-wang\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"http:\/\/softwareengineeringdaily.com\/2015\/08\/06\/kafka-with-guozhang-wang\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"http:\/\/softwareengineeringdaily.com\/2015\/08\/06\/kafka-with-guozhang-wang\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/softwareengineeringdaily.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Apache Kafka with Guozhang Wang\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/softwareengineeringdaily.com\/#website\",\"url\":\"https:\/\/softwareengineeringdaily.com\/\",\"name\":\"Software Engineering Daily\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/softwareengineeringdaily.com\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/softwareengineeringdaily.com\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/softwareengineeringdaily.com\/#organization\",\"name\":\"Software Engineering Daily\",\"url\":\"https:\/\/softwareengineeringdaily.com\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/softwareengineeringdaily.com\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/i0.wp.com\/softwareengineeringdaily.com\/wp-content\/uploads\/2022\/01\/cropped-logo-new.png?fit=296%2C139&ssl=1\",\"contentUrl\":\"https:\/\/i0.wp.com\/softwareengineeringdaily.com\/wp-content\/uploads\/2022\/01\/cropped-logo-new.png?fit=296%2C139&ssl=1\",\"width\":296,\"height\":139,\"caption\":\"Software Engineering Daily\"},\"image\":{\"@id\":\"https:\/\/softwareengineeringdaily.com\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/twitter.com\/software_daily\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/softwareengineeringdaily.com\/#\/schema\/person\/6365c4c1ff0b8cf742afe4279ddcc5bd\",\"name\":\"Jeff\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/softwareengineeringdaily.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/69ae5c01bd43f01c2564f8f85218a6b6?s=96&d=retro&r=pg\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/69ae5c01bd43f01c2564f8f85218a6b6?s=96&d=retro&r=pg\",\"caption\":\"Jeff\"},\"url\":\"https:\/\/softwareengineeringdaily.com\/author\/jeff\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Apache Kafka with Guozhang Wang - Software Engineering Daily","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"http:\/\/softwareengineeringdaily.com\/2015\/08\/06\/kafka-with-guozhang-wang\/","og_locale":"en_US","og_type":"article","og_title":"Apache Kafka with Guozhang Wang - Software Engineering Daily","og_description":"Apache Kafka is a publish-subscribe messaging system rethought as a distributed commit log. Kafka serves as the central repository for data streams in a distributed system. Guozhang Wang is an engineer at Confluent, which\u00a0offers a stream data platform built using Kafka. Questions include: What is a central repository for data streams? How does Kafka improve","og_url":"http:\/\/softwareengineeringdaily.com\/2015\/08\/06\/kafka-with-guozhang-wang\/","og_site_name":"Software Engineering Daily","article_published_time":"2015-08-06T04:58:47+00:00","article_modified_time":"2021-10-20T11:09:30+00:00","og_image":[{"width":702,"height":369,"url":"https:\/\/i0.wp.com\/softwareengineeringdaily.com\/wp-content\/uploads\/2015\/08\/kafka-logo-wide.png?fit=702%2C369","type":"image\/png"}],"author":"Jeff","twitter_card":"summary_large_image","twitter_creator":"@software_daily","twitter_site":"@software_daily","twitter_misc":{"Written by":"Jeff","Est. reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"http:\/\/softwareengineeringdaily.com\/2015\/08\/06\/kafka-with-guozhang-wang\/#article","isPartOf":{"@id":"http:\/\/softwareengineeringdaily.com\/2015\/08\/06\/kafka-with-guozhang-wang\/"},"author":{"name":"Jeff","@id":"https:\/\/softwareengineeringdaily.com\/#\/schema\/person\/6365c4c1ff0b8cf742afe4279ddcc5bd"},"headline":"Apache Kafka with Guozhang Wang","datePublished":"2015-08-06T04:58:47+00:00","dateModified":"2021-10-20T11:09:30+00:00","mainEntityOfPage":{"@id":"http:\/\/softwareengineeringdaily.com\/2015\/08\/06\/kafka-with-guozhang-wang\/"},"wordCount":140,"publisher":{"@id":"https:\/\/softwareengineeringdaily.com\/#organization"},"keywords":["Data Engineering","Distributed System","Kafka","Pub Sub","Samza","Spark","Storm"],"articleSection":["All Content","Data","Podcast"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"http:\/\/softwareengineeringdaily.com\/2015\/08\/06\/kafka-with-guozhang-wang\/","url":"http:\/\/softwareengineeringdaily.com\/2015\/08\/06\/kafka-with-guozhang-wang\/","name":"Apache Kafka with Guozhang Wang - Software Engineering Daily","isPartOf":{"@id":"https:\/\/softwareengineeringdaily.com\/#website"},"datePublished":"2015-08-06T04:58:47+00:00","dateModified":"2021-10-20T11:09:30+00:00","breadcrumb":{"@id":"http:\/\/softwareengineeringdaily.com\/2015\/08\/06\/kafka-with-guozhang-wang\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["http:\/\/softwareengineeringdaily.com\/2015\/08\/06\/kafka-with-guozhang-wang\/"]}]},{"@type":"BreadcrumbList","@id":"http:\/\/softwareengineeringdaily.com\/2015\/08\/06\/kafka-with-guozhang-wang\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/softwareengineeringdaily.com\/"},{"@type":"ListItem","position":2,"name":"Apache Kafka with Guozhang Wang"}]},{"@type":"WebSite","@id":"https:\/\/softwareengineeringdaily.com\/#website","url":"https:\/\/softwareengineeringdaily.com\/","name":"Software Engineering Daily","description":"","publisher":{"@id":"https:\/\/softwareengineeringdaily.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/softwareengineeringdaily.com\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/softwareengineeringdaily.com\/#organization","name":"Software Engineering Daily","url":"https:\/\/softwareengineeringdaily.com\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/softwareengineeringdaily.com\/#\/schema\/logo\/image\/","url":"https:\/\/i0.wp.com\/softwareengineeringdaily.com\/wp-content\/uploads\/2022\/01\/cropped-logo-new.png?fit=296%2C139&ssl=1","contentUrl":"https:\/\/i0.wp.com\/softwareengineeringdaily.com\/wp-content\/uploads\/2022\/01\/cropped-logo-new.png?fit=296%2C139&ssl=1","width":296,"height":139,"caption":"Software Engineering Daily"},"image":{"@id":"https:\/\/softwareengineeringdaily.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/twitter.com\/software_daily"]},{"@type":"Person","@id":"https:\/\/softwareengineeringdaily.com\/#\/schema\/person\/6365c4c1ff0b8cf742afe4279ddcc5bd","name":"Jeff","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/softwareengineeringdaily.com\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/69ae5c01bd43f01c2564f8f85218a6b6?s=96&d=retro&r=pg","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/69ae5c01bd43f01c2564f8f85218a6b6?s=96&d=retro&r=pg","caption":"Jeff"},"url":"https:\/\/softwareengineeringdaily.com\/author\/jeff\/"}]}},"jetpack_sharing_enabled":true,"jetpack_featured_media_url":"https:\/\/i0.wp.com\/softwareengineeringdaily.com\/wp-content\/uploads\/2015\/08\/kafka-logo-wide.png?fit=702%2C369&ssl=1","jetpack_shortlink":"https:\/\/wp.me\/p7GuoD-22","_links":{"self":[{"href":"https:\/\/softwareengineeringdaily.com\/wp-json\/wp\/v2\/posts\/126"}],"collection":[{"href":"https:\/\/softwareengineeringdaily.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/softwareengineeringdaily.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/softwareengineeringdaily.com\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/softwareengineeringdaily.com\/wp-json\/wp\/v2\/comments?post=126"}],"version-history":[{"count":0,"href":"https:\/\/softwareengineeringdaily.com\/wp-json\/wp\/v2\/posts\/126\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/softwareengineeringdaily.com\/wp-json\/wp\/v2\/media\/214"}],"wp:attachment":[{"href":"https:\/\/softwareengineeringdaily.com\/wp-json\/wp\/v2\/media?parent=126"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/softwareengineeringdaily.com\/wp-json\/wp\/v2\/categories?post=126"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/softwareengineeringdaily.com\/wp-json\/wp\/v2\/tags?post=126"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}