Please note: All Sessions are in Japan Standard Time Zone (UTC+09:00)
AI/ML/DL
Thursday, December 3
 

10:45 JST

History and Evolution of Data Lake Architecture - Post Lambda Architecture - Takuya Fukuhisa & Masaru Dobashi, NTT DATA
Around 2006, Apache Hadoop made an open source "Data Lake" architecture available to enterprises that wanted to utilize large amounts of data, or "Big Data". However, there are also growing expectations for "real-time analysis", which delivers analyzed results to end users within seconds to minutes by immediately processing large amounts of "stream data". In this talk, we present the history of open source software related to Data Lake, an overview of current software, and the potential tradeoffs. We also talk about how recent storage technologies, such as Apache Iceberg, Apache Hudi, and Delta Lake, try to provide features that leverage both historical and stream data on a Data Lake in a different way from the Lambda Architecture. Finally, we summarize these products based on a comparison of their internal architectures. Attendees will learn about the current storage software and the similarities and differences between their architectures. This helps you design a system architecture built on Data Lake technologies that realizes both batch and real-time analysis. This talk reflects some software upgrades made since a previous domestic presentation.

Speakers

Takuya Fukuhisa

Deputy Manager, Senior IT Architect, NTT DATA
Takuya Fukuhisa is a system infrastructure architect and expert in distributed computing and stream data processing. He has developed mission-critical open systems in the public and financial sector since 2011. Currently, he is responsible for developing a system and addressing the...

Masaru Dobashi

Executive IT Specialist, Manager, NTT DATA
Masaru Dobashi is a system infrastructure architect and expert on distributed computing, machine learning platform, and stream data processing. He leads the open-source professional service team at NTT DATA Corporation and has responsibility for introducing open source-based data...



Thursday December 3, 2020 10:45 - 11:35 JST
Virtual 3

13:10 JST

Lessons Learned from the Collaboration of Big Data and AI/ML Technologies for Giant Hogweed Eradication - Naoto Umemori & Masaru Dobashi, NTT DATA
Giant Hogweed is a highly toxic plant originating in the Western Caucasus. It has spread across Central and Western Europe, and sightings have been reported in North America, too. In Europe, landowners are obliged to eradicate it because of its toxicity and invasive nature. However, it is difficult for landowners to find and remove Giant Hogweed across large areas of land, since this is a very cumbersome manual process. Automating the detection of Giant Hogweed with technologies such as drones and Machine Learning based image recognition/detection is an effective way to address this problem. In this presentation, we show how we designed the architecture for Petabyte scale, how we took advantage of both Big Data and Machine / Deep Learning technologies, and the lessons we learned through this project. For example, we integrated a drone, Apache Hadoop, Apache Spark, and TensorFlow to achieve usability, flexibility, and scalability for both data engineers and data analysts. We talk about why we needed this integration, the technical challenges from an enterprise viewpoint, and tips for leveraging the above open source software.

Speakers

Masaru Dobashi

Executive IT Specialist, Manager, NTT DATA
Masaru Dobashi is a system infrastructure architect and expert on distributed computing, machine learning platform, and stream data processing. He leads the open-source professional service team at NTT DATA Corporation and has responsibility for introducing open source-based data...

Naoto Umemori

Deputy Manager, Senior IT Specialist, NTT DATA
Naoto is a Senior IT Specialist and Deputy Manager at NTT DATA Corporation, working in the technology and innovation area. He has spent around a decade in the Platform and Infrastructure field, focusing mainly on the open source software technology stack. He has experiences with talking...



Thursday December 3, 2020 13:10 - 14:00 JST
Virtual 3
 
Event: Open Source Summit Japan & Automotive Linux Summit 2020, Dec 2 - 4, 2020
Venue: Virtual - Attend from Anywhere!