Blog

Querying SQL using Metatron’s Workbench

Metatron’s Workbench is used to execute SQL queries. It is usually used in the early stages of data exploration, and it is also used before using Workbook or Notebook. In this article, you will learn how to perform simple SQL queries in the workbench. First, for a simple example, we will connect to a database that holds meta information for the metatron. Let’s assume that you are using mysql as the database for Metatron meta information. 1. Creating Data Connection…

Talks for ApacheCon North America 2018

ApacheCon brings developers and technologists from around the world together to discuss the latest innovations in containers, cloud, DevOps, IoT, servers, web frameworks, and many other Apache projects and communities in a collaborative, vendor-neutral environment. We, Metatron team members, have been announcing annually on ApacheCon about open source projects. Upcoming ApacheCon will be held September 24th through 27th, 2018 at the Montreal Marriott Chateau Champlain. This year at ApacheCon North America, in Montreal, we are going to have talks for…

Visualize Real-time Data with Metatron Discovery

Let’s find out how to create a real-time dashboard when having a streaming data. First of all, you need a streaming data. I just create an artificial one for this test. If you already have one, just skip this step. import sys import json import math from datetime import datetime from time import sleep from kafka import KafkaProducer #producer = KafkaProducer(bootstrap_servers=’localhost:9092′) list_category = [’10’, ’20’, ’30’]; for i in range(0, 100000) : for cur_cate in list_category : cur_result = {};…

Worldcup Analysis with Metatron Discovery

Let’s make a dashboard for analysis using the World Cup match results data. This document will show you how Metatron Discovery ‘s Data Prep function and workbook features can be used in your analysis. 1. Creating datasets for Worldcup First, let’s get the data with World Cup matches result information. You can download data for the World Cup matches result in Excel format from Tableau’s Public Data site. Now let’s process the downloaded data using Data Prep. All accounts of…

Metatron distributed Druid

Metatron implements distributed in-memory engine based on Druid which is a high-performance, column-oriented, distributed data store. It supports users can discover insights from data intuitively and query for the massive volume of data. Adaptation of Druid Technology Why choose Druid? Metatron needed an end-to-end solution that enables management of the whole process of data analysis at once. Druid was adequate for such need for the following reasons: First is speed. Druid compiles large volumes of data in real time and…