• Build Presto on OSX

    As described here, we cannot run Presto package build on OSX. Mainly it was caused by JNI side issue and machine architecture. I knew the issue from Issue 3849. [Read More]
    Tags:
  • Multi node Presto cluster on docker

    Recently I’m getting started using Presto. This is a distributed SQL query engine like Hive. I’m working on Hadoop, Hive until now. So I have though there are a lot of similarity between Hive and Presto. This is almost true in terms of user interface and SQL syntax. But Presto... [Read More]
    Tags:
  • Assemble and creating table in Hive UDF

    histogram_numeric is a UDAF which should calculate the distribution of given records. But at the same time it should generate a table that represents one category by one row. In this point we can regard this type of UDF is a combination of UDAF and UDTF. For example the output... [Read More]
    Tags:
  • Digdag syntax highlighter in Atom

    Digdag was released from Treasure Data. This is a highly scalable distributed workflow engine. It was developed for both analyst and engineers in order to make their daily batch and adhoc jobs more easy. The important part I want to say here is we can define workflow in one file... [Read More]
    Tags: