Processor R Code

The R code processor allows you to transform data using R scripts. R is a free software environment for statistical computing and graphics that is supported by the R Foundation for Statistical Computing. The R language is widely used among statisticians and data miners for developing statistical software and data analysis. R’s popularity has increased over the past years.

Requirements

You need to have experience in programming and to be comfortable with R syntax enough for writing simple scripts in order to work with the R code processor.

Topics you need to be familiar with include (but are not limited to):

Data structures
Control flow tools
Working with files (opening, reading, writing, unpacking, etc.)
Modules and packages.

Learn more: R package official website, Official documentation, R documentation search engine

Features

OS Debian Jessie
R Version - 3.5
Libraries available
BTYD,
car, caret, caTools, ChannelAttribution, Cubist,
data.table, data.tree, digest, doParallel, dplyr,
earth, ellipse, e1071,
forecast, foreach,
gam, gbm, gdata, ggplot2, gsl,
ipred, ISOweek,
kernlab, klaR,
lattice, lubridate,
MASS, mda, mgcv, mlbench,
nlme, nnet,
party, pamr, pls, plyr, pROC, proxy, purrr,
randomForest, RANN, reshape2, R6, RcppArmadillo, rgdal,
spls, sqldf, stringi, stringr, subselect, superpc,
testthat, tidyverse, timeDate, tree

Data In/Data Out

Data In	Files for processing and transformation can be located in `in/tables/` (CSV files) or `in/files/` (all other types of files) folder depending on the previous component in the data flow and the file type.
Data Out	Output files should be written in `out/tables` (CSV files) or `out/files` (all other types of files) folder depending on the need for the next component and the file type file.

Learn more: about folder structure in configuration here.

Code Editor, Script

This field is where you write R script to process the data.

Learn more: How to search & replace within a code ed itor

Script location and paths	A script file is located in the /data folder. You can use the absolute or relative path to access the data files. Input files absolute path `/data/in/tables/..` and `/data/in/files/..` a relative path `in/tables/..` and `in/files/..` Output files absolute path `/data/out/tables/..` and `/data/out/files/..` a relative path `out/tables/..` and `out/files/..`
Standard output	The analog of console log in Meiro Integrations is the activity log. If you run the script `print(“Hello world!”)` in the configuration, the system will write the result in the activity log.
Indentation	The Script field supports indentation in R and creates an indent automatically when changing to a new line if needed. This helps the user to keep order in the command. However, if you write the script in your local IDE and copy-paste it, be sure that you do not mix tabs and spaces.
Script requirements	While working with data flow, you will need to work with files and tables a lot. Generally, you will need to open the input file, transform the data and write it in the output file. There are no special requirements to the script except the R syntax itself. You can check a few examples of scripts solving common tasks in the section below.

Examples

Example 1

This example illustrates a simple code that imports libraries, imports an open dataset from an external source (URL), saves the response, writes output to the table and finally prints output to the console log. Usually, you will need to open the file from the input bucket, which was downloaded using a connector, but in some cases requesting the data from external resources can be necessary.

#import necessary libraries
library(data.table)

#request URL and save response to variable titanic
titanic<-fread("http://web.stanford.edu/class/archive/cs/cs109/cs109.1166/stuff/titanic.csv")

# write output to table and print first 10 rows
write.csv(titanic, file = "out/tables/titanic.csv", row.names = FALSE)
print(titanic[1:10,"Survived"])

Example 2

This example illustrates opening, filtering and writing a CSV file. In this script, we will use the Titanic dataset which contains data of about 887 of the real Titanic passengers. This dataset is open and very common in data analytics and data science courses.

Let’s imagine we need to analyze the data, compute the mean of all passengers who survived and male passengers who survived, and want to write the data in 2 separate files.

Data in this example was previously downloaded using the connector component. We will show below how you can reproduce the code on your computer.

library(dplyr)
#for computing mean and piping operator usage
# titanic:
# Survived
# Pclass
# Name
# Sex
# Age
# Siblongs.Spouses.Aboad
# Parents.Children.Aboard
# Fare
#read table
titanic_in<-read.csv("in/tables/titanic.csv",stringsAsFactors = FALSE)

#compute mean of ages and fares of all passengers and store it in data frame
mean_all<-data.frame(titanic_in%>%summarise(mean_age=mean(Age),mean_fare=mean(Fare)))

#filter table for only male passengers who survived
data_selected_male<-subset(titanic_in,Sex=='male'& Survived==1)

#compute mean of ages and fares of those male and survived and store it in data frame
mean_male_survived<-data.frame(data_selected_male%>%summarise(mean_age=mean(Age),mean_fare=mean(Fare)))

# write output means to tables
write.csv(mean_all, file = "out/tables/mean_all.csv", row.names = FALSE)
write.csv(mean_male_survived, file = "out/tables/mean_male_survived.csv", row.names = FALSE)

Reproducing and debugging

If you want to reproduce running the code on your computer for testing and debugging, or you want to write the script in a local IDE and copy-paste it in Meiro Integrations configuration, the easiest way to do this would be to reproduce the structure of folders as below:

/data
     script.r
     /in
          /tables
          /files
     /out
          /tables
          /files

The script file should be located in the /data folder, input files, and tables in the folder in/ in the corresponding subfolders, output files and tables in output/files and output/tables respectively.

To reproduce Example 2, you will need to download the dataset and save it to the folder /data/in/tables as titanic.csv, paste the code from the example to the script file and run it. New files will be written to the folder /data/out/tables/.

Introduction to Meiro Integrations with list of integrations

Glossary: what is what in Meiro Integrations

Terms & conditions

Meiro User Security Guidelines

Technical Support

User Interface Of Meiro Integrations

Components

Tab: Administration

Tab: DAWG

Tab: DAWG Detail

Tab: Full-text search

Tab: Monitoring

Tab: Trash

Tab: User Settings

Tab: Workspaces

Tab: Workspaces Detail

How data flow and components work

How to set up a schedule

Activity Details

Configuration file

Filter in components

Folder Structure

Quick tips

Workspace variables

Connector Adobe Analytics

Connector AppsFlyer

Connector AWS S3

Connector Clockify

Connector contactSPACE

Connector Doubleclick

Connector Facebook

Connector Facebook Ads

Connector Facebook Pixel

Connector Gmail

Connector Google Gmail Attachments

Connector Google Ads

Connector Google Analytics

Connector Google BigQuery

Connector Google Cloud Storage OAuth2

Connector Google Cloud Storage Service Account

Connector Google Spreadsheet

Connector HTTP

Connector Instagram

Connector ipSCAPE

Connector Kafka

Connector Klaviyo

Connector LDAP

Connector Magento V2

Connector Mailchimp

Connector Microsoft Dynamics

Connector MsSQL

Connector MySQL

Connector Optimizely

Connector OracleDB

Connector Pipedrive

Connector Postgres

Connector Provetic

Connector Pure Cloud

Connector Salesforce Commerce Cloud

Connector Salesforce Sales Cloud

Connector SFTP

Connector SmartEmailing API CSV

Connector Snowflake

Connector Twitter

Connector WooCommerce

Connector Youtube Insight

Connector Zoho CRM

Connector Zoom

Processor Command Line Interface Code

Processor Create File

Processor Great Expectations

Processor IP Geolocation

Processor Json to CSV

Processor Postgres

Processor Python 3 Code

Processor Python from Git repository

Processor R Code

Processor R from Git repository

Processor SQL on CSV

Loader ActiveCampaign