Postingan

ANALISA TERHADAP REVIEW PENGGUNA TOKOPEDIA (FINAL TASK)

Gambar
TEUKU RAIHAN 1401153592 MB-39-INT-3 LATAR BELAKANG Tokopedia adalah sebuah pasar online yang memungkinkan individu maupun pengusaha kecil menengah di Indonesia untuk membuka dan mengelola toko online mereka secara mudah, disamping memberikan pengalaman berbelanja online yang lebih aman dan nyaman. TUJUAN untuk memcari tahu pandangan pengguna terhadap Tokopedia dan apakah tokopedia adalah pengantar yang baik dan nyaman dimata pengguna DATA YANG DIGUNAKAN data yang digunakan adalah dari review pengguna tokopedia mengambil 100 review dari 467 review yang tersedia SOLUTION menggunakan bags of words, word cloud, text mining semua solusi yang saya gunakan terdapat di aplikasi Orange. METODE: 1. menggunakan corpus 2. preprocess text 3. bag of words 4. Data table 5. word cloud STEP: 1. masukan data yang telah anda ambil ke corpus 2. masukan preprocess text,bag of words, data table, word cloud  tarik semua garis ...

DATA MINING FOR THE MASSES

Gambar
From Matthew North (Data Mining for the Masses) Chapter 10/Decision Tree, Page 157-174 Using Decision Tree model in order to find good early predictors of buying behavior, and there is a rich data set of information, including items they have just browsed for, and those have actually purchased. With following attributes : User_ID, Gender, Age, Marital_Status, Website_Activity, Browsed_Electronics_12Mo, Bought_Electronics_12Mo, Bought_Digital_Media_18Mo, Bought_Digital_Books, Payment_Method, eReader_Adoption. The Results the complete process before press "play" button decision tree graph result in apply model data description of decision tree result in apply model statistics

CLASSIFICATION METHOD

Classification Method Data Mining classification Classification techniques in data mining are capable of processing a large amount of data. It can be used to predict categorical class labels and classifies data based on training set and class labels and it can be used for  classifying newly available data.The term could cover any context in which some decision or forecast is made on the basis of presently available information. Classification procedurs recognized method for repeatedly making such decisions in new situations. Here if we assume that problem is a concern with the construction of a procedure that will be applied to a continuing sequence of cases in which each new case must be assigned to one of a set of pre defined classes on the basis of observed features of data.Creation of a classification procedure from a set of data for which the exact classes are known in advance is termed as pattern recognition or supervised learning. Contexts in which a classificatio...

PREDICTION BY USING RAPID MINER

Gambar
Rapid Miner In this blog i will use rapid miner as my prediction by using data pemilu dataset. RapidMiner is a software platform developed by the company of the same name that provides an integrated environment for machine learning, data mining, text mining, predictive analytics and business analytics. I will use three main algorithms, which are; Decision Tree (C4.5), Naïve Bayes (NB) and K-Nearest Neighbor (K-NN). 1. Decision Tree  The Decision Tree the decision tree description the performance vector   So, Decision Tree have an accuracy 96.28% with predicition TIDAK true TIDAK is 362 and true YA is 14, and for the prediction YA true TIDAK is 15 and true YA is 34. 2. NAIVE BAYES (NB)   A Naive Bayes classifier is a simple probabilistic classifier based on applying Bayes’ theorem (from Bayesian statistics) with strong (naive) independence assumptions. A more descriptive term for the underlying probability model would be ‘independent feature...

Data Visualization

Gambar
What is the value of data visualization? In this SAS Best Practices Prac'toon, we'll show you how it's more than just a pretty picture -- it's a way to process and understand your data that adds value to your organization and your strategy. The way we see it? It's invaluabl e  . find any data? Languages Of The World This data visualization illustrates an analysis of the current linguistic situation in the world. It leverages data from WALS, which according to the resource is,  “ The World Atlas of Language Structures (WALS) is a large database of structural (phonological, grammatical, lexical) properties of languages gathered from descriptive materials (such as reference grammars) by a team of 55 authors (many of them the leading authorities on the subject).” visualization source link:  www.maptive.com

Pattern Of Frontier Airlines

Gambar
Introduction: Frontier Airlines  is an American ultra low cost carrier headquartered in Denver, Colorado .  The carrier, which is a subsidiary and operating brand of Indigo Partners, operates flights to 54 destinations throughout the United States and 5 international destinations. The airline maintains a  hub  at  Denver International Airport  with numerous focus cities across the  United state . Also, under a codeshare agreement  with  great lake airlines. the airline connects passengers to surrounding  Rocky mountain states  through their Denver hub. Frontier Hubs and Passenger Stats: According to Frontier, their hubs  are located in Denver, Kansas City and Milwaukee — this can be a bit of a misnomer as Denver has around 170 daily departures and the others less than ten. Frontier has alliances (code share agreements) with Midwest, Chautauqua and Great Lakes Aviation that extend its route syst...

Analytics On Big Aviation Data

Gambar
Analytics On Big Aviation Data abstract: Recent days the aviation industry adopts on Condition/Preventive maintenance procedures due to its operational efficiency and it depends upon the failure mode calculations made after testing a part under circumstances. These conditions may  fluctuate depending on the external factors/human errors which may result in the  variation in the life time of components in turn reducing the operational efficiency of the aircrafts. Problems: The aviation industry encompasses a huge amount of data, and many airlines and airports cannot manage and process the amount of data they receive, but such data could be used to revolutionise the passenger experience. The vast amount  of data produced related to passenger flow, cost reduction and revenue enhancement is  too much to handle for most small airline IT departments. Data-driven marketing can provide insights from data in real-time so there can be consistent understanding of pas...