Copyright 2022 Arjuna Sky Kok (https://twitter.com/arjunaskykok)
This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program. If not, see https://www.gnu.org/licenses/.
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
dataset_url = "https://arjunaskykok.s3.ap-southeast-1.amazonaws.com/analisa/cto_2022_04_20.csv"
dataset = pd.read_csv(dataset_url, dtype={"Tebakan Umur Saat Menjabat CTO": "Int64"})
dataset.head()
| | Startup | Nama | Pendiri/Profesional | Pendidikan | Nama Universitas Pendidikan Terakhir | Negara Pendidikan Terakhir | Jurusan | Tebakan Umur Saat Menjabat CTO | Gender | Tebakan Warganegara | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Bukalapak | Jun Yao | Profesional | Ph.D. | The University of New South Wales | Australia | Mobile Computing, Vehicular Communications | 42 | L | Tiongkok/Australia | https://www.linkedin.com/in/jun-yao-61412715/ |
| 1 | Tokopedia | Herman Widjaja | Profesional | S1 | Monash University | Australia | Computer Science, Electrical Engineering | 38 | L | Indonesia | https://www.linkedin.com/in/hermanwidjaja/ |
| 2 | Gojek | Severan Rault | Profesional | Master | Eseo, ESCP Business School | NaN | International Project Management, Signal Proce... | 43 | L | Prancis | https://www.linkedin.com/in/severan/ |
| 3 | Ajaib | Winston Lays | NaN | S1 | University of Southern California | Amerika | Computer Engineering & Computer Science | 29 | L | Indonesia | https://www.linkedin.com/in/wlays/ |
| 4 | Xendit | Bo Chen | Pendiri | S1 | University of California, Berkeley | Amerika | Electrical Engineering & Computer Science | 23 | L | Amerika | https://www.linkedin.com/in/bochen303/ |
dataset["Gender"].value_counts()
L    20
P     3
Name: Gender, dtype: int64
dataset["Negara Pendidikan Terakhir"].value_counts()
Amerika            7
Australia          3
Indonesia          3
Tiongkok           2
India              2
Singapura          2
Kanada             1
Inggris, India     1
Jerman, Spanyol    1
Name: Negara Pendidikan Terakhir, dtype: int64
dataset["Tebakan Warganegara"].value_counts()
Indonesia             12
India                  4
Amerika                2
Tiongkok/Australia     1
Prancis                1
Tiongkok/Amerika       1
Amerika/Kanada         1
Tiongkok               1
Name: Tebakan Warganegara, dtype: int64
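def asing(x):
    # Keep an exact "Indonesia" guess as is; anything else, including dual nationality, becomes "Asing" (foreign).
    if x == "Indonesia":
        return "Indonesia"
    else:
        return "Asing"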
dataset["Asing"] = dataset["Tebakan Warganegara"].apply(asing)
dataset["Asing"].value_counts()
Indonesia    12
Asing        11
Name: Asing, dtype: int64
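The same labelling can also be written without a row-by-row `apply`. The sketch below is one possible vectorized alternative using `np.where`; the `sample` frame holds a few hypothetical rows standing in for the real dataset, and `asing_labels` is a name introduced only for this illustration.

```python
import numpy as np
import pandas as pd

# Hypothetical sample rows standing in for the real dataset.
sample = pd.DataFrame(
    {"Tebakan Warganegara": ["Indonesia", "India", "Tiongkok/Australia", "Indonesia"]}
)

# Vectorized equivalent of asing(): an exact match on "Indonesia" keeps
# the label, every other value (including dual nationality) becomes "Asing".
sample["Asing"] = np.where(
    sample["Tebakan Warganegara"] == "Indonesia", "Indonesia", "Asing"
)

asing_labels = sample["Asing"].tolist()
print(sample["Asing"].value_counts())
```

Both versions treat a missing nationality as "Asing", so this is worth checking if the column ever contains NaN.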
dataset["Pendiri/Profesional"].value_counts()
Profesional    13
Pendiri         7
Name: Pendiri/Profesional, dtype: int64
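These counts add up to 20, while the gender counts add up to 23, so some rows appear to have no role recorded (row 3 in the preview shows NaN). By default `value_counts` drops missing values. The sketch below shows how `dropna=False` makes them visible; it uses a small hypothetical `sample` frame, and `role_counts` and `missing_roles` are illustrative names.

```python
import numpy as np
import pandas as pd

# Hypothetical sample rows; one role is missing, as in row 3 of the preview.
sample = pd.DataFrame(
    {"Pendiri/Profesional": ["Profesional", "Pendiri", np.nan, "Profesional"]}
)

# dropna=False keeps NaN as its own category instead of silently dropping it.
role_counts = sample["Pendiri/Profesional"].value_counts(dropna=False)

# Count the missing entries directly as a cross-check.
missing_roles = int(sample["Pendiri/Profesional"].isna().sum())

print(role_counts)
print("missing:", missing_roles)
```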
dataset[dataset["Pendiri/Profesional"]=="Profesional"]["Tebakan Umur Saat Menjabat CTO"].mean()
39.5
dataset[dataset["Pendiri/Profesional"]=="Pendiri"]["Tebakan Umur Saat Menjabat CTO"].mean()
27.714285714285715
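The two filtered means above can also be computed in a single pass with `groupby`. This is a minimal sketch on a hypothetical four-row `sample` frame, not the real data; `mean_age_by_role` is a name introduced for the example.

```python
import pandas as pd

# Hypothetical sample rows using the same column names and nullable Int64 ages.
sample = pd.DataFrame(
    {
        "Pendiri/Profesional": ["Profesional", "Pendiri", "Profesional", "Pendiri"],
        "Tebakan Umur Saat Menjabat CTO": pd.array([42, 23, 38, 27], dtype="Int64"),
    }
)

# One mean per role; rows with a missing role are excluded by groupby by default.
mean_age_by_role = sample.groupby("Pendiri/Profesional")[
    "Tebakan Umur Saat Menjabat CTO"
].mean()

print(mean_age_by_role)
```

Applied to `dataset`, the same call would return both averages at once, indexed by role.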
dataset["Pendidikan"].value_counts()
S1        13
Master     8
Ph.D.      1
Name: Pendidikan, dtype: int64