Welcome, Guest: Register On Nairaland / LOGIN! / Trending / Recent / NewStats: 3,208,272 members, 8,002,118 topics. Date: Thursday, 14 November 2024 at 01:44 AM |
Nairaland Forum / Science/Technology / Programming / Chronicle Of A Data Scientist/analyst (356453 Views)
Chronicle Of A Data/cloud Engineer / Net Salary For A Data Analyst Or Scientist Or Web Dev / Aspiring Data Scientist. (2) (3) (4)
(1) (2) (3) ... (37) (38) (39) (40) (41) (42) (43) ... (147) (Reply) (Go Down)
Re: Chronicle Of A Data Scientist/analyst by mbhs139(m): 8:40pm On Jun 01, 2020 |
Toppytek: Bro, instead of name, and hype, think about your pocket first! Most machine, on budget, will do the work of a data analytics or data scientist, particularly if you have not started earning big time like the original creator of this thread. All you need is a bigger RAM, 16GB minimum, with a good processor speed, say a base speed of 2.5Ghz, that can turbo up to... well, I don't know, and a solid state disk (SSD) minimum of 512GB. Baba, once your machine has any of this, you're good to go. Forget about name. 7 Likes 1 Share |
Re: Chronicle Of A Data Scientist/analyst by Samzeal(m): 9:42pm On Jun 01, 2020 |
mbhs139: Which programme are you learning now. Is it python? |
Re: Chronicle Of A Data Scientist/analyst by mbhs139(m): 12:29am On Jun 02, 2020 |
Samzeal: Python MySQL R Power BI, and Tableau. I paid for Mosh online courses on MySQL and Python. I want to have a good grasp of all these courses first, before going into their application in data science. I don't just want to jump into the data science application without having a good grasp and background understanding of what and what each of the programmes are used for or can be used for. Other ones I am watching free YouTube videos and reading free books on Amazon Kindle. I don't know if that can work in Nigeria, I'd encourage you guys to check it out. There are a handful of books there. 2 Likes 1 Share |
Re: Chronicle Of A Data Scientist/analyst by gargoylern: 2:31am On Jun 02, 2020 |
Re: Chronicle Of A Data Scientist/analyst by gargoylern: 2:34am On Jun 02, 2020 |
Meanwhile, You can now get "Automate the Boring Stuff with Python" from Udemy free this June. This is probably one of the best open-source books written on python. You use this coupon JUN2020FREE or just visit this link www(dot)udemy(dot)com(forward_slash)course(forward_slash)automate(forward_slash)(question_mark)couponCode(equal_sign)JUN2020FREE 4 Likes |
Re: Chronicle Of A Data Scientist/analyst by Samzeal(m): 4:51am On Jun 02, 2020 |
mbhs139: Thank you I really need a book on data science. And if you can share some materials on python that can help it would be appreciated. |
Re: Chronicle Of A Data Scientist/analyst by dollyj1996(m): 8:16am On Jun 02, 2020 |
hello house,i ve started learning d use of excel up to a level and decided to go thru d tutorial videos i have regarding python and i notice almost all the functions performed by excel can b done by python(vice versa). question is cant i just do deep learning on excel alone and what is d advantage of using python libraries,power i.b,mySQL etc over excel? 3 Likes |
Re: Chronicle Of A Data Scientist/analyst by mightguy(m): 9:55am On Jun 02, 2020 |
Hello guys. This excel for Android how good is that? Is it similar to the one on lappy. can u use it for practice after watching Excel videos online? Am planning on buying a laptop next month |
Re: Chronicle Of A Data Scientist/analyst by Schoolhike: 10:22am On Jun 02, 2020 |
dollyj1996: Not all bro, only few of excel functions, power Bi and excel power pivot perform excellently when it comes to DAX in performing powerful stuffs, I’d advice you to have a good grasp of excel first. SQL is soo powerful when it comes to querying data from a database. |
Re: Chronicle Of A Data Scientist/analyst by mbhs139(m): 10:26am On Jun 02, 2020 |
Samzeal: Drop your email here if you don't mind. The files are too large for this site |
Re: Chronicle Of A Data Scientist/analyst by mbhs139(m): 10:35am On Jun 02, 2020 |
Samzeal: You may also want to consider these materials: (1) https://automatetheboringstuff.com/2e/chapter1/ (2) http://www.openbookproject.net/thinkcs/python/english3e/ (3) https://wiki.python.org/moin/FrontPage (4) https://stackoverflow.com/questions/17079074/creating-database-in-python (6) https://download.audible.com/product_related_docs/BK_ACX0_141304.pdf ( https://www.tutorialspoint.com/sql/sql-alter-command.htm (9) https://www.tutorialspoint.com/sql/index.htm (10) https://www.w3schools.com/sql/default.asp (11) https://docs.python.org/3.8/tutorial/index.html (12) https://docs.python.org/3.8/index.html (14) https://www.youtube.com/results?search_query=tutorials+on+mysql+workbench (15) https://www.youtube.com/watch?v=_K40ZE3WeSk&list=PLWf6TEjiiuIByHVmtFf5I743nbGObbCWl&index=1 (16) https://www.youtube.com/watch?v=w-0IWyAeZ3M (17) https://www.youtube.com/watch?v=aGQ7PGZ1u24&list=PLhwzZMYRiUEq5e5TJCznKVuPAsnNzT8k5 (18) https://www.youtube.com/watch?v=WmGgxTpGs_8 (20) https://www.youtube.com/watch?v=RSHevYMwCVw (21) https://www.youtube.com/watch?v=7S_tz1z_5bA 23 Likes 6 Shares |
Re: Chronicle Of A Data Scientist/analyst by dollyj1996(m): 11:05am On Jun 02, 2020 |
Schoolhike:thank u. |
Re: Chronicle Of A Data Scientist/analyst by mayprince(m): 3:56pm On Jun 02, 2020 |
Hello guys. I just finish learning Excel to a point where I can move to the next step in Data Science. Now my question is what next? Should I start learning python or learn other data visualization software such as Tableau or Power BI? 1 Like |
Re: Chronicle Of A Data Scientist/analyst by Schoolhike: 5:08pm On Jun 02, 2020 |
AnalysisGuru: A good introductory course on statistics check “Intro to statistics” on udemy it’s free josh_wills says "Data Scientist is a person who is better at statistics than any software engineer and better at software engineering than any statistician.” That's the balance you need to achieve and maintain. 3 Likes |
Re: Chronicle Of A Data Scientist/analyst by Schoolhike: 5:32pm On Jun 02, 2020 |
I’d advice you read at least first 3 chapters of this book, “An introduction to statistical learning with application in R” by springer tech in statistics, if you want pursue career in data science, suitable for those without math/stats background. When you reach the part R of programming apply the knowledge of python if you don’t know R, I also don’t know R 4 Likes 3 Shares |
Re: Chronicle Of A Data Scientist/analyst by Toppytek(m): 6:42pm On Jun 02, 2020 |
Schoolhike: If you have the soft copy or link to the said pls help us with it |
Re: Chronicle Of A Data Scientist/analyst by brashear: 7:07pm On Jun 02, 2020 |
Toppytek:b-ok dot xyz. just type the title of the book into the search bar. |
Re: Chronicle Of A Data Scientist/analyst by dollyj1996(m): 7:47pm On Jun 02, 2020 |
mayprince:bro,which video tutorial did u use for ur excel lesson? |
Re: Chronicle Of A Data Scientist/analyst by Gundel001: 10:02pm On Jun 02, 2020 |
mcemmy0z:where in Ota bro |
Re: Chronicle Of A Data Scientist/analyst by Gundel001: 10:31pm On Jun 02, 2020 |
mcemmy0z:where in Ota bro?? |
Re: Chronicle Of A Data Scientist/analyst by Dthinkerman: 11:03pm On Jun 02, 2020 |
mbhs139: Thanks for the share. I am quite interested in the Mosh's paid Python course. I would appreciate it if I could have it. 1 Like |
Re: Chronicle Of A Data Scientist/analyst by Samzeal(m): 3:56am On Jun 03, 2020 |
mbhs139: amoosamson@gmail.com |
Re: Chronicle Of A Data Scientist/analyst by cochtrane(m): 7:02am On Jun 03, 2020 |
As a budding data scientist who visits NL often, it's not surprising that you start to get more than interested in the some of the topics making front page and how frequent topics from individual sections reach the top. I have been looking into this for a while and thought it would be nice to do some investigation in this regard. For example, which section makes front page most often? How often do we see programming topics get to the front page? Who posts more often on the front page? Is it really lalasticlala, as is frequently supposed, or is it someone else? What exactly has been the relationship between lalasticalala and snakes over the past year? Some people think he loves to push snake topics to the frontpage more often than other topics. What else can we learn from the topics making frontpage? Like for example, are they mostly about Buhari or something else? To this end, I scrapped the front page data and obtained more than 28,000 records. You can download this data set I obtained here on my github. If you are a data science enthusiast who also likes Nairaland, this may be good motivation to dig into a topic that interests you. You will find a metadata file in the sublink as well and can investigate what the attributes are about. You've got titles, links, sections and time that posts made front page. It's a year of data from 31st May 2019 till date. It turns out to get the whole frontpage information may need more than 230,000 records! That's huge, and probably not so wise to collect for a quick, lazy analysis. Except, of course, you have business motives For me, I was interested in a few topics. First, from which section did we get the most frontpage material over the past year? Apparently, it is "Politics". It trumps everything. "Celebrities" come a close second. Not surprising, right? What with the volume of Bobrisky posts and co. And then "Crime" comes third. Does this point to a high frequency of crime in Nigeria? I leave that question to you. "Programming"? Didn't even make bank one time! The fact that politics make frontpage more often clearly shows that top on our discourse as Nigerians is probably politics, if Nairaland reflects a microcosm of the Nigerian environment, which I feel it does. Who posts more often on the frontpage? Not lalasticlala like you might think. It's a person called dre11; at least over the last year. Maybe you know him, may you don't. Lalasticlala is not even in the top three. One quirky thing I found, however, was that the time it takes for a post to get to frontpage has a heavily right-skewed distribution. Before plotting this, I lazily thought it might be dome-shaped, cos...well, a lot of things in life are and it shouldn't be unusual to have this behave the same; few make front page early, few late, and most are in between. Right? On the contrary, the reality is skewed. I feel the heavy skewness and bimodal attribute probably points to deliberate human intervention. Most posts make front page early, not late. They are created and in little time pushed to the front page. Evidently in a deliberate fashion. Else the data should be dome-shaped, don't you think? Anyways, that's what my data shows. Maybe, better insight could be derived though if one scraped randomly over the past several years in order to obtain a truly random sample. And there were a few threads which made front page late. Very late! In the past year, we have had threads from 8 years ago make frontpage. Yes, 8 years ago! Thats's 2012. And then there are those that were initially posted 5 years ago before they made front page. Perhaps you can find more if you looked into the data set? Anyways, getting your hands dirty with a data set is always a good way to learn data analysis. If you need help with navigating this, you can buzz me. 31 Likes 3 Shares
|
Re: Chronicle Of A Data Scientist/analyst by cochtrane(m): 7:39am On Jun 03, 2020 |
What are the longest years between the creation of a post and when it made front page? (first figure). And has lalasticlala posted anything associated with snakes in the past year? Apparently not. (second figure). The data shows that hasn't been the case. 4 Likes
|
Re: Chronicle Of A Data Scientist/analyst by Dum20: 8:27am On Jun 03, 2020 |
cochtrane: Bro this is fantastic and awesome. I have plenty questions for you later |
Re: Chronicle Of A Data Scientist/analyst by cochtrane(m): 9:07am On Jun 03, 2020 |
And how frequently has a topic containing the "Buhari" keyword appeared on the frontpage? Comprises about 5% of all topics in the past year. You would think this is not such a big number, until you see that the frontpage topics that contain the keyword "Tinubu" or "Osibanjo" were only 0.63% and 0.0069% respectively. Crazy, right? Nigerians do talk a lot about the number one man. I feel the fact that politics dominate discussion on Nairaland is some evidence that the site is becoming more like a news website than a social network. It's probably firstly a news website, then it is a social network. What more can we discover from this data? -A time analysis of post frequency? using cumulative sums -A time analysis of front page posts especially for the top posters? using cumulative sums -A bag of words model for frontpage titles over the past years. Would be nice to see the most frequent words which make frontpage -The shortest intervals between when a post is created and when it makes frontpage -The last time some infrequent sections made frontpage -The sections with the most frontpage material per month (probably politics, but need to confirm) -How often have you or your friend or someone you know made frontpage? -you really may want to find this out, if you think you've been on the frontpage before I leave the rest to you! 5 Likes
|
Re: Chronicle Of A Data Scientist/analyst by cochtrane(m): 9:07am On Jun 03, 2020 |
Dum20:Thanks! Just drop a message here if you've got a question. |
Re: Chronicle Of A Data Scientist/analyst by Zabiboy: 9:44am On Jun 03, 2020 |
Nice one @ cochtrane I'll use Tableau/Power bi to analyse mine.. Pandas needs more logic...not to talk of matplotlib ... Although i'll still use them ( pandas nd mpl) later |
Re: Chronicle Of A Data Scientist/analyst by cochtrane(m): 10:08am On Jun 03, 2020 |
Zabiboy:Cool. Share the results with us when you are done. |
Re: Chronicle Of A Data Scientist/analyst by Dum20: 10:39am On Jun 03, 2020 |
cochtrane: Okay great. 1. Firstly how did you start? 2. What are the courses you took to get to this stage of competence. 3. For how long have been in Data Science My background i am taking a Bussiness Intelligence course on Udemy. I have taken the statistics, SQL, Tableau parts of the course. I am just started the python section. But i still feel inadequate. I have also taken courses on Excel Power tools and Power BI. N.B. Can you give a step by step guide on how you got to final charts above. You do not need to go into details. Something like: 1. You used XYZ to extract data 2. You used ABC to clean the data 3. Used MNOP to visualise the data Thanks There are a lot of courses to study. I am wondering if i should stop for now and practice real life examples of the subjects i have learnt. 2 Likes |
(1) (2) (3) ... (37) (38) (39) (40) (41) (42) (43) ... (147) (Reply)
I Want To Learn Programming. Which Language Should I Start With?
(Go Up)
Sections: politics (1) business autos (1) jobs (1) career education (1) romance computers phones travel sports fashion health religion celebs tv-movies music-radio literature webmasters programming techmarket Links: (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) Nairaland - Copyright © 2005 - 2024 Oluwaseun Osewa. All rights reserved. See How To Advertise. 71 |