<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet href="https://shkspr.mobi/blog/wp-content/themes/edent-wordpress-theme/rss-style.xsl" type="text/xsl"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	    xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	     xmlns:dc="http://purl.org/dc/elements/1.1/"
	   xmlns:atom="http://www.w3.org/2005/Atom"
	     xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	  xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>
<channel>
	<title>QADMSPML &#8211; Terence Eden’s Blog</title>
	<atom:link href="https://shkspr.mobi/blog/tag/qadmspml/feed/" rel="self" type="application/rss+xml" />
	<link>https://shkspr.mobi/blog</link>
	<description>Regular nonsense about tech and its effects 🙃</description>
	<lastBuildDate>Fri, 24 Jan 2025 09:36:01 +0000</lastBuildDate>
	<language>en-GB</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9.4</generator>

<image>
	<url>https://shkspr.mobi/blog/wp-content/uploads/2023/07/cropped-avatar-32x32.jpeg</url>
	<title>QADMSPML &#8211; Terence Eden’s Blog</title>
	<link>https://shkspr.mobi/blog</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title><![CDATA[Practical Machine Learning with Python]]></title>
		<link>https://shkspr.mobi/blog/2022/01/practical-machine-learning-with-python/</link>
					<comments>https://shkspr.mobi/blog/2022/01/practical-machine-learning-with-python/#respond</comments>
				<dc:creator><![CDATA[@edent]]></dc:creator>
		<pubDate>Mon, 24 Jan 2022 12:34:30 +0000</pubDate>
				<category><![CDATA[/etc/]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[MSc]]></category>
		<category><![CDATA[python]]></category>
		<category><![CDATA[QADMSPML]]></category>
		<guid isPermaLink="false">https://shkspr.mobi/blog/?p=41618</guid>

					<description><![CDATA[As part of my MSc I&#039;m taking a short course in Practical Machine Learning via QA.com.  The first three days were just about basic stats visualisation using Python. It was great to have a refresher - but I would have expected that to be a pre-requisite.  The tutor was excellent - very patient at explaining complex concepts. And the use of Jupyter Notebooks is a gamechanger for taught courses like…]]></description>
										<content:encoded><![CDATA[<p>As part of my <a href="https://shkspr.mobi/blog/tag/MSc">MSc</a> I'm taking a short course in <a href="https://web.archive.org/web/20220518000827/https://www.qa.com/course-catalogue/courses/practical-machine-learning-qadmspml/">Practical Machine Learning</a> via QA.com.</p>

<p>The first three days were just about basic stats visualisation using Python. It was great to have a refresher - but I would have expected that to be a pre-requisite.</p>

<p>The tutor was excellent - very patient at explaining complex concepts. And the use of Jupyter Notebooks is a gamechanger for taught courses like this.</p>

<p>Ultimately, it was a useful course - although I expected a lot more time to be spent on training for machine learning models rather than the underlying statistics.</p>

<p>These are mostly notes to myself to help consolidate my knowledge - and to provide some more information on the course itself if you are thinking of taking it.</p>

<h2 id="day-1"><a href="https://shkspr.mobi/blog/2022/01/practical-machine-learning-with-python/#day-1">Day 1</a></h2>

<p>All done in Python, pretty standard.</p>

<h3 id="python"><a href="https://shkspr.mobi/blog/2022/01/practical-machine-learning-with-python/#python">Python</a></h3>

<p><a href="https://packaging.python.org/en/latest/guides/installing-using-pip-and-virtual-environments/#creating-a-virtual-environment">Python Virtual Environments</a>.</p>

<ul>
<li>Create <code>python3 -m venv SomeName</code></li>
<li>Activate <code>source path/to/SomeName/bin/activate</code></li>
<li>Install packages <code>pip install -U whatever</code></li>
<li>Run <code>python</code></li>
<li>To exit the venv, <code>deactivate</code></li>
</ul>

<h3 id="jupyter-notebooks"><a href="https://shkspr.mobi/blog/2022/01/practical-machine-learning-with-python/#jupyter-notebooks">Jupyter Notebooks</a></h3>

<ul>
<li>In the venv <a href="https://jupyter.org/install#installation-with-pip"><code>pip install -U jupyterlab</code></a></li>
<li>Run it with <code>jupyter notebook</code></li>
<li>Don't forget to "Close and Halt" to stop the notebooks running in the background.</li>
<li>Don't <code>pip install</code> from within Jupyter</li>
</ul>

<h3 id="anaconda"><a href="https://shkspr.mobi/blog/2022/01/practical-machine-learning-with-python/#anaconda">Anaconda</a></h3>

<p>Possibly the easiest way to do everything (debatable!)
* <a href="https://docs.anaconda.com/anaconda/install/linux/">Install Anaconda for Linux</a></p>

<h3 id="why"><a href="https://shkspr.mobi/blog/2022/01/practical-machine-learning-with-python/#why">Why</a></h3>

<p>The process of automatically extracting meaning from data.</p>

<p>Data can be raw and unstructured. Lots of modern data is structured - e.g. tabular, database. But unstructured data - mostly media - isn't easy to classify and extract information from.</p>

<p>Exponential growth of data. 90% of the world's data has been created in the last few months. Rise of sensor data, etc.</p>

<p>Use of modern data science tools to invite others to reproduce your results.</p>

<p>Data Science = Turn data into a valuable asset, gain insight, make decisions and take actions.
Data Analyst explains. Data Scientists predict and visualise.</p>

<h3 id="python-basics-and-jupyter-basics"><a href="https://shkspr.mobi/blog/2022/01/practical-machine-learning-with-python/#python-basics-and-jupyter-basics">Python basics and Jupyter basics</a></h3>

<p>Different data types. Tuples, Dicts.</p>

<p><a href="https://news.ycombinator.com/item?id=17856700">Some reasons <em>not</em> to use notebooks</a>.</p>

<p>Basic Python and Markdown syntax.</p>

<h3 id="stats"><a href="https://shkspr.mobi/blog/2022/01/practical-machine-learning-with-python/#stats">Stats</a></h3>

<p>The usual intro to mean, median, mode, and basic statistical techniques. Basically enough to make sure everyone is on the same page.</p>

<h2 id="day-2"><a href="https://shkspr.mobi/blog/2022/01/practical-machine-learning-with-python/#day-2">Day 2</a></h2>

<p>Numpy. What it is, how it works, how fast it is. N-Dimensional Arrays</p>

<p>Pandas. Again, overview of the basics. Series and DataFrames.</p>

<h2 id="day-3"><a href="https://shkspr.mobi/blog/2022/01/practical-machine-learning-with-python/#day-3">Day 3</a></h2>

<p>Matplotlib and Seaborn. Again, basic interfaces for drawing graphs and getting data out of DataFrames.</p>

<h2 id="day-4"><a href="https://shkspr.mobi/blog/2022/01/practical-machine-learning-with-python/#day-4">Day 4</a></h2>

<p>Linear regression, best fit. K-Nearest-Neighbours - use of random samples to test the K.</p>

<h2 id="day-5"><a href="https://shkspr.mobi/blog/2022/01/practical-machine-learning-with-python/#day-5">Day 5</a></h2>

<p>Voronoi Diagrams and basic clustering. Mixture models.</p>
<img src="https://shkspr.mobi/blog/wp-content/themes/edent-wordpress-theme/info/okgo.php?ID=41618&HTTP_REFERER=RSS" alt="" width="1" height="1" loading="eager">]]></content:encoded>
					
					<wfw:commentRss>https://shkspr.mobi/blog/2022/01/practical-machine-learning-with-python/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
	</channel>
</rss>
