Machine Learning-based System Development Process Model: Difference between revisions

From GM-RKB
Jump to navigation Jump to search
m (Text replacement - ". " to ". ")
m (Text replacement - "<P> [[" to "<P>  [[")
 
Line 35: Line 35:
=== 2016 ===
=== 2016 ===
* http://databricks.com/blog/2016/05/31/apache-spark-2-0-preview-machine-learning-model-persistence.html
* http://databricks.com/blog/2016/05/31/apache-spark-2-0-preview-machine-learning-model-persistence.html
** QUOTE: So far, we have only looked at saving and loading a single ML model. In practice, [[ML workflow]]s consist of many stages, from feature extraction and transformation to model fitting and tuning. [[MLlib]] provides [[MLlib Pipeline|Pipeline]]s to help users construct these workflows. (See this notebook for a tutorial on [[ML Pipeline]]s analyzing a bike sharing dataset.)        <P>       [[MLlib]] allows users to save and load entire Pipelines. Let’s look at how this is done on an example Pipeline with these steps:
** QUOTE: So far, we have only looked at saving and loading a single ML model. In practice, [[ML workflow]]s consist of many stages, from feature extraction and transformation to model fitting and tuning. [[MLlib]] provides [[MLlib Pipeline|Pipeline]]s to help users construct these workflows. (See this notebook for a tutorial on [[ML Pipeline]]s analyzing a bike sharing dataset.)        <P>         [[MLlib]] allows users to save and load entire Pipelines. Let’s look at how this is done on an example Pipeline with these steps:
*** Feature extraction: Binarizer to convert images to black and white
*** Feature extraction: Binarizer to convert images to black and white
*** Model fitting: Random Forest Classifier to take images and predict digits 0–9
*** Model fitting: Random Forest Classifier to take images and predict digits 0–9

Latest revision as of 01:47, 27 February 2024

A Machine Learning-based System Development Process Model is a software development process model to develop machine learning-based systems.



References

2019b

2017

2016

  • http://databricks.com/blog/2016/05/31/apache-spark-2-0-preview-machine-learning-model-persistence.html
    • QUOTE: So far, we have only looked at saving and loading a single ML model. In practice, ML workflows consist of many stages, from feature extraction and transformation to model fitting and tuning. MLlib provides Pipelines to help users construct these workflows. (See this notebook for a tutorial on ML Pipelines analyzing a bike sharing dataset.)

       MLlib allows users to save and load entire Pipelines. Let’s look at how this is done on an example Pipeline with these steps:

      • Feature extraction: Binarizer to convert images to black and white
      • Model fitting: Random Forest Classifier to take images and predict digits 0–9
      • Tuning: Cross-Validation to tune the depth of the trees in the forest