
Stochastic Gradient Descent

Explore the implementation and use of stochastic gradient descent with momentum and Nesterov acceleration in JAX and Flax. Understand various optimizers including Noisy SGD, Optimistic Gradient Descent, RMSProp, and Yogi, and learn how to apply them to improve model training and convergence.


SGD implements stochastic gradient descent with support for momentum and Nesterov acceleration (a method for accelerating the convergence of iterative optimization algorithms that is commonly used in machine learning). Momentum speeds up the search for optimal model weights by accelerating gradient descent along directions in which successive gradients agree.

Gradient function
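
To make the momentum and Nesterov options concrete, here is a minimal sketch of how they can be enabled when constructing the optimizer with Optax; the learning rate of 1e-4 and momentum of 0.9 are illustrative values, not recommendations from this lesson:

import optax

# Plain SGD: parameters move directly along the negative gradient.
plain_sgd = optax.sgd(learning_rate=1e-4)

# SGD with momentum: a decaying running average of past gradients (the velocity)
# accelerates updates along directions where successive gradients agree.
momentum_sgd = optax.sgd(learning_rate=1e-4, momentum=0.9)

# Nesterov acceleration: the gradient is effectively evaluated at a look-ahead
# point, which often improves convergence over plain momentum.
nesterov_sgd = optax.sgd(learning_rate=1e-4, momentum=0.9, nesterov=True)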

Let’s understand how to use SGD in the following playground:

import jax.numpy as jnp
import optax
from jax import random

seed = random.PRNGKey(0)  # PRNG key for reproducible weight initialization
learning_rate = jnp.array(1e-4)  # Constant learning rate of 0.0001
model = CNN()  # CNN model defined in an earlier lesson
weights = model.init(seed, X_train[:5])  # Initialize weights using a small sample batch
optimizer = optax.sgd(learning_rate=learning_rate)  # Initialize SGD as the optimizer
optimizer_state = optimizer.init(weights)  # Initialize the optimizer state

In the code above:

...
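
Once the optimizer state has been initialized, each training step computes the gradients of a loss with respect to the weights and lets the optimizer transform them into parameter updates. The sketch below assumes a hypothetical loss_fn and a labeled batch (X_batch, y_batch) that are not part of this lesson; only optimizer.update and optax.apply_updates come from Optax itself:

import jax
import optax

def loss_fn(weights, X_batch, y_batch):
    # Hypothetical loss: mean cross-entropy between model predictions and integer labels.
    logits = model.apply(weights, X_batch)
    return optax.softmax_cross_entropy_with_integer_labels(logits, y_batch).mean()

@jax.jit
def train_step(weights, optimizer_state, X_batch, y_batch):
    # Compute the loss and its gradients with respect to the weights.
    loss, grads = jax.value_and_grad(loss_fn)(weights, X_batch, y_batch)
    # Let SGD turn the raw gradients into parameter updates.
    updates, optimizer_state = optimizer.update(grads, optimizer_state, weights)
    # Apply the updates to obtain the new weights.
    weights = optax.apply_updates(weights, updates)
    return weights, optimizer_state, loss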