Release v0.10.0 · microsoft/SynapseML

Release v0.10.0 · microsoft/SynapseML

Added a Text to Speech example to cognitive service overview ( #1350 )
Added opinion mining to TextSentiment Models ( #1449 )
Fixed Azure Maps schemas ( #1553 )
Removed modelID param validators in FormRecognizerV3 ( #1551 )
Fixed form recognizer and form ontology learner issues ( #1506 )
Fixed setServiceName python method in OpenAI ( #1498 )
Fixed error in Text Analytics Analyze schema
Improved error handling for MVAD ( #1448 , #1391 )
Removed unused concurrency parameter for MVAD ( #1383 )
Improved robustness of flood risk notebook by adding polling ( #1427 )
Responsible AI at Scale
????
Added partial dependence plots (PDP) to allow for understanding how independent variables affect a model's prediction ( #1426 )
Updated ICE/PDP documentation with PDP-based feature importance and additional examples ( #1441 , #1352 )
Updated data balance documentation to better describe how it can be used to ensure model fairness ( #1540 )
MLFlow
LightGBM on Spark
????
Added the ability to pass in generic argument strings to LightGBM enabling many complex parameterizations ( #1444 )
Added seed parameters to LightGBM ( #1387 )
Added a method to get LightGBM native model string directly ( #1515 )
Fixed issue with validation data creation during useSingleDataset mode ( #1527 )
Fixed multiclass training with initial scores ( #1526 )
Fixed saving LightGBM model iterations with early stopping ( #1497 )
Fixed issue where chunk size parameter was incorrectly specified during data copy ( #1490 )
Fixed issue where when empty partition is chosen as the main worker in singleDatasetMode ( #1458 )
Fixed bug with data repartitioning in LightGBMRanker ( #1368 )
Fixed outdated docs for useSingleDatasetMode ( #1562 )
Refactored LightGBM class structure to improve logging and debugging ( #1557 )
Vowpal Wabbit
Fixed issues with the saveNativeModel for the VWRegressionModel #1364 ( #1366 )
Fixed issues with building quadratic interaction terms ( #1460 )
Isolation Forests
Split Synapse tests into multiple test ( #1377 )
Moved from ADO Pipelines to GitHub Workflows ( #1406 )
Website Improvements
Fixed MathJax expressions rendering ( #1343 )
Fixed google analytics gtags ( #1434 )
Corrected placement of BingSiteAuth.xml config ( #1445 , #1439 )
Fixed website security and upgrade docusaurus ( #1545 )
Moveed Geospatial Services to its own folder ( #1345 )
Bumped minimist from 1.2.5 to 1.2.6 in /website ( #1455 )
Bumped node-forge from 1.2.1 to 1.3.0 in /website ( #1451 )
Bumped prismjs from 1.25.0 to 1.27.0 in /website ( #1430 )
Bumped follow-redirects from 1.14.7 to 1.14.8 in /website ( #1402 )
Bumped nanoid from 3.1.23 to 3.2.0 in /website ( #1355 )
Bumped shelljs from 0.8.4 to 0.8.5 in /website ( #1347 )
Bumped follow-redirects from 1.14.1 to 1.14.7 in /website ( #1348 )
Bumped cross-fetch from 3.1.4 to 3.1.5 in /website ( #1496 )
Bumped async from 2.6.3 to 2.6.4 in /website ( #1481 )
Pinned onnxmltools to a specific version ( #1524 )
Bug Fixes
Fixed twitter sentiment detection notebook ( #1544 )
Fixed issue with DataConversion serialization ( #1505 )
Fixed typos in TestBase ( #1501 )
Fixed issue in GridSpace python API ( #1470 )
Fixed reflective class loading in IntelliJ ( #1456 )
Removed verbose ComputeModelStatistics output and convert scoredLabelsCol to DoubleType ( #1361 )
Fixed flaking in geospatial notebooks
Code Style
Improved style checks using pre-commit ( #1538 , #1528 , #1535 )
Formatted code and notebooks with Black style checker ( #1522 , #1520 )
Documentation
Keerthi Yanda
Yagna Oruganti
Mark is a Senior Software Engineer on the SynapseML team with a deep knowledge of the .NET ecosystem and infrastructure development. In this release, Mark architected SynapseML’ s .NET binding blob publishing strategy, drove the OpenAI GPT-3 bindings to completion, and wrote a detailed GPT-3 walkthrough . Mark completed these projects while supporting the Time Series Insights service, speaking to his ability to keep multiple plates spinning at a time.
Keerthi is a Software Engineer II on the SynapseML team. Despite joining Microsoft just a few months ago, Keerthi has quickly learned the SynapseML ropes to take command of our integration with the Azure Synapse platform. Huge kudos to her for braving long build times, and daunting error messages to make sure SynapseML works out of the box on Synapse Analytics clusters.
Yagna is a Senior Data and Applied Scientist on the Industry AI team with a talent for building solutions that integrate many community tools to solve customer challenges. Yagna's first contribution to SynapseML was a masterpiece of a demo showing how to use Isolation Forests, MLFlow, Tabular SHAP, and the interpret-ml explanation dashboard in a single anomaly detection example.
Acknowledgements
We would like to acknowledge the developers and contributors, both internal and external, who helped create this version of SynapseML
Serena Ruan @serena-ruan , Eric Dettinger, Scott Votaw @svotaw , Puneet Pruthi @ppruthi , Ric Serradas @riserrad , Mark Niehaus @niehaus59 , Kyle Rush @k-rush , Keerthi Yanda @KeerthiYandaOS , Yagna Oruganti @YagnaDeepika , Jason Wang @memoryz , Ilya Matiach @imatiach-msft , Yazeed Alaudah @yalaudah , Elena Zherdeva @ezherdeva , Kashyap Patel @ms-kashyap , Martha Laguna @martthalch @marthalc , Alex Li @liyzcj , Maria Guirguis @maguir , Alexandra Savelieva @alsavelv , @netang , Sudhindra Kovalam @SudhindraKovalam , Markus Cozowicz @eisber , Tom Finley, Markus Weimer, Jeff Zheng, James Verbus @jverbus , Chris Hoder, Misha Desai, Nellie Gustafsson, Eren Orbey, Beverly Kodhek, Louise Han @jr-MS , Justyna Lucznik, Kim Manis, Mitrabhanu Mohanty, Bogdan Crivet, Anand Raman, William T. Freeman, James Montemagno, Luis Quintanilla, Dennis Kennedy, Ryan Hurey, Jarno Ensio, Brian Mouncer, Steve Suh @suhsteve , Akshaya Annavajhala (AK), Guolin Ke, Tara Grumm, Niharika Dutta @Niharikadutta , Andrew Fogarty, Juanyong Duan, Weichen Xu @WeichenXu123 , Spark.NET Team, ONNX Team, Azure Global, Vowpal Wabbit Team, LightGBM Team, MSFT Garage Team, MSR Outreach Team, Speech SDK Team, MLflow Team
Learn More

Images Powered by Shutterstock