
TransmogrifAI - ML Library

Machine learning models — artificial intelligence (AI) that identifies relationships among hundreds, thousands, or even millions of data points — are rarely easy to architect.

Data scientists spend weeks or months not only preprocessing the data on which the models are to be trained, but also extracting useful features (the input variables a model learns from) from that data, narrowing down algorithms, and ultimately building (or attempting to build) a system that performs well not just within the confines of a lab but in the real world.

Salesforce’s new toolkit aims to ease that burden somewhat. On GitHub today, the San Francisco-based cloud computing company published TransmogrifAI, an automated machine learning library for structured data — the kind of searchable, neatly categorized data found in spreadsheets and databases — that performs feature engineering, feature selection, and model training in just three lines of code.

It’s written in Scala and built on top of Apache Spark (some of the same technologies that power Salesforce’s AI platform, Einstein) and was designed from the ground up for scalability. To that end, it can process datasets ranging from dozens to millions of rows and run either on a Spark cluster or on an off-the-shelf laptop.

Mayukh Bhaowal, director of product management for Salesforce Einstein, told VentureBeat in a phone interview that TransmogrifAI essentially transforms raw datasets into custom models. It’s the evolution of Salesforce’s in-house machine learning library, which allowed the Einstein team to deploy custom models for enterprise clients in just hours.

“It’s informed by what our data scientists learned while building Einstein,” Bhaowal explained. Chief among those lessons: Custom-built models beat global, pretrained models. “If you’re using the same model to make predictions for a Fortune 500 company and a mom and pop shop, you’ll have a hard time finding the right pattern.”

Machine learning made easy

TransmogrifAI offers a three-step workflow.

First is feature inference and automated feature selection. It’s a crucial part of model training, as selecting the wrong features could result in an overly optimistic, inaccurate, or biased model.

Using TransmogrifAI, users specify a schema for their data, which the library uses to extract features automatically (such as phone numbers and zip codes). It also performs statistical tests, automatically treating text fields with low cardinality (i.e., a small number of distinct values) as categorical and throwing out features with little-to-no predictive power, or those that are likely to result in hindsight bias (here, features that encode information available only after the outcome is known) and other unwanted signals.
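
The statistical checks in this step can be sketched in a few lines of plain Python. This is a conceptual illustration only, not TransmogrifAI's API (the library itself is Scala on Spark); the function names, the cardinality cutoff, and the correlation thresholds are assumptions chosen for the example. A feature that barely correlates with the label is dropped as unpredictive, and one that correlates almost perfectly is dropped as a likely leak.

```python
from math import sqrt


def is_low_cardinality(values, max_unique=10):
    """Flag a text column as categorical when it has few distinct values.

    The cutoff of 10 is an illustrative assumption.
    """
    return len(set(values)) <= max_unique


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric lists (0.0 if either is constant)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def select_features(columns, label, min_corr=0.1, max_corr=0.95):
    """Keep features whose absolute correlation with the label lies in [min_corr, max_corr].

    Below min_corr: little predictive power. Above max_corr: suspiciously
    close to the label itself, so treated as possible leakage.
    Both thresholds are illustrative assumptions.
    """
    kept = {}
    for name, values in columns.items():
        strength = abs(pearson(values, label))
        if min_corr <= strength <= max_corr:
            kept[name] = strength
    return kept


label = [0, 0, 0, 1, 1, 1]
columns = {
    "useful": [1, 2, 3, 3, 5, 4],    # moderately correlated with the label
    "leak": [0, 0, 0, 1, 1, 1],      # identical to the label
    "noise": [1, 2, 3, 1, 2, 3],     # uncorrelated with the label
    "constant": [7, 7, 7, 7, 7, 7],  # no variance at all
}
kept = select_features(columns, label)
```

With this toy data only the "useful" column survives; the other three are discarded for the reasons noted in the comments.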

In a demo, Bhaowal showed how TransmogrifAI could quickly isolate features like job titles, emails, and addresses and figure out whether they’re predictive. Those that aren’t — salutation, in this case — were discarded automatically. “It’s perfect for dimensionality reduction,” he said, referring to the process of reducing the number of features on which the model is trained.

The next step in TransmogrifAI’s flow is automated feature engineering. Drawing on the feature types extracted in the first step, the library transforms structured data into vectors, automatically taking, for example, a list of phone numbers and splitting out the country code to see if a phone number is valid.
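
As a rough illustration of the phone-number example, the sketch below turns a raw phone string into a small numeric vector. It is plain Python rather than TransmogrifAI code, and the lookup table of country calling codes with their expected national-number lengths is a hypothetical, deliberately tiny stand-in for real validation rules.

```python
# Hypothetical lookup: country calling code -> expected length of the national number.
NATIONAL_LENGTHS = {"1": 10, "61": 9, "91": 10}


def phone_to_vector(raw):
    """Encode a phone string as [is_valid, country_code_missing].

    A number counts as valid only if it starts with "+", a known country
    code, and has the expected number of remaining digits.
    """
    text = raw.strip()
    digits = "".join(ch for ch in text if ch.isdigit())
    if not text.startswith("+"):
        return [0.0, 1.0]  # no country code to split out
    for code, length in NATIONAL_LENGTHS.items():
        if digits.startswith(code) and len(digits) - len(code) == length:
            return [1.0, 0.0]
    return [0.0, 0.0]  # country code present but the number does not fit


vectors = [phone_to_vector(p) for p in ["+1 415 555 0100", "+91 12345", "555-0100"]]
```

The point of the encoding is that a downstream model sees "valid or not" and "code missing or not" as separate signals instead of an opaque string.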

Once TransmogrifAI has extracted features from the dataset, it’s primed to begin automated model training. At this stage, it runs a set of machine learning algorithms in parallel on the data, automatically selects the best-performing model, and samples and recalibrates predictions to compensate for imbalanced data.
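
The idea of trying several algorithms and keeping the best one can be shown with a minimal sketch. Everything here is an assumption made for illustration: the two toy "algorithms" (a majority-class baseline and a single-threshold classifier), the 3-fold cross-validation, and accuracy as the metric. It runs the candidates sequentially and does not reproduce TransmogrifAI's actual model selector or its handling of imbalanced data.

```python
def majority_model(train):
    """Baseline: always predict the most common label in the training rows."""
    labels = [y for _, y in train]
    guess = max(set(labels), key=labels.count)
    return lambda x: guess


def threshold_model(train):
    """Predict by comparing x with the midpoint between the two class means."""
    pos = [x for x, y in train if y == 1]
    neg = [x for x, y in train if y == 0]
    if not pos or not neg:
        return majority_model(train)
    mean_pos, mean_neg = sum(pos) / len(pos), sum(neg) / len(neg)
    cut = (mean_pos + mean_neg) / 2
    high_is_positive = mean_pos > mean_neg
    return lambda x: int((x > cut) == high_is_positive)


def cross_val_accuracy(fit, data, k=3):
    """Average accuracy over k folds; fold i holds out rows i, i+k, i+2k, ..."""
    scores = []
    for i in range(k):
        test = data[i::k]
        train = [row for j, row in enumerate(data) if j % k != i]
        predict = fit(train)
        hits = [int(predict(x) == y) for x, y in test]
        scores.append(sum(hits) / len(hits))
    return sum(scores) / len(scores)


def select_best(candidates, data, k=3):
    """Score every candidate with cross-validation and return the winner's name."""
    scores = {name: cross_val_accuracy(fit, data, k) for name, fit in candidates.items()}
    return max(scores, key=scores.get), scores


# Rows are (feature value, label).
data = [(1.0, 0), (8.0, 1), (2.0, 0), (9.0, 1), (3.0, 0),
        (7.0, 1), (1.5, 0), (8.5, 1), (2.5, 0)]
best, scores = select_best({"majority": majority_model, "threshold": threshold_model}, data)
```

On this cleanly separable toy data the threshold classifier scores higher than the baseline and is selected.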

Core to TransmogrifAI’s training is what Shubha Nabar, senior director of data science for Salesforce Einstein, calls “model explainability” — transparency about the factors influencing a model’s predictions. “From a trust and data privacy perspective, it’s important that the generated model isn’t a ‘black box’,” she said. “[TransmogrifAI] shows the global effects of each feature.”

And that’s just the tip of a very tall iceberg.

TransmogrifAI boasts tools that make it easier to adjust hyperparameters — variables such as sampling rate and filters — that influence and optimize machine learning models. And within integrated development environments that support it, TransmogrifAI highlights typos and syntax errors, suggests code completion, and “types” features with an extensible hierarchy, allowing users to differentiate between nuanced and primitive features.
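
One way to picture an extensible feature-type hierarchy is as a small class tree in which specific types (an email address) inherit from primitive ones (text). The sketch below is a hypothetical Python analogy; the class names and the handling rules are assumptions for illustration and are not taken from the library's own type definitions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FeatureType:
    """Root of the illustrative hierarchy; wraps a single raw value."""
    value: object


class Text(FeatureType):
    """Primitive text feature."""


class Email(Text):
    """More specific text feature."""


class Real(FeatureType):
    """Primitive numeric feature."""


class Percent(Real):
    """More specific numeric feature."""


def lineage(feature):
    """List the feature's types from most specific to most general."""
    return [cls.__name__ for cls in type(feature).__mro__ if issubclass(cls, FeatureType)]


def handling(feature):
    """Pick a treatment by type, checking the most specific types first."""
    if isinstance(feature, Email):
        return "split out the domain"
    if isinstance(feature, Text):
        return "encode as text"
    if isinstance(feature, Real):
        return "scale numerically"
    return "pass through"


email = Email("ada@example.com")
```

Because Email is also Text, code written for text features still accepts it, while email-specific handling can be added without touching the primitive type.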

“[TransmogrifAI] has been transformational for us, [reducing] the average turn-around time for training a performant model to a couple of hours and enabling our data scientists to deploy thousands of models in production with minimal hand-tuning,” Bhaowal said. “The goal of democratizing machine learning can only be achieved through an open exchange of ideas and code, and diverse perspectives from the community will make the technology better for everyone.”

Coincidentally, the public launch of TransmogrifAI comes a day after the open-sourcing of Oracle’s GraphPipe, a tool that makes it easier to deploy machine learning models made by frameworks like Google’s TensorFlow, MXNet, Facebook’s Caffe2, and PyTorch in the cloud.


Also published on Medium.

  • October 17, 2019