Remove unwanted words and format scraped data in ZeroWork
Table of Contents
Introduction
In this tutorial, we'll learn how to remove unwanted words and format scraped data using ZeroWork. This process is essential for ensuring that the names extracted from profiles are clean and usable, particularly when sending automated messages. By the end of this guide, you'll be able to effectively clean your data and extract first and last names without any extraneous titles.
Step 1: Run the Existing TaskBot to Identify Issues
Before making changes, it's important to see the current output of your TaskBot. This step will help you understand the specific problems with the data being extracted.
- Open the data table in ZeroWork.
- Run the existing TaskBot to see how names are currently being processed.
- Note any unwanted titles (e.g., "Dr", "PhD", "MBA") that are affecting the extraction of first names.
Step 2: Set Up the TaskBot for Data Cleaning
Now that you have identified the issues, it's time to modify your TaskBot to remove unwanted words before extracting names.
- Delete the existing data in your TaskBot to start fresh.
- Add the building block called Format Data.
- Select the action Remove Words.
Adding Unwanted Words
- In the input field for the words to remove, enter the unwanted titles separated by commas. For example:
Dr, PhD, MBA, Master of Arts
Step 3: Connect the Data Flow
After setting up the removal of unwanted words, you need to connect this action to the rest of your data processing.
- Select the profiles where the full names are stored.
- Ensure you configure the TaskBot to save the full name before you format it.
- Connect the formatting step to the action of saving the name.
Step 4: Split the Full Name into First and Last Names
Once the unwanted words are removed, you can proceed to split the cleaned full name into first and last names.
- Use the appropriate action in your TaskBot to split the names.
- Ensure that the first name is now extracted correctly without any titles.
Step 5: Run the Modified TaskBot
With all changes in place, it's time to test the new setup.
- Run the modified TaskBot and allow it to process the profiles.
- Open the results to verify that the unwanted words have been removed and that the first names are now accurate.
Conclusion
In this tutorial, you have learned how to effectively remove unwanted words from scraped data and format it for automated processes in ZeroWork. By following these steps, you can ensure that your automated messages address individuals correctly, enhancing engagement and professionalism. For further learning, consider exploring more advanced features in ZeroWork or checking the complete crash course playlist linked in the introduction.