Automate Anything: Using Sikuli and Python + Real Examples!

Automate anything visible on the screen using Python and Sikulix on Win, Mac, Linux

Hello and welcome to the Amazing world of Sikuli Automation.

What you’ll learn

  • Expert level understanding of Sikuli Tool.
  • Automate anything visible on the screen.
  • Automate anything on Windows, Mac and Linux.
  • Automate Web, Desktop, Mobile applications.
  • Basic Python scripting.
  • Automate Manual Testing with Python.
  • Automate Daily Admin/Repetitive Tasks.
  • Ready to face interviews on Sikuli tool.

Course Content

  • Introduction –> 4 lectures • 10min.
  • Installation & Setup –> 7 lectures • 14min.
  • Sikuli-IDE Interface –> 8 lectures • 13min.
  • Basic Functions –> 6 lectures • 14min.
  • Matching and Offset –> 3 lectures • 10min.
  • Mouse Functions –> 9 lectures • 21min.
  • Keyboard Functions –> 6 lectures • 17min.
  • Python Basics with Sikuli –> 14 lectures • 1hr 4min.
  • Find Functions –> 6 lectures • 20min.
  • I/O Handling –> 6 lectures • 7min.

Automate Anything: Using Sikuli and Python + Real Examples!

Requirements

  • Very basic programming knowledge.

Hello and welcome to the Amazing world of Sikuli Automation.

This course is an introduction to Sikuli, a tool that lets you automate anything you see on the screen. It uses image recognition to identify and control GUI components. It is useful when there is no easy access to a GUI’s internal or source code. You can easily Automate the Boring Stuff with Python Programming.

What is SikuliX?

  • Sikuli automates anything visible on the screen of your computer running Windows, Mac or some Linux/Unix
  • It uses image recognition to identify and control GUI components, like a real user.
  • It can run the mouse and the keyboard to interact with the identified GUI elements through image recognition
  • Sikuli comes with basic text recognition that can be used to search text in images.
  • Sikuli was developed at the User Interface Design Group at MIT
  • Sikuli has an OCR-mode, in which image matches are performed after converting those image patterns to text. The OCR-functionality is powered by Tesseract, an open-source optical character recognition engine whose development is sponsored by Google.