The Main Application Window
Once a project is loaded, the Main Application Window becomes your primary workspace. It features a two-panel layout designed for an efficient workflow.
Left Panel (Image View)
The Image View displays your project’s images sequentially. This is an interactive canvas where OCR-detected text boxes are rendered directly on the images.
- Top Controls:
  - Settings (cog icon): Opens the application settings dialog.
  - Progress Bar: Indicates the progress of the Process OCR operation.
- Image Area:
  - Navigation: Use your mouse wheel or the scrollbar to move through the images.
  - Text Box Interaction:
    - Select: Click a text box to select it. It will be highlighted with a blue frame, and its properties will load in the Right Panel.
    - Move: Drag a selected text box to reposition it.
    - Resize: Drag the handles on the corners and sides of a selected text box.
    - Rotate: Drag the rotation handle (circular arrow) above a selected text box.
    - Perspective Transform: Hold Ctrl while dragging a corner handle to apply perspective distortion.
  - Overlay Controls: A floating overlay at the bottom-center provides quick access buttons:
*(Placeholder: Replace with actual screenshot of the Image View with selected text box and overlay buttons)*
Right Panel (Control & Edit View)
The Right Panel contains all the tools for managing your project’s data, text, and styles.
*(Placeholder: Replace with actual screenshot of the Right Panel)*
- Top Controls:
  - Process OCR: Starts the batch OCR process for all images.
  - Stop OCR: Stops an ongoing OCR process.
  - Manual OCR: Toggles Manual OCR mode.
  - Profile Selector: A dropdown menu to switch between different text profiles.
  - Import/Export Menu (three bars icon): Opens a menu for importing and exporting data.
- Main Widgets:
  - Results Widget: A table of all OCR text entries for detailed editing and management.
  - Text Box Style Panel: Appears when a text box is selected, providing detailed styling options.
  - Find/Replace Widget: A tool for searching and replacing text throughout your project.
- Bottom Controls:
  - AI Translation: Launches the Gemini Translation window.
  - Advanced Mode Checkbox: Toggles advanced filtering and sorting options in the Results Widget.
Menu Bar
The application menu bar at the top of the window provides access to all major functions.
- File: Manage projects (New, Open, Save (Ctrl+S)), import/export data, and exit the application.
- Edit: Access tools like Find/Replace (Ctrl+F).
- Tools: Access primary workflows like Batch OCR, Manual OCR, Stitch/Split Images, and AI Translation.
- View: Toggle interface options like Advanced Mode.
- Settings (Ctrl+,): Open the application settings.
- Help: View application information and check for updates.