What's New

Starting with 2.0.3, Dictator no longer supports downlevel API features (Android 3.x and earlier).

Starting with 2.0.3, the change history prior to 2.0.2 has been archived.

2.0.3

All-new version for Android 4 (API 17+) devices.
Audio Capture is disabled.
Android 4 no longer supports applications in Global Search.

Overview

Thanks for choosing Dictator!

Dictator is a dictation-taking application.

Not all devices are equipped with Speech Recognition. Dictator warns you if it cannot detect Speech Recognition installed on your device.

Text with this visual appearance describes behavior that is different, depending on the version of Android on your device.

Components

Dictator Components

These exist within the Dictator application.

The Main Screen is the "home page" showing the sortable Item List, Current Item, and Tag Cloud.
The Open Mic is the screen you use to repeatedly record new dictations.
The Edit Item is the screen you use to edit saved dictations.
The Quick Record is a special-purpose screen for one-touch eyes-free recording.

Dictation Data Fields

Created (read-only): timestamp.
Title (string): title. Filled in automatically.
Body (string): body. Filled by Speech Recognition.
Keywords (string): keywords. Separate multiple keywords with commas. Filled in from the Current Keyword setting (see below).

The data fields are editable in the Edit Item screen.
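As an illustration only, the four data fields can be pictured as a small record. The class name `Dictation` and the helper `keyword_list` are hypothetical and are not taken from the app; the sketch just shows how a comma-separated Keywords string splits into individual keywords.

```python
from dataclasses import dataclass, field
from datetime import datetime


@dataclass
class Dictation:
    """Hypothetical record mirroring the four data fields listed above."""
    title: str
    body: str
    keywords: str = ""  # comma-separated, as entered by the user
    created: datetime = field(default_factory=datetime.now)  # read-only in the UI

    def keyword_list(self) -> list[str]:
        """Split the comma-separated Keywords field, dropping blank entries."""
        return [k.strip() for k in self.keywords.split(",") if k.strip()]


item = Dictation(title="Groceries", body="Buy milk and eggs", keywords="home, shopping")
print(item.keyword_list())  # ['home', 'shopping']
```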

Device Components

These already exist on your device, for use by all applications.

Action Bar: this is the way Android 4 applications organize their menu commands and navigation items.
Speech Recognition: The ability to record audio and convert it into text. Dictator does not implement Speech Recognition.
Text-to-Speech: The ability to convert text into digital audio. You must have TTS resources installed separately.
Accelerometer: The Shake feature uses this to determine when you are shaking the device.
Proximity: The Proximity Sensor feature uses this to determine when the device is in "talk" position.
Volume Up/Down Keys: The Quick Keys feature uses these physical keys to trigger and/or cancel dictation.
Pointer: Ability to select list elements other than by Touch Events, with a D-pad or Trackball.
Global Search: All dictations are searchable from the device's Global Search.
Audio Capture: for devices that provide audio data during Speech Recognition, this data can be saved as audio files.
Media Player: Captured audio files are optionally added to your device's Media Player.
External Storage: Audio files are stored on your device's External Storage.

Speech Recognition may require a data connection. If your device switches between Cellular Data and WiFi while you are recording, Speech Recognition may report a Network Error.

Common Visual Elements

Dictator reuses many visual elements across all screens. They are described in the following sections.

Audio View

The Audio view displays during recording.

Cancel Recording button.
Displays what is being recorded and the language.
VU meter. This updates as you speak.

Encoder View

The Encoder View displays during recording. It shows the progress of audio capture.

Recording icon.
Encoding progress. Encoding is paused during audio recording, and the encoder accumulates the data.
Current output file size.

Playback View

The Playback View displays after recording all prompts. It allows you to listen to the audio capture.

Name of audio file.
Play/Pause button.
Playback progress.
Total Duration of audio file.

Readback View

If enabled, the Readback view displays the Speech Recognition results.

The Readback view displays for 5 seconds.

More Results

Returning More Results is an optional feature of Speech Recognition providers.

Speech Recognition usually returns several candidate versions of what it recognized; the "top" result is used to set the field value for a prompt.

If more results are available, a button is displayed next to the corresponding field. Selecting this button displays the list; selecting from the list overwrites that field. Selecting Back cancels.

Example

Everyone's voice and speech are different. This example is based on real results! Suppose you actually spoke:

Do this that and the other.

The top result is this:

Do this that any other.

However, you will find a more-accurate version in the More Results list:

Do this that any other
Do this at any other
Do this step any other
Dude is there any other
Do this to any other
Dude is that any other
Do this to at any other
Do this that the other
Do this step in the other
Do this that in the other
Do this that and the other
Do this bethany over
Do this tiffany over
Do this to happen the other
Do this that any over

Dictator keeps the More Results of every dictation item for as long as you have the app open.
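The behavior described in this section can be sketched in a few lines. This is a minimal illustration, assuming the results arrive as a list ordered best-first; the function names are hypothetical and do not come from the app.

```python
def top_result(results: list[str]) -> str:
    """Return the value used to fill the field: the first (top) result."""
    return results[0] if results else ""


def choose_alternative(results: list[str], index: int) -> str:
    """Return the alternative picked from More Results; it overwrites the field."""
    return results[index]


results = [
    "Do this that any other",
    "Do this at any other",
    "Do this that and the other",
]
body = top_result(results)             # initial field value
body = choose_alternative(results, 2)  # user picks a better match
print(body)  # Do this that and the other
```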

Confidence Score

Some providers also return a Confidence Score, indicating how "good" the match is. If present, Dictator color-codes this information, with Green being 100%, and Red being 0%, with colors in-between for intermediate values.
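One way to picture the color coding is a straight blend from red to green. The sketch below assumes the score is normalized to the range 0.0 to 1.0 and that intermediate colors are interpolated linearly; the actual colors Dictator uses for intermediate values are not specified here, and `confidence_color` is a hypothetical name.

```python
def confidence_color(score: float) -> tuple[int, int, int]:
    """Map a confidence score in [0.0, 1.0] to an (R, G, B) tuple.

    0.0 maps to pure red and 1.0 to pure green. The linear blend for
    intermediate values is an assumption made for this sketch.
    """
    s = min(max(score, 0.0), 1.0)  # clamp out-of-range scores
    return (round(255 * (1.0 - s)), round(255 * s), 0)


print(confidence_color(1.0))  # (0, 255, 0)
print(confidence_color(0.0))  # (255, 0, 0)
```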

Current Keyword

Dictator will remember the current keyword, and automatically apply it to each dictation. You can enter an initial current keyword from the Action Bar (see below).

The Current Keyword field auto-completes from previously entered keywords.

Main Screen

This is the opening screen, displayed when you start the app from its launcher icon.

Layout

The Main Screen is divided into these areas:

The Action Bar is the Android 4 UI for accessing frequently-used actions.
The Current Item displays the most-recent dictation.
The Item List shows your items sorted as selected on the Action Bar.
The Statistics View shows Word and Item Counts and a Tag Cloud (enabled in Settings).

Action Bar

The Action Bar provides access to these functions:

Navigation Mode: control the sort order of dictations: Date, Title, Keywords.
New Item (enabled in Settings): start a new dictation. When completed, this becomes the Current Item.
Extend Item (enabled in Settings): extend the Current Item, if one exists.
Open Mic: start Open Mic recording mode.
Current Keyword: activate the Current Keyword edit text to enter or change the Current Keyword.
Search: activate the standard Search Widget to search dictations.
Settings: launch the Settings Screen.

Context Menu

Activate by long-clicking on a dictation item. The commands available depend on whether it was extended.

For non-extended items:

Send: Send this item.
Extend: Extend this item. If successful, the other commands will become available.
Edit: Go to the Edit Item screen.
More Results: Display the More Results list.
Delete: Delete this item. Only displays if there is no Audio content.

For extended items, this additional command is available:

Undo: Delete the extended text.

Quick Keys

You can use the Volume Keys for these functions:

Up: Start a new dictation. Only the Body field is recorded. If successful, this becomes the Current Item.
Down: If there is a Current Item, start the Extend command.

During the recording prompt, you can press either Volume Key to cancel recording.
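The Quick Keys rules above amount to a small decision table. The following sketch is illustrative only; the function name and the string labels for keys and actions are assumptions, not identifiers from the app.

```python
from typing import Optional


def quick_key_action(key: str, recording: bool, has_current_item: bool) -> Optional[str]:
    """Decide what a Volume Key press does on the Main Screen.

    key is "up" or "down" (labels chosen for this sketch).
    """
    if recording:
        return "cancel"   # either key cancels during the recording prompt
    if key == "up":
        return "new"      # start a new dictation
    if key == "down" and has_current_item:
        return "extend"   # extend the Current Item
    return None           # Down with no Current Item does nothing


print(quick_key_action("down", recording=False, has_current_item=True))  # extend
```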

Shake

You can shake the device for these functions:

If no Current Item is set, start a New Dictation.
If a Current Item is set, extend it.
While recording, shake to cancel.

Shake the device two times within one second to activate. Dictator provides audio or haptic feedback (depending on Ringer Mode) to indicate it recognized the shake.
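The "two shakes within one second" rule can be sketched as a tiny state machine. This is a simplified model, assuming each individual shake has already been detected from the accelerometer and is reported with a timestamp in seconds; `ShakeDetector` is a hypothetical name.

```python
from typing import Optional


class ShakeDetector:
    """Fires when two shakes arrive within `window` seconds of each other."""

    def __init__(self, window: float = 1.0) -> None:
        self.window = window
        self._last: Optional[float] = None  # time of the pending first shake

    def on_shake(self, timestamp: float) -> bool:
        """Record one shake; return True if it completes a two-shake gesture."""
        if self._last is not None and timestamp - self._last <= self.window:
            self._last = None  # reset so the next shake starts a new gesture
            return True
        self._last = timestamp  # too late (or first shake): start over from here
        return False


detector = ShakeDetector()
print(detector.on_shake(0.0))  # False
print(detector.on_shake(0.6))  # True
```

Resetting after a successful gesture is a design choice in this sketch: it keeps a third quick shake from immediately triggering a second action.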

Proximity Sensor

If your device has a proximity sensor, you can activate it in the "near" position to perform the same actions as Shake. Engage the proximity sensor by holding the device up to your face, or otherwise activate it by covering the sensor.

Dictator provides audio or haptic feedback (depending on Ringer Mode) to indicate it recognized the sensor.

Dictation

Regardless of which screen you are in, the dictation recording process is the same. The settings for recording are in the Settings->General page.

Trigger Recording

There are a number of ways to trigger recording, each with a corresponding setting:

Command Buttons: these appear on the Action Bar.
Volume Keys: the meaning of Up/Down is screen-specific; Up always starts recording.
Shake: shake the device 2 times within 1 second to trigger.
Proximity Sensor: make the sensor read "near" to trigger.

Dictator briefly displays a popup showing the active triggers when the Main Screen displays.

Recording Process

Once you have triggered recording, the process proceeds as follows:

If enabled, an earcon plays to confirm activation.
The Audio view displays during recording. This is your prompt to speak. The device also Vibrates at this time, if enabled.
Speak your dictation, or Cancel Recording via the Audio View or an enabled trigger.
The Readback View displays the Speech Recognition results for the current prompt.
If enabled, read back the number of More Results.

The device's Ringer Mode is checked before playing any audio. Audio is played only in Normal Ringer Mode, unless the Ignore Ringer Mode setting is enabled. Haptic feedback is performed regardless of Ringer Mode.
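The Ringer Mode rule can be summarized in a short function. This is a sketch of the rule as stated in the paragraph above; the mode labels and the function name are assumptions, and the separate Vibrate settings are left out for brevity.

```python
def feedback_channels(ringer_mode: str, ignore_ringer_mode: bool) -> set[str]:
    """Return which feedback channels are used for a given Ringer Mode.

    ringer_mode is "normal", "vibrate", or "silent" (labels chosen for
    this sketch).
    """
    channels = {"haptic"}  # haptic feedback does not depend on Ringer Mode
    if ringer_mode == "normal" or ignore_ringer_mode:
        channels.add("audio")  # audio only in Normal mode, unless overridden
    return channels


print(sorted(feedback_channels("silent", ignore_ringer_mode=False)))  # ['haptic']
print(sorted(feedback_channels("silent", ignore_ringer_mode=True)))   # ['audio', 'haptic']
```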

Open Mic

This screen gives you access to long-duration, hands-free dictation-taking. Speech Recognition is invoked as quickly as possible, over and over, until you stop it.

Speech Recognition is started immediately upon opening the Screen, unless you have enabled Shake-to-Start. There is an optional Earcon for the start and end of audio recording.

This Screen keeps your device's display active. This is required to use Speech Recognition.

The list of dictations is not saved; you must manually select and save items.

Items recorded in this screen are filled in as follows:

Title: Elapsed time (hours/minutes/seconds/ms) from start of Screen, e.g. "00:01:02.321".
Keywords: Timestamp of start of Screen, e.g. "Tue Mar 08 2011 12:01:01 EST". This collates all the items from this session together in the Keyword sort order.
Body: Results of Speech Recognition.
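The field formats above can be reproduced with the standard library. The sketch below is an approximation under stated assumptions: the function names are hypothetical, the item is modeled as a plain dictionary, and the day and month names depend on the active locale.

```python
from datetime import datetime, timedelta, timezone


def format_elapsed(elapsed: timedelta) -> str:
    """Format elapsed time as HH:MM:SS.mmm, e.g. "00:01:02.321"."""
    total_ms = elapsed // timedelta(milliseconds=1)  # whole milliseconds
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    seconds, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d}.{ms:03d}"


def open_mic_item(session_start: datetime, now: datetime, text: str) -> dict[str, str]:
    """Build the three fields of an Open Mic item (hypothetical helper)."""
    return {
        "title": format_elapsed(now - session_start),
        # Same value for every item in the session, so they sort together.
        "keywords": session_start.strftime("%a %b %d %Y %H:%M:%S %Z"),
        "body": text,
    }


est = timezone(timedelta(hours=-5), "EST")
start = datetime(2011, 3, 8, 12, 1, 1, tzinfo=est)
later = start + timedelta(minutes=1, seconds=2, milliseconds=321)
item = open_mic_item(start, later, "hello world")
print(item["title"])     # 00:01:02.321
print(item["keywords"])  # Tue Mar 08 2011 12:01:01 EST (in the default C locale)
```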

You may get better performance when connected to a WiFi network.

Because Speech Recognition may use the network, there are periods where transcription is not possible, i.e. while waiting for the results to be processed and sent back over the network.

Commands

Context Menu

Activate by long-clicking on a list item:

More Results: select from More Results.
Merge Up: merge with next-higher time index. The selected item is deleted from the list and the database if it was saved. The target item is re-saved if it was previously saved.
Merge Down: merge with next-lower time index. The selected item is deleted from the list and the database if it was saved. The target item is re-saved if it was previously saved.
Save: save item to the database.
Delete: delete item from the list and the database if it was saved.
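The Merge Up and Merge Down commands can be sketched on a plain list of texts. This is a simplified model with several assumptions: the list is ordered by ascending time index, the merged text keeps the spoken order, and the database steps (deleting and re-saving) are omitted. The function name `merge` is hypothetical.

```python
def merge(items: list[str], selected: int, direction: str) -> list[str]:
    """Merge the selected item into its neighbor and return the new list.

    items is assumed to be ordered by ascending time index;
    direction is "up" (next-higher index) or "down" (next-lower index).
    """
    target = selected + 1 if direction == "up" else selected - 1
    if not 0 <= target < len(items):
        return list(items)  # no neighbor in that direction: nothing to do
    first, second = sorted((selected, target))
    result = list(items)
    result[target] = f"{items[first]} {items[second]}"  # keep spoken order
    del result[selected]  # the selected item is removed from the list
    return result


print(merge(["one", "two", "three"], 0, "up"))    # ['one two', 'three']
print(merge(["one", "two", "three"], 2, "down"))  # ['one', 'two three']
```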

Action Bar

Resume: Starts recording after you have previously stopped it (see above).
Save All: Saves all items.

Quick Record Screen

This is a screen launched from the special Quick Record shortcut you can add to your Home Screen, via the standard means for your device.

Quick Record is designed to take a single dictation quickly, optionally let you access the More Results, and then close itself when a timer expires.

Recording starts immediately, unless you have Shake or Proximity triggers active.

When recording completes, you briefly get an Action Mode with these commands:

Delete: delete this dictation.
Retry: retry recording, discarding the previous entry.

Accessing any command from the Action Mode cancels the exit timer.

Settings Screen

This screen gives you access to application-wide settings.

Some settings display only on appropriate devices.

About

A link to the eScape Mobile Forum on The Internet.

General

These settings control the common behavior of the dictation flow across all the screens.

Use Custom Languages

Enables the Languages setting to select non-default Speech Recognition and Text languages.

Languages

Screen to set the custom languages. Both languages should match!

Custom Volume Level

Enables the Volume Level settings. Otherwise, the current stream volume is used.

Earcon (Prompt) Volume

Control the prompt tone volume.

Read-back Volume

Control the Text-to-Speech volume.

Ignore Ringer Mode

Plays audio feedback even when your device is on Silent or Vibrate. When this setting is disabled, audio feedback is suppressed in those modes and only haptic feedback is used.

Volume Keys

Whether to enable the Volume Up and Down keys for recording features.

Also enables either Volume Key to cancel recording.

Shake Device

Whether to enable the Shake trigger for recording features.

Shake device 2 times within 1 second to activate.

Proximity Sensor

Whether to enable the Proximity sensor for recording features.

Sensor must read in the near position, typically up to your face as if taking a phone call.

Read-back Speech

Whether to read the top Speech Recognition result back to you.

Prompt For Speech

Whether to play an Earcon at the start of dictation.

Read Results Count

Whether to read back the number of More Results.

Vibrate on Speech

Vibrate when it is time to speak.

Vibrate on Error

Vibrate when there is a Speech Recognition error.

Main Screen

Display Commands

Whether to display the trigger commands on the Action Bar.

Display Stats

Whether to display the Stats Panel (across Bottom).

Display Minimum Frequency

Whether to display tags whose frequency is the minimum value (typically 1). This is ignored if minimum equals maximum frequency.

Max Tags

The maximum number of tags to display. Tags display in rows of 5.
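The two tag settings above interact as sketched below. This is an illustration, not the app's code: the function name is hypothetical, and keeping the most frequent tags first when Max Tags cuts the list is an assumption.

```python
from collections import Counter


def tag_cloud_rows(keywords: list[str], show_min_frequency: bool,
                   max_tags: int, per_row: int = 5) -> list[list[str]]:
    """Select tags for the Tag Cloud and lay them out in rows."""
    counts = Counter(keywords)
    if not counts:
        return []
    lowest, highest = min(counts.values()), max(counts.values())
    tags = counts.most_common()  # most frequent first (assumed ordering)
    # Hide minimum-frequency tags, unless every tag has the same frequency.
    if not show_min_frequency and lowest != highest:
        tags = [(tag, n) for tag, n in tags if n > lowest]
    names = [tag for tag, _ in tags[:max_tags]]
    return [names[i:i + per_row] for i in range(0, len(names), per_row)]


words = ["work", "work", "work", "home", "home", "misc"]
print(tag_cloud_rows(words, show_min_frequency=False, max_tags=10))
# [['work', 'home']]
```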

Open Mic Screen

Log Silence

Whether to make entries when silence is detected.

Log Errors

Whether to make entries when errors occur.

Quick Record Screen

Auto-save Timer

Set the number of seconds to wait before automatically closing the screen. If you interact with the screen, this timer is cancelled.

Global Search

Newer versions of Android no longer support Global Search of installed applications.

All fields are searchable. Selecting a result item launches the Edit Item screen.

Enable Suggestions

Newly installed applications are disabled in Global Search by default! To enable search suggestions from Dictator:

Go to the device's Home Screen.
Press the Search Key. This launches the Search Screen.
Press the Menu Key to display the Options menu.
Select Search Settings.
Select Searchable Items.
Enable Dictator, and any other applications in the list that you want to search.

Searching

You can search Dictator the following ways:

Via a Search Widget on your Home Screen. Depending on how you set up your widget, you may have to select Dictator from a list of available sources.
From within Dictator, using the Search key.

Audio Capture

Audio Capture is currently disabled.

On devices that provide audio data during Speech Recognition, the Audio Capture feature can save that data as audio files.

Audio is stored in Ogg Vorbis format, 8 kHz VBR.

Audio files are stored on your External Storage, in the Public Music folder. Audio files will remain after Dictator is uninstalled.

Audio Playback

Where available, Dictator displays a simple UI for audio playback. The audio stream used for playback is Ringtone.

Add Media

After saving audio files, you can optionally have them added to your device's Media Player.

Media files are saved with the following metadata:

Genre: Dictation.
Album: Dictator Audio Capture.
Artist: Dictator by eScape.

Table of Contents

What's New

2.0.3

Overview

Components

Dictator Components

Dictation Data Fields

Device Components

Common Visual Elements

Audio View

Encoder View

Playback View

Readback View

More Results

Example

Confidence Score

Current Keyword

Main Screen

Layout

Action Bar

Context Menu

Quick Keys

Shake

Proximity Sensor

Dictation

Trigger Recording

Recording Process

Open Mic

Commands

Context Menu

Action Bar

Quick Record Screen

Settings Screen

About

General

Use Custom Languages

Languages

Custom Volume Level

Earcon (Prompt) Volume

Read-back Volume

Ignore Ringer Mode

Volume Keys

Shake Device

Proximity Sensor

Read-back Speech

Prompt For Speech

Read Results Count

Vibrate on Speech

Vibrate on Error

Main Screen

Display Commands

Display Stats

Display Minimum Frequency

Max Tags

Open Mic Screen

Log Silence

Log Errors

Quick Record Screen

Auto-save Timer

Global Search

Enable Suggestions

Searching

Audio Capture

Audio Playback

Add Media