Types for Google Cloud Vision API Client#

class google.cloud.vision_v1p2beta1.types.AnnotateFileResponse#

Response to a single file annotation request. A file may contain one or more images, which individually have their own responses.

input_config#

Information about the file for which this response is generated.

responses#

Individual responses to images found within the file.


class google.cloud.vision_v1p2beta1.types.AnnotateImageRequest#

Request for performing Google Cloud Vision API tasks over a user-provided image, with user-requested features.

image#

The image to be processed.

features#

Requested features.

image_context#

Additional context that may accompany the image.

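A minimal sketch of constructing such a request with the protobuf-based constructors in this module; the bucket URI, feature choice, and max_results value are illustrative placeholders, and the enums module is assumed to mirror the feature types listed here::

    from google.cloud import vision_v1p2beta1 as vision

    # Hypothetical example: request label detection on one Cloud Storage
    # image, capping the number of returned labels at 10.
    request = vision.types.AnnotateImageRequest(
        image=vision.types.Image(
            source=vision.types.ImageSource(image_uri="gs://my-bucket/photo.jpg"),
        ),
        features=[
            vision.types.Feature(
                type=vision.enums.Feature.Type.LABEL_DETECTION,
                max_results=10,
            ),
        ],
    )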

class google.cloud.vision_v1p2beta1.types.AnnotateImageResponse#

Response to an image annotation request.

face_annotations#

If present, face detection has completed successfully.

landmark_annotations#

If present, landmark detection has completed successfully.

logo_annotations#

If present, logo detection has completed successfully.

label_annotations#

If present, label detection has completed successfully.

text_annotations#

If present, text (OCR) detection has completed successfully.

full_text_annotation#

If present, text (OCR) detection or document (OCR) text detection has completed successfully. This annotation provides the structural hierarchy for the OCR detected text.

safe_search_annotation#

If present, safe-search annotation has completed successfully.

image_properties_annotation#

If present, image properties were extracted successfully.

crop_hints_annotation#

If present, crop hints have completed successfully.

web_detection#

If present, web detection has completed successfully.

error#

If set, represents the error message for the operation. Note that filled-in image annotations are guaranteed to be correct, even when error is set.

context#

If present, contextual information is needed to understand where this image comes from.

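A sketch of reading a response; here ``response`` is assumed to be an AnnotateImageResponse obtained from an ImageAnnotatorClient call::

    # Each *_annotations field is populated only when the corresponding
    # feature was requested and completed successfully.
    for label in response.label_annotations:
        print(label.description, label.score)

    # Per the note on `error` above, filled-in annotations remain valid
    # even when an error is set, so both can be inspected.
    if response.error.message:
        print("partial failure:", response.error.message)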

class google.cloud.vision_v1p2beta1.types.Any#
type_url#

Field google.protobuf.Any.type_url

value#

Field google.protobuf.Any.value

class google.cloud.vision_v1p2beta1.types.AsyncAnnotateFileRequest#

An offline file annotation request.

input_config#

Required. Information about the input file.

features#

Required. Requested features.

image_context#

Additional context that may accompany the image(s) in the file.

output_config#

Required. The desired output location and metadata (e.g. format).


class google.cloud.vision_v1p2beta1.types.AsyncAnnotateFileResponse#

The response for a single offline file annotation request.

output_config#

The output location and metadata from AsyncAnnotateFileRequest.


class google.cloud.vision_v1p2beta1.types.AsyncBatchAnnotateFilesRequest#

Multiple async file annotation requests are batched into a single service call.

requests#

Individual async file annotation requests for this batch.

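A sketch of assembling and submitting such a batch, assuming the client exposes an async_batch_annotate_files method returning a long-running operation (as in other versions of this library); all URIs are placeholders::

    from google.cloud import vision_v1p2beta1 as vision

    client = vision.ImageAnnotatorClient()
    file_request = vision.types.AsyncAnnotateFileRequest(
        input_config=vision.types.InputConfig(
            gcs_source=vision.types.GcsSource(uri="gs://my-bucket/report.pdf"),
            mime_type="application/pdf",
        ),
        features=[
            vision.types.Feature(
                type=vision.enums.Feature.Type.DOCUMENT_TEXT_DETECTION,
            ),
        ],
        output_config=vision.types.OutputConfig(
            gcs_destination=vision.types.GcsDestination(uri="gs://my-bucket/ocr/"),
            batch_size=20,
        ),
    )
    # result() blocks until the offline annotation finishes; the JSON
    # output then lives under the gcs_destination prefix.
    operation = client.async_batch_annotate_files(requests=[file_request])
    operation.result(timeout=300)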

class google.cloud.vision_v1p2beta1.types.AsyncBatchAnnotateFilesResponse#

Response to an async batch file annotation request.

responses#

The list of file annotation responses, one for each request in AsyncBatchAnnotateFilesRequest.


class google.cloud.vision_v1p2beta1.types.BatchAnnotateImagesRequest#

Multiple image annotation requests are batched into a single service call.

requests#

Individual image annotation requests for this batch.

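A sketch of the corresponding synchronous call, assuming a batch_annotate_images method on ImageAnnotatorClient; ``client`` and ``request`` are as in the earlier sketches::

    batch = client.batch_annotate_images(requests=[request])
    # One AnnotateImageResponse per request, in the same order.
    for response in batch.responses:
        print(response)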

class google.cloud.vision_v1p2beta1.types.BatchAnnotateImagesResponse#

Response to a batch image annotation request.

responses#

Individual responses to image annotation requests within the batch.


class google.cloud.vision_v1p2beta1.types.Block#

Logical element on the page.

property#

Additional information detected for the block.

bounding_box#

The bounding box for the block. The vertices are in the order of top-left, top-right, bottom-right, bottom-left. When a rotation of the bounding box is detected the rotation is represented as around the top-left corner as defined when the text is read in the ‘natural’ orientation. For example:

- when the text is horizontal it might look like::

      0----1
      |    |
      3----2

- when it’s rotated 180 degrees around the top-left corner it becomes::

      2----3
      |    |
      1----0

and the vertex order will still be (0, 1, 2, 3).

paragraphs#

List of paragraphs in this block (if this block is of type text).

block_type#

Detected block type (text, image etc) for this block.

confidence#

Confidence of the OCR results on the block. Range [0, 1].


class google.cloud.vision_v1p2beta1.types.BoolValue#
value#

Field google.protobuf.BoolValue.value

class google.cloud.vision_v1p2beta1.types.BoundingPoly#

A bounding polygon for the detected image annotation.

vertices#

The bounding polygon vertices.

normalized_vertices#

The bounding polygon normalized vertices.


class google.cloud.vision_v1p2beta1.types.BytesValue#
value#

Field google.protobuf.BytesValue.value

class google.cloud.vision_v1p2beta1.types.CancelOperationRequest#
name#

Field google.longrunning.CancelOperationRequest.name

class google.cloud.vision_v1p2beta1.types.Color#
alpha#

Field google.type.Color.alpha

blue#

Field google.type.Color.blue

green#

Field google.type.Color.green

red#

Field google.type.Color.red

class google.cloud.vision_v1p2beta1.types.ColorInfo#

Color information consists of RGB channels, score, and the fraction of the image that the color occupies in the image.

color#

RGB components of the color.

score#

Image-specific score for this color. Value in range [0, 1].

pixel_fraction#

The fraction of pixels the color occupies in the image. Value in range [0, 1].


class google.cloud.vision_v1p2beta1.types.CropHint#

Single crop hint that is used to generate a new crop when serving an image.

bounding_poly#

The bounding polygon for the crop region. The coordinates of the bounding box are in the original image’s scale, as returned in ImageParams.

confidence#

Confidence of this being a salient region. Range [0, 1].

importance_fraction#

Fraction of importance of this salient region with respect to the original image.


class google.cloud.vision_v1p2beta1.types.CropHintsAnnotation#

Set of crop hints that are used to generate new crops when serving images.

crop_hints#

Crop hint results.


class google.cloud.vision_v1p2beta1.types.CropHintsParams#

Parameters for crop hints annotation request.

aspect_ratios#

Aspect ratios in floats, representing the ratio of the width to the height of the image. For example, if the desired aspect ratio is 4/3, the corresponding float value should be 1.33333. If not specified, the best possible crop is returned. The number of provided aspect ratios is limited to a maximum of 16; any aspect ratios provided after the 16th are ignored.

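For example, a 4:3 crop is requested as the float 4 / 3 (approximately 1.33333). A minimal sketch, reusing the ``vision`` alias from the earlier examples and attaching the parameters to an ImageContext::

    params = vision.types.CropHintsParams(aspect_ratios=[4 / 3, 16 / 9])
    context = vision.types.ImageContext(crop_hints_params=params)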

class google.cloud.vision_v1p2beta1.types.DeleteOperationRequest#
name#

Field google.longrunning.DeleteOperationRequest.name

class google.cloud.vision_v1p2beta1.types.DominantColorsAnnotation#

Set of dominant colors and their corresponding scores.

colors#

RGB color values with their score and pixel fraction.


class google.cloud.vision_v1p2beta1.types.DoubleValue#
value#

Field google.protobuf.DoubleValue.value

class google.cloud.vision_v1p2beta1.types.EntityAnnotation#

Set of detected entity features.

mid#

Opaque entity ID. Some IDs may be available in Google Knowledge Graph Search API.

locale#

The language code for the locale in which the entity textual description is expressed.

description#

Entity textual description, expressed in its locale language.

score#

Overall score of the result. Range [0, 1].

confidence#

Deprecated. Use ``score`` instead. The accuracy of the entity detection in an image. For example, for an image in which the “Eiffel Tower” entity is detected, this field represents the confidence that there is a tower in the query image. Range [0, 1].

topicality#

The relevancy of the ICA (Image Content Annotation) label to the image. For example, the relevancy of “tower” is likely higher to an image containing the detected “Eiffel Tower” than to an image containing a detected distant towering building, even though the confidence that there is a tower in each image may be the same. Range [0, 1].

bounding_poly#

Image region to which this entity belongs. Not produced for LABEL_DETECTION features.

locations#

The location information for the detected entity. Multiple LocationInfo elements can be present because one location may indicate the location of the scene in the image, and another location may indicate the location of the place where the image was taken. Location information is usually present for landmarks.

properties#

Some entities may have optional user-supplied Property (name/value) fields, such as a score or string that qualifies the entity.


class google.cloud.vision_v1p2beta1.types.FaceAnnotation#

A face annotation object contains the results of face detection.

bounding_poly#

The bounding polygon around the face. The coordinates of the bounding box are in the original image’s scale, as returned in ImageParams. The bounding box is computed to “frame” the face in accordance with human expectations. It is based on the landmarker results. Note that one or more x and/or y coordinates may not be generated in the BoundingPoly (the polygon will be unbounded) if only a partial face appears in the image to be annotated.

fd_bounding_poly#

The fd_bounding_poly bounding polygon is tighter than the boundingPoly, and encloses only the skin part of the face. Typically, it is used to eliminate the face from any image analysis that detects the “amount of skin” visible in an image. It is not based on the landmarker results, only on the initial face detection, hence the fd (face detection) prefix.

landmarks#

Detected face landmarks.

roll_angle#

Roll angle, which indicates the amount of clockwise/anti-clockwise rotation of the face relative to the image vertical about the axis perpendicular to the face. Range [-180,180].

pan_angle#

Yaw angle, which indicates the leftward/rightward angle that the face is pointing relative to the vertical plane perpendicular to the image. Range [-180,180].

tilt_angle#

Pitch angle, which indicates the upwards/downwards angle that the face is pointing relative to the image’s horizontal plane. Range [-180,180].

detection_confidence#

Detection confidence. Range [0, 1].

landmarking_confidence#

Face landmarking confidence. Range [0, 1].

joy_likelihood#

Joy likelihood.

sorrow_likelihood#

Sorrow likelihood.

anger_likelihood#

Anger likelihood.

surprise_likelihood#

Surprise likelihood.

under_exposed_likelihood#

Under-exposed likelihood.

blurred_likelihood#

Blurred likelihood.

headwear_likelihood#

Headwear likelihood.

class Landmark#

A face-specific landmark (for example, a face feature).

type#

Face landmark type.

position#

Face landmark position.

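A sketch of reading these fields from a response; it assumes the module's enums expose Likelihood values comparable as integers, as plain protobuf enums are::

    for face in response.face_annotations:
        print(face.detection_confidence, face.roll_angle, face.pan_angle)
        if face.joy_likelihood >= vision.enums.Likelihood.LIKELY:
            print("probably smiling")
        for landmark in face.landmarks:
            pos = landmark.position
            print(landmark.type, pos.x, pos.y, pos.z)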

class google.cloud.vision_v1p2beta1.types.Feature#

The type of Google Cloud Vision API detection to perform, and the maximum number of results to return for that type. Multiple Feature objects can be specified in the features list.

type#

The feature type.

max_results#

Maximum number of results of this type. Does not apply to TEXT_DETECTION, DOCUMENT_TEXT_DETECTION, or CROP_HINTS.

model#

Model to use for the feature. Supported values: “builtin/stable” (the default if unset) and “builtin/latest”.


class google.cloud.vision_v1p2beta1.types.FloatValue#
value#

Field google.protobuf.FloatValue.value

class google.cloud.vision_v1p2beta1.types.GcsDestination#

The Google Cloud Storage location where the output will be written to.

uri#

Google Cloud Storage URI where the results will be stored. Results will be in JSON format and preceded by its corresponding input URI. This field can either represent a single file, or a prefix for multiple outputs. Prefixes must end in a /. Examples:

- File: gs://bucket-name/filename.json
- Prefix: gs://bucket-name/prefix/here/
- File: gs://bucket-name/prefix/here

If multiple outputs, each response is still AnnotateFileResponse, each of which contains some subset of the full list of AnnotateImageResponse. Multiple outputs can happen if, for example, the output JSON is too large and overflows into multiple sharded files.


class google.cloud.vision_v1p2beta1.types.GcsSource#

The Google Cloud Storage location where the input will be read from.

uri#

Google Cloud Storage URI for the input file. This must only be a GCS object. Wildcards are not currently supported.


class google.cloud.vision_v1p2beta1.types.GetOperationRequest#
name#

Field google.longrunning.GetOperationRequest.name

class google.cloud.vision_v1p2beta1.types.Image#

Client image to perform Google Cloud Vision API tasks over.

content#

Image content, represented as a stream of bytes. Note: As with all bytes fields, protobuffers use a pure binary representation, whereas JSON representations use base64.

source#

Google Cloud Storage image location, or publicly-accessible image URL. If both content and source are provided for an image, content takes precedence and is used to perform the image annotation request.

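A sketch of the two ways to populate an Image, reusing the ``vision`` alias from earlier; the local filename is a placeholder, and as noted above, content takes precedence if both fields are set::

    # 1. Inline bytes: raw binary in protobuf, base64 in the JSON form.
    with open("photo.jpg", "rb") as f:
        image = vision.types.Image(content=f.read())

    # 2. By reference, via an ImageSource (Cloud Storage or public URL).
    image = vision.types.Image(
        source=vision.types.ImageSource(image_uri="gs://my-bucket/photo.jpg"),
    )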

class google.cloud.vision_v1p2beta1.types.ImageAnnotationContext#

If an image was produced from a file (e.g. a PDF), this message gives information about the source of that image.

uri#

The URI of the file used to produce the image.

page_number#

If the file was a PDF or TIFF, this field gives the page number within the file used to produce the image.


class google.cloud.vision_v1p2beta1.types.ImageContext#

Image context and/or feature-specific parameters.

lat_long_rect#

Not used.

language_hints#

List of languages to use for TEXT_DETECTION. In most cases, an empty value yields the best results since it enables automatic language detection. For languages based on the Latin alphabet, setting language_hints is not needed. In rare cases, when the language of the text in the image is known, setting a hint will help get better results (although it will be a significant hindrance if the hint is wrong). Text detection returns an error if one or more of the specified languages is not one of the supported languages.

crop_hints_params#

Parameters for crop hints annotation request.

web_detection_params#

Parameters for web detection.


class google.cloud.vision_v1p2beta1.types.ImageProperties#

Stores image properties, such as dominant colors.

dominant_colors#

If present, dominant colors completed successfully.

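A sketch of reading the dominant colors from a response::

    dominant = response.image_properties_annotation.dominant_colors
    for info in dominant.colors:
        rgb = (info.color.red, info.color.green, info.color.blue)
        print(rgb, info.score, info.pixel_fraction)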

class google.cloud.vision_v1p2beta1.types.ImageSource#

External image source (Google Cloud Storage or web URL image location).

gcs_image_uri#

Use ``image_uri`` instead. The Google Cloud Storage URI of the form gs://bucket_name/object_name. Object versioning is not supported. See Google Cloud Storage Request URIs for more info.

image_uri#

The URI of the source image. Can be either:

1. A Google Cloud Storage URI of the form gs://bucket_name/object_name. Object versioning is not supported. See Google Cloud Storage Request URIs for more info.
2. A publicly-accessible image HTTP/HTTPS URL. When fetching images from HTTP/HTTPS URLs, Google cannot guarantee that the request will be completed. Your request may fail if the specified host denies the request (e.g. due to request throttling or DoS prevention), or if Google throttles requests to the site for abuse prevention. You should not depend on externally-hosted images for production applications.

When both gcs_image_uri and image_uri are specified, image_uri takes precedence.


class google.cloud.vision_v1p2beta1.types.InputConfig#

The desired input location and metadata.

gcs_source#

The Google Cloud Storage location to read the input from.

mime_type#

The type of the file. Currently only “application/pdf” and “image/tiff” are supported. Wildcards are not supported.


class google.cloud.vision_v1p2beta1.types.Int32Value#
value#

Field google.protobuf.Int32Value.value

class google.cloud.vision_v1p2beta1.types.Int64Value#
value#

Field google.protobuf.Int64Value.value

class google.cloud.vision_v1p2beta1.types.LatLng#
latitude#

Field google.type.LatLng.latitude

longitude#

Field google.type.LatLng.longitude

class google.cloud.vision_v1p2beta1.types.LatLongRect#

Rectangle determined by min and max LatLng pairs.

min_lat_lng#

Min lat/long pair.

max_lat_lng#

Max lat/long pair.


class google.cloud.vision_v1p2beta1.types.ListOperationsRequest#
filter#

Field google.longrunning.ListOperationsRequest.filter

name#

Field google.longrunning.ListOperationsRequest.name

page_size#

Field google.longrunning.ListOperationsRequest.page_size

page_token#

Field google.longrunning.ListOperationsRequest.page_token

class google.cloud.vision_v1p2beta1.types.ListOperationsResponse#
next_page_token#

Field google.longrunning.ListOperationsResponse.next_page_token

operations#

Field google.longrunning.ListOperationsResponse.operations

class google.cloud.vision_v1p2beta1.types.LocationInfo#

Detected entity location information.

lat_lng#

lat/long location coordinates.


class google.cloud.vision_v1p2beta1.types.NormalizedVertex#

A vertex represents a 2D point in the image. NormalizedVertex coordinates are relative to the original image and range from 0 to 1.

x#

X coordinate.

y#

Y coordinate.


class google.cloud.vision_v1p2beta1.types.Operation#
deserialize()#

Creates a new message instance from the given serialized data.

done#

Field google.longrunning.Operation.done

error#

Field google.longrunning.Operation.error

metadata#

Field google.longrunning.Operation.metadata

name#

Field google.longrunning.Operation.name

response#

Field google.longrunning.Operation.response

class google.cloud.vision_v1p2beta1.types.OperationInfo#
metadata_type#

Field google.longrunning.OperationInfo.metadata_type

response_type#

Field google.longrunning.OperationInfo.response_type

class google.cloud.vision_v1p2beta1.types.OperationMetadata#

Contains metadata for the BatchAnnotateImages operation.

state#

Current state of the batch operation.

create_time#

The time when the batch request was received.

update_time#

The time when the operation result was last updated.


class google.cloud.vision_v1p2beta1.types.OutputConfig#

The desired output location and metadata.

gcs_destination#

The Google Cloud Storage location to write the output(s) to.

batch_size#

The maximum number of response protos to put into each output JSON file on GCS. The valid range is [1, 100]. If not specified, the default value is 20. For example, for one PDF file with 100 pages, 100 response protos will be generated. If batch_size = 20, then 5 JSON files, each containing 20 response protos, will be written under the prefix gcs_destination.uri. Currently, batch_size only applies to GcsDestination, with potential future support for other output configurations.

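To make the sharding arithmetic concrete: a file that yields N response protos is written out as ceil(N / batch_size) JSON shards. A quick check of the docstring's example::

    import math

    responses = 100    # one response proto per page of a 100-page PDF
    batch_size = 20
    shards = math.ceil(responses / batch_size)
    assert shards == 5  # five JSON files under gcs_destination.uri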

class google.cloud.vision_v1p2beta1.types.Page#

Detected page from OCR.

property#

Additional information detected on the page.

width#

Page width. For PDFs the unit is points. For images (including TIFFs) the unit is pixels.

height#

Page height. For PDFs the unit is points. For images (including TIFFs) the unit is pixels.

blocks#

List of blocks of text, images etc on this page.

confidence#

Confidence of the OCR results on the page. Range [0, 1].


class google.cloud.vision_v1p2beta1.types.Paragraph#

Structural unit of text representing a number of words in certain order.

property#

Additional information detected for the paragraph.

bounding_box#

The bounding box for the paragraph. The vertices are in the order of top-left, top-right, bottom-right, bottom-left. When a rotation of the bounding box is detected the rotation is represented as around the top-left corner as defined when the text is read in the ‘natural’ orientation. For example:

* when the text is horizontal it might look like::

      0----1
      |    |
      3----2

* when it’s rotated 180 degrees around the top-left corner it becomes::

      2----3
      |    |
      1----0

and the vertex order will still be (0, 1, 2, 3).

words#

List of words in this paragraph.

confidence#

Confidence of the OCR results for the paragraph. Range [0, 1].


class google.cloud.vision_v1p2beta1.types.Position#

A 3D position in the image, used primarily for Face detection landmarks. A valid Position must have both x and y coordinates. The position coordinates are in the same scale as the original image.

x#

X coordinate.

y#

Y coordinate.

z#

Z coordinate (or depth).


class google.cloud.vision_v1p2beta1.types.Property#

A Property consists of a user-supplied name/value pair.

name#

Name of the property.

value#

Value of the property.

uint64_value#

Value of numeric properties.


class google.cloud.vision_v1p2beta1.types.SafeSearchAnnotation#

Set of features pertaining to the image, computed by computer vision methods over safe-search verticals (for example, adult, spoof, medical, violence).

adult#

Represents the adult content likelihood for the image. Adult content may contain elements such as nudity, pornographic images or cartoons, or sexual activities.

spoof#

Spoof likelihood. The likelihood that a modification was made to the image’s canonical version to make it appear funny or offensive.

medical#

Likelihood that this is a medical image.

violence#

Likelihood that this image contains violent content.

racy#

Likelihood that the request image contains racy content. Racy content may include (but is not limited to) skimpy or sheer clothing, strategically covered nudity, lewd or provocative poses, or close-ups of sensitive body areas.

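A sketch of gating on these verticals; the LIKELY threshold is an illustrative choice, and integer comparison of Likelihood values via the module's enums is assumed::

    safe = response.safe_search_annotation
    flagged = any(
        value >= vision.enums.Likelihood.LIKELY
        for value in (safe.adult, safe.violence, safe.racy)
    )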

class google.cloud.vision_v1p2beta1.types.Status#
code#

Field google.rpc.Status.code

details#

Field google.rpc.Status.details

message#

Field google.rpc.Status.message

class google.cloud.vision_v1p2beta1.types.StringValue#
value#

Field google.protobuf.StringValue.value

class google.cloud.vision_v1p2beta1.types.Symbol#

A single symbol representation.

property#

Additional information detected for the symbol.

bounding_box#

The bounding box for the symbol. The vertices are in the order of top-left, top-right, bottom-right, bottom-left. When a rotation of the bounding box is detected the rotation is represented as around the top-left corner as defined when the text is read in the ‘natural’ orientation. For example:

* when the text is horizontal it might look like::

      0----1
      |    |
      3----2

* when it’s rotated 180 degrees around the top-left corner it becomes::

      2----3
      |    |
      1----0

and the vertex order will still be (0, 1, 2, 3).

text#

The actual UTF-8 representation of the symbol.

confidence#

Confidence of the OCR results for the symbol. Range [0, 1].


class google.cloud.vision_v1p2beta1.types.TextAnnotation#

TextAnnotation contains a structured representation of OCR extracted text. The hierarchy of an OCR extracted text structure is: TextAnnotation -> Page -> Block -> Paragraph -> Word -> Symbol. Each structural component, starting from Page, may further have its own properties. Properties describe detected languages, breaks etc. Please refer to the [TextAnnotation.TextProperty][google.cloud.vision.v1p2beta1.TextAnnotation.TextProperty] message definition below for more detail.

pages#

List of pages detected by OCR.

text#

UTF-8 text detected on the pages.

class DetectedBreak#

Detected start or end of a structural component.

type#

Detected break type.

is_prefix#

True if break prepends the element.


class DetectedLanguage#

Detected language for a structural component.

language_code#

The BCP-47 language code, such as “en-US” or “sr-Latn”. For more information, see http://www.unicode.org/reports/tr35/#Unicode_locale_identifier.

confidence#

Confidence of detected language. Range [0, 1].


class TextProperty#

Additional information detected on the structural component.

detected_languages#

A list of detected languages together with confidence.

detected_break#

Detected start or end of a text segment.

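A sketch of walking the documented hierarchy from a full_text_annotation down to individual symbols::

    annotation = response.full_text_annotation
    print(annotation.text)  # the flat UTF-8 rendering of all pages
    for page in annotation.pages:
        for block in page.blocks:
            for paragraph in block.paragraphs:
                for word in paragraph.words:
                    text = "".join(symbol.text for symbol in word.symbols)
                    print(text, word.confidence)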

class google.cloud.vision_v1p2beta1.types.Timestamp#
nanos#

Field google.protobuf.Timestamp.nanos

seconds#

Field google.protobuf.Timestamp.seconds

class google.cloud.vision_v1p2beta1.types.UInt32Value#
value#

Field google.protobuf.UInt32Value.value

class google.cloud.vision_v1p2beta1.types.UInt64Value#
value#

Field google.protobuf.UInt64Value.value

class google.cloud.vision_v1p2beta1.types.Vertex#

A vertex represents a 2D point in the image. Vertex coordinates are in the same scale as the original image.

x#

X coordinate.

y#

Y coordinate.


class google.cloud.vision_v1p2beta1.types.WebDetection#

Relevant information for the image from the Internet.

web_entities#

Deduced entities from similar images on the Internet.

full_matching_images#

Fully matching images from the Internet. Can include resized copies of the query image.

partial_matching_images#

Partial matching images from the Internet. Those images are similar enough to share some key-point features. For example an original image will likely have partial matching for its crops.

pages_with_matching_images#

Web pages containing the matching images from the Internet.

visually_similar_images#

The visually similar image results.

best_guess_labels#

Best guess text labels for the request image.

class WebEntity#

Entity deduced from similar images on the Internet.

entity_id#

Opaque entity ID.

score#

Overall relevancy score for the entity. Not normalized and not comparable across different image queries.

description#

Canonical description of the entity, in English.


class WebImage#

Metadata for online images.

url#

The result image URL.

score#

(Deprecated) Overall relevancy score for the image.


class WebLabel#

Label to provide extra metadata for the web detection.

label#

Label for extra metadata.

language_code#

The BCP-47 language code for label, such as “en-US” or “sr-Latn”. For more information, see http://www.unicode.org/reports/tr35/#Unicode_locale_identifier.


class WebPage#

Metadata for web pages.

url#

The result web page URL.

score#

(Deprecated) Overall relevancy score for the web page.

page_title#

Title for the web page; may contain HTML markup.

full_matching_images#

Fully matching images on the page. Can include resized copies of the query image.

partial_matching_images#

Partial matching images on the page. Those images are similar enough to share some key-point features. For example an original image will likely have partial matching for its crops.

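A sketch of reading web detection results from a response::

    web = response.web_detection
    for entity in web.web_entities:
        # Scores are not normalized and not comparable across queries.
        print(entity.description, entity.score)
    for label in web.best_guess_labels:
        print(label.label, label.language_code)
    for page in web.pages_with_matching_images:
        print(page.url, page.page_title)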

class google.cloud.vision_v1p2beta1.types.WebDetectionParams#

Parameters for web detection request.

include_geo_results#

Whether to include results derived from the geo information in the image.


class google.cloud.vision_v1p2beta1.types.Word#

A word representation.

property#

Additional information detected for the word.

bounding_box#

The bounding box for the word. The vertices are in the order of top-left, top-right, bottom-right, bottom-left. When a rotation of the bounding box is detected the rotation is represented as around the top-left corner as defined when the text is read in the ‘natural’ orientation. For example:

* when the text is horizontal it might look like::

      0----1
      |    |
      3----2

* when it’s rotated 180 degrees around the top-left corner it becomes::

      2----3
      |    |
      1----0

and the vertex order will still be (0, 1, 2, 3).

symbols#

List of symbols in the word. The order of the symbols follows the natural reading order.

confidence#

Confidence of the OCR results for the word. Range [0, 1].
