Loggers
Lightning supports the most popular logging frameworks (TensorBoard, Comet, Weights and Biases, etc.).
To use a logger, simply pass it into the Trainer. Lightning uses TensorBoard by default.
from pytorch_lightning import Trainer
from pytorch_lightning import loggers
tb_logger = loggers.TensorBoardLogger('logs/')
trainer = Trainer(logger=tb_logger)
Choose from any of the others such as MLflow, Comet, Neptune, WandB, …
comet_logger = loggers.CometLogger(save_dir='logs/')
trainer = Trainer(logger=comet_logger)
To use multiple loggers, simply pass in a list or tuple of loggers …
tb_logger = loggers.TensorBoardLogger('logs/')
comet_logger = loggers.CometLogger(save_dir='logs/')
trainer = Trainer(logger=[tb_logger, comet_logger])
Note
All loggers log by default to os.getcwd(). To change the path without creating a logger, set Trainer(default_root_dir='/your/path/to/save/checkpoints').
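For instance, a minimal sketch (reusing the placeholder path above; Lightning still creates its default TensorBoard logger, but everything is written under the given root directory):

from pytorch_lightning import Trainer

trainer = Trainer(default_root_dir='/your/path/to/save/checkpoints')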
Custom Logger
You can implement your own logger by writing a class that inherits from LightningLoggerBase. Use the rank_zero_only() decorator to make sure that only the first process in DDP training logs data.
from pytorch_lightning.utilities import rank_zero_only
from pytorch_lightning.loggers import LightningLoggerBase


class MyLogger(LightningLoggerBase):

    @rank_zero_only
    def log_hyperparams(self, params):
        # params is an argparse.Namespace
        # your code to record hyperparameters goes here
        pass

    @rank_zero_only
    def log_metrics(self, metrics, step):
        # metrics is a dictionary of metric names and values
        # your code to record metrics goes here
        pass

    def save(self):
        # Optional. Any code necessary to save logger data goes here
        pass

    @rank_zero_only
    def finalize(self, status):
        # Optional. Any code that needs to be run after training
        # finishes goes here
        pass
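A minimal usage sketch for the stub above (note that, since experiment is listed below as an abstract property of LightningLoggerBase, a fully concrete logger would also need to implement it):

from pytorch_lightning import Trainer

logger = MyLogger()
trainer = Trainer(logger=logger)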
If you write a logger that may be useful to others, please send a pull request to add it to Lightning!
Using loggers
Call the logger anywhere except __init__ in your LightningModule by doing:
from pytorch_lightning import LightningModule


class LitModel(LightningModule):

    def training_step(self, batch, batch_idx):
        # example
        self.logger.experiment.whatever_method_summary_writer_supports(...)

        # example if logger is a tensorboard logger
        self.logger.experiment.add_image('images', grid, 0)
        self.logger.experiment.add_graph(model, images)

    def any_lightning_module_function_or_hook(self):
        self.logger.experiment.add_histogram(...)
Read more in the Experiment Logging use case.
Supported Loggers
class pytorch_lightning.loggers.LightningLoggerBase(agg_key_funcs=None, agg_default_func=numpy.mean)
Bases: abc.ABC

Base class for experiment loggers.
Parameters
    agg_key_funcs (Optional[Mapping[str, Callable[[Sequence[float]], float]]]) – Dictionary which maps a metric name to a function that will aggregate the metric values for the same steps.
    agg_default_func (Callable[[Sequence[float]], float]) – Default function to aggregate metric values. If some metric name is not present in the agg_key_funcs dictionary, then the agg_default_func will be used for aggregation.

Note
The agg_key_funcs and agg_default_func arguments are used only when one logs metrics with the agg_and_log_metrics() method.
_aggregate_metrics(metrics, step=None)
Aggregates metrics.

Returns
    Step and aggregated metrics. The return value could be None, in which case the metrics are added to the aggregation list but not aggregated yet.

_finalize_agg_metrics()
This shall be called before save/close.

static _flatten_dict(params, delimiter='/')
Flatten a hierarchical dict, e.g. {'a': {'b': 'c'}} -> {'a/b': 'c'}.

Returns
    Flattened dict.

Examples
>>> LightningLoggerBase._flatten_dict({'a': {'b': 'c'}})
{'a/b': 'c'}
>>> LightningLoggerBase._flatten_dict({'a': {'b': 123}})
{'a/b': 123}

_reduce_agg_metrics()
Aggregate accumulated metrics.

static _sanitize_params(params)
Returns params with non-primitives converted to strings for logging.

>>> params = {"float": 0.3,
...           "int": 1,
...           "string": "abc",
...           "bool": True,
...           "list": [1, 2, 3],
...           "namespace": Namespace(foo=3),
...           "layer": torch.nn.BatchNorm1d}
>>> import pprint
>>> pprint.pprint(LightningLoggerBase._sanitize_params(params))
{'bool': True,
 'float': 0.3,
 'int': 1,
 'layer': "<class 'torch.nn.modules.batchnorm.BatchNorm1d'>",
 'list': '[1, 2, 3]',
 'namespace': 'Namespace(foo=3)',
 'string': 'abc'}

agg_and_log_metrics(metrics, step=None)
Aggregates and records metrics. This method doesn't log the passed metrics instantaneously; instead, it aggregates them and logs only once the metrics are ready to be logged.
close()
Do any cleanup that is necessary to close an experiment.

Return type
    None

finalize(status)
Do any processing that is necessary to finalize an experiment.

abstract log_hyperparams(params)
Record hyperparameters.

abstract log_metrics(metrics, step=None)
Records metrics. This method logs metrics as soon as it receives them. If you want to aggregate metrics for one specific step, use the agg_and_log_metrics() method.

save()
Save log data.

Return type
    None
update_agg_funcs(agg_key_funcs=None, agg_default_func=numpy.mean)
Update aggregation methods.

Parameters
    agg_key_funcs (Optional[Mapping[str, Callable[[Sequence[float]], float]]]) – Dictionary which maps a metric name to a function that will aggregate the metric values for the same steps.
    agg_default_func (Callable[[Sequence[float]], float]) – Default function to aggregate metric values. If some metric name is not present in the agg_key_funcs dictionary, then the agg_default_func will be used for aggregation.
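For example, a minimal sketch on a concrete logger instance (the metric name and the choice of numpy functions here are illustrative only):

import numpy as np

# reduce same-step 'train_loss' values with min; all other metrics use the default mean
logger.update_agg_funcs(agg_key_funcs={'train_loss': np.min}, agg_default_func=np.mean)

# both values are queued under step 0 and reduced with np.min once the step advances
logger.agg_and_log_metrics({'train_loss': 0.5}, step=0)
logger.agg_and_log_metrics({'train_loss': 0.4}, step=0)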
abstract property experiment
Return the experiment object associated with this logger.
class pytorch_lightning.loggers.LoggerCollection(logger_iterable)
Bases: pytorch_lightning.loggers.base.LightningLoggerBase

The LoggerCollection class is used to iterate all logging actions over the given logger_iterable.

Parameters
    logger_iterable (Iterable[LightningLoggerBase]) – An iterable collection of loggers
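You rarely need to construct this yourself; passing a list of loggers to the Trainer (as shown at the top of this page) has the same effect. A minimal explicit sketch:

from pytorch_lightning import Trainer
from pytorch_lightning.loggers import LoggerCollection, TensorBoardLogger, CometLogger

# calls such as log_metrics() are iterated over each wrapped logger
collection = LoggerCollection([TensorBoardLogger('logs/'), CometLogger(save_dir='logs/')])
trainer = Trainer(logger=collection)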
close()
Do any cleanup that is necessary to close an experiment.

Return type
    None

finalize(status)
Do any processing that is necessary to finalize an experiment.

log_hyperparams(params)
Record hyperparameters.

log_metrics(metrics, step=None)
Records metrics. This method logs metrics as soon as it receives them. If you want to aggregate metrics for one specific step, use the agg_and_log_metrics() method.

save()
Save log data.

Return type
    None

property experiment
Return the experiment object associated with this logger.
class pytorch_lightning.loggers.TensorBoardLogger(save_dir, name='default', version=None, **kwargs)
Bases: pytorch_lightning.loggers.base.LightningLoggerBase

Log to the local file system in TensorBoard format. Implemented using SummaryWriter. Logs are saved to os.path.join(save_dir, name, version). This is the default logger in Lightning; it comes preinstalled.

Example
>>> from pytorch_lightning import Trainer
>>> from pytorch_lightning.loggers import TensorBoardLogger
>>> logger = TensorBoardLogger("tb_logs", name="my_model")
>>> trainer = Trainer(logger=logger)
Parameters
    name (Optional[str]) – Experiment name. Defaults to 'default'. If it is the empty string then no per-experiment subdirectory is used.
    version (Union[int, str, None]) – Experiment version. If version is not specified the logger inspects the save directory for existing versions, then automatically assigns the next available version. If it is a string then it is used as the run-specific subdirectory name, otherwise 'version_${version}' is used.
    **kwargs – Other arguments are passed directly to the SummaryWriter constructor.
finalize(status)
Do any processing that is necessary to finalize an experiment.

log_hyperparams(params, metrics=None)
Record hyperparameters.

log_metrics(metrics, step=None)
Records metrics. This method logs metrics as soon as it receives them. If you want to aggregate metrics for one specific step, use the agg_and_log_metrics() method.

save()
Save log data.

Return type
    None

property experiment
Actual TensorBoard object. To use TensorBoard features in your LightningModule do the following.

Example:
    self.logger.experiment.some_tensorboard_function()

Return type
    SummaryWriter
property log_dir
The directory for this run's TensorBoard checkpoint. By default, it is named 'version_${self.version}', but it can be overridden by passing a string value for the constructor's version parameter instead of None or an int.

property root_dir
Parent directory for all TensorBoard checkpoint subdirectories. If the experiment name parameter is None or the empty string, no experiment subdirectory is used and the checkpoint will be saved in "save_dir/version_dir".
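To make the layout concrete, a minimal sketch (the version shown assumes a fresh save directory, where the logger auto-assigns version 0):

>>> logger = TensorBoardLogger("tb_logs", name="my_model")
>>> logger.root_dir
'tb_logs/my_model'
>>> logger.log_dir
'tb_logs/my_model/version_0'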
class pytorch_lightning.loggers.CometLogger(api_key=None, save_dir=None, workspace=None, project_name=None, rest_api_key=None, experiment_name=None, experiment_key=None, **kwargs)
Bases: pytorch_lightning.loggers.base.LightningLoggerBase

Log using Comet.ml. Install it with pip:

pip install comet-ml

Comet requires either an API key (online mode) or a local directory path (offline mode).

ONLINE MODE

Example
>>> import os
>>> from pytorch_lightning import Trainer
>>> from pytorch_lightning.loggers import CometLogger
>>> # arguments made to CometLogger are passed on to the comet_ml.Experiment class
>>> comet_logger = CometLogger(
...     api_key=os.environ.get('COMET_API_KEY'),
...     workspace=os.environ.get('COMET_WORKSPACE'),  # Optional
...     save_dir='.',  # Optional
...     project_name='default_project',  # Optional
...     rest_api_key=os.environ.get('COMET_REST_API_KEY'),  # Optional
...     experiment_name='default'  # Optional
... )
>>> trainer = Trainer(logger=comet_logger)

OFFLINE MODE

Example
>>> from pytorch_lightning.loggers import CometLogger
>>> # arguments made to CometLogger are passed on to the comet_ml.Experiment class
>>> comet_logger = CometLogger(
...     save_dir='.',
...     workspace=os.environ.get('COMET_WORKSPACE'),  # Optional
...     project_name='default_project',  # Optional
...     rest_api_key=os.environ.get('COMET_REST_API_KEY'),  # Optional
...     experiment_name='default'  # Optional
... )
>>> trainer = Trainer(logger=comet_logger)
Parameters
    api_key (Optional[str]) – Required in online mode. API key, found on Comet.ml.
    save_dir (Optional[str]) – Required in offline mode. The path for the directory to save local Comet logs.
    workspace (Optional[str]) – Optional. Name of workspace for this user.
    project_name (Optional[str]) – Optional. Send your experiment to a specific project. Otherwise it will be sent to Uncategorized Experiments. If the project name does not already exist, Comet.ml will create a new project.
    rest_api_key (Optional[str]) – Optional. Rest API key found in Comet.ml settings. This is used to determine the version number.
    experiment_name (Optional[str]) – Optional. String representing the name for this particular experiment on Comet.ml.
    experiment_key (Optional[str]) – Optional. If set, restores from the existing experiment.
finalize(status)
When self.experiment.end() is called, that experiment won't log any more data to Comet. If you need to log additional data afterwards (for example, test metrics after training, since CometLogger.finalize() is called when training finishes), a new ExistingCometExperiment has to be created. This happens automatically in the experiment() property when self._experiment is set to None, i.e. via self.reset_experiment().

Return type
    None

log_hyperparams(params)
Record hyperparameters.

log_metrics(metrics, step=None)
Records metrics. This method logs metrics as soon as it receives them. If you want to aggregate metrics for one specific step, use the agg_and_log_metrics() method.
property experiment
Actual Comet object. To use Comet features in your LightningModule do the following.

Example:
    self.logger.experiment.some_comet_function()

Return type
    BaseExperiment
class pytorch_lightning.loggers.MLFlowLogger(experiment_name='default', tracking_uri=None, tags=None, save_dir=None)
Bases: pytorch_lightning.loggers.base.LightningLoggerBase

Log using MLflow. Install it with pip:

pip install mlflow

Example
>>> from pytorch_lightning import Trainer
>>> from pytorch_lightning.loggers import MLFlowLogger
>>> mlf_logger = MLFlowLogger(
...     experiment_name="default",
...     tracking_uri="file:./ml-runs"
... )
>>> trainer = Trainer(logger=mlf_logger)

Use the logger anywhere in your LightningModule as follows:

>>> from pytorch_lightning import LightningModule
>>> class LitModel(LightningModule):
...     def training_step(self, batch, batch_idx):
...         # example
...         self.logger.experiment.whatever_ml_flow_supports(...)
...
...     def any_lightning_module_function_or_hook(self):
...         self.logger.experiment.whatever_ml_flow_supports(...)
finalize(status='FINISHED')
Do any processing that is necessary to finalize an experiment.

log_hyperparams(params)
Record hyperparameters.

log_metrics(metrics, step=None)
Records metrics. This method logs metrics as soon as it receives them. If you want to aggregate metrics for one specific step, use the agg_and_log_metrics() method.
property experiment
Actual MLflow object. To use MLflow features in your LightningModule do the following.

Example:
    self.logger.experiment.some_mlflow_function()

Return type
    MlflowClient
class pytorch_lightning.loggers.NeptuneLogger(api_key=None, project_name=None, close_after_fit=True, offline_mode=False, experiment_name=None, upload_source_files=None, params=None, properties=None, tags=None, **kwargs)
Bases: pytorch_lightning.loggers.base.LightningLoggerBase

Log using Neptune. Install it with pip:

pip install neptune-client

The Neptune logger can be used in online mode or offline (silent) mode. To log experiment data in online mode, NeptuneLogger requires an API key. In offline mode, the logger does not connect to Neptune.

ONLINE MODE

Example
>>> from pytorch_lightning import Trainer
>>> from pytorch_lightning.loggers import NeptuneLogger
>>> # arguments made to NeptuneLogger are passed on to the neptune.experiments.Experiment class
>>> # We are using an api_key for the anonymous user "neptuner" but you can use your own.
>>> neptune_logger = NeptuneLogger(
...     api_key='ANONYMOUS',
...     project_name='shared/pytorch-lightning-integration',
...     experiment_name='default',  # Optional,
...     params={'max_epochs': 10},  # Optional,
...     tags=['pytorch-lightning', 'mlp']  # Optional,
... )
>>> trainer = Trainer(max_epochs=10, logger=neptune_logger)

OFFLINE MODE

Example
>>> from pytorch_lightning.loggers import NeptuneLogger
>>> # arguments made to NeptuneLogger are passed on to the neptune.experiments.Experiment class
>>> neptune_logger = NeptuneLogger(
...     offline_mode=True,
...     project_name='USER_NAME/PROJECT_NAME',
...     experiment_name='default',  # Optional,
...     params={'max_epochs': 10},  # Optional,
...     tags=['pytorch-lightning', 'mlp']  # Optional,
... )
>>> trainer = Trainer(max_epochs=10, logger=neptune_logger)
Use the logger anywhere in your LightningModule as follows:

>>> from pytorch_lightning import LightningModule
>>> class LitModel(LightningModule):
...     def training_step(self, batch, batch_idx):
...         # log metrics
...         self.logger.experiment.log_metric('acc_train', ...)
...         # log images
...         self.logger.experiment.log_image('worse_predictions', ...)
...         # log model checkpoint
...         self.logger.experiment.log_artifact('model_checkpoint.pt', ...)
...         self.logger.experiment.whatever_neptune_supports(...)
...
...     def any_lightning_module_function_or_hook(self):
...         self.logger.experiment.log_metric('acc_train', ...)
...         self.logger.experiment.log_image('worse_predictions', ...)
...         self.logger.experiment.log_artifact('model_checkpoint.pt', ...)
...         self.logger.experiment.whatever_neptune_supports(...)

If you want to log objects after the training is finished, use close_after_fit=False:

neptune_logger = NeptuneLogger(
    ...
    close_after_fit=False,
    ...
)
trainer = Trainer(logger=neptune_logger)
trainer.fit()

# Log test metrics
trainer.test(model)

# Log additional metrics
from sklearn.metrics import accuracy_score

accuracy = accuracy_score(y_true, y_pred)
neptune_logger.experiment.log_metric('test_accuracy', accuracy)

# Log charts
from scikitplot.metrics import plot_confusion_matrix
import matplotlib.pyplot as plt

fig, ax = plt.subplots(figsize=(16, 12))
plot_confusion_matrix(y_true, y_pred, ax=ax)
neptune_logger.experiment.log_image('confusion_matrix', fig)

# Save checkpoints folder
neptune_logger.experiment.log_artifact('my/checkpoints')

# When you are done, stop the experiment
neptune_logger.experiment.stop()
See also
An Example experiment showing the UI of Neptune.
Tutorial on how to use Pytorch Lightning with Neptune.
Parameters
    api_key (Optional[str]) – Required in online mode. Neptune API token, found on https://neptune.ai. Read how to get your API key. It is recommended to keep it in the NEPTUNE_API_TOKEN environment variable, in which case you can leave api_key=None.
    project_name (Optional[str]) – Required in online mode. Qualified name of a project in the form "namespace/project_name", for example "tom/minst-classification". If None, the value of the NEPTUNE_PROJECT environment variable will be taken. You need to create the project on https://neptune.ai first.
    offline_mode (bool) – Optional, default False. If True no logs will be sent to Neptune. Usually used for debug purposes.
    close_after_fit (Optional[bool]) – Optional, default True. If False the experiment will not be closed after training, so additional metrics, images or artifacts can be logged. Also, remember to close the experiment explicitly by running neptune_logger.experiment.stop().
    experiment_name (Optional[str]) – Optional. Editable name of the experiment. The name is displayed in the experiment's Details (Metadata section) and in the experiments view as a column.
    upload_source_files (Optional[List[str]]) – Optional. List of source files to be uploaded. Must be a list of str or a single str. Uploaded sources are displayed in the experiment's Source code tab. If None is passed, the Python file from which the experiment was created will be uploaded. Pass an empty list ([]) to upload no files. Unix style pathname pattern expansion is supported. For example, you can pass '*.py' to upload all Python source files from the current directory. For recursive lookup use '**/*.py' (for Python 3.5 and later). For more information see the glob library.
    params (Optional[Dict[str, Any]]) – Optional. Parameters of the experiment. After experiment creation params are read-only. Parameters are displayed in the experiment's Parameters section and each key-value pair can be viewed in the experiments view as a column.
    properties (Optional[Dict[str, Any]]) – Optional. Default is {}. Properties of the experiment. They are editable after the experiment is created. Properties are displayed in the experiment's Details section and each key-value pair can be viewed in the experiments view as a column.
    tags (Optional[List[str]]) – Optional. Default is []. Must be a list of str. Tags of the experiment. They are editable after the experiment is created (see: append_tag() and remove_tag()). Tags are displayed in the experiment's Details section and can be viewed in the experiments view as a column.
append_tags(tags)
Appends tags to the Neptune experiment.

finalize(status)
Do any processing that is necessary to finalize an experiment.

log_artifact(artifact, destination=None)
Save an artifact (file) in Neptune experiment storage.

log_hyperparams(params)
Record hyperparameters.
log_image(log_name, image, step=None)
Log image data in Neptune experiment.

Parameters
    log_name (str) – The name of the log, i.e. bboxes, visualisations, sample_images.
    image (Union[str, Image, Any]) – The value of the log (data-point). Can be one of the following types: PIL image, matplotlib.figure.Figure, path to image file (str).
    step (Optional[int]) – Step number at which the metrics should be recorded; must be strictly increasing.

Return type
    None
log_metric(metric_name, metric_value, step=None)
Log metrics (numeric values) in Neptune experiments.

log_metrics(metrics, step=None)
Log metrics (numeric values) in Neptune experiments.

log_text(log_name, text, step=None)
Log text data in Neptune experiments.

set_property(key, value)
Set a key-value pair as a Neptune experiment property.
property experiment
Actual Neptune object. To use Neptune features in your LightningModule do the following.

Example:
    self.logger.experiment.some_neptune_function()

Return type
    Experiment
class pytorch_lightning.loggers.TestTubeLogger(save_dir, name='default', description=None, debug=False, version=None, create_git_tag=False)
Bases: pytorch_lightning.loggers.base.LightningLoggerBase

Log to local file system in TensorBoard format but using a nicer folder structure (see full docs). Install it with pip:

pip install test_tube

Example
>>> from pytorch_lightning import Trainer
>>> from pytorch_lightning.loggers import TestTubeLogger
>>> logger = TestTubeLogger("tt_logs", name="my_exp_name")
>>> trainer = Trainer(logger=logger)

Use the logger anywhere in your LightningModule as follows:

>>> from pytorch_lightning import LightningModule
>>> class LitModel(LightningModule):
...     def training_step(self, batch, batch_idx):
...         # example
...         self.logger.experiment.whatever_method_summary_writer_supports(...)
...
...     def any_lightning_module_function_or_hook(self):
...         self.logger.experiment.add_histogram(...)
Parameters
    description (Optional[str]) – A short snippet about this experiment.
    version (Optional[int]) – Experiment version. If version is not specified the logger inspects the save directory for existing versions, then automatically assigns the next available version.
    create_git_tag (bool) – If True creates a git tag to save the code used in this experiment.
close()
Do any cleanup that is necessary to close an experiment.

Return type
    None

finalize(status)
Do any processing that is necessary to finalize an experiment.

log_hyperparams(params)
Record hyperparameters.

log_metrics(metrics, step=None)
Records metrics. This method logs metrics as soon as it receives them. If you want to aggregate metrics for one specific step, use the agg_and_log_metrics() method.

save()
Save log data.

Return type
    None
property experiment
Actual TestTube object. To use TestTube features in your LightningModule do the following.

Example:
    self.logger.experiment.some_test_tube_function()

Return type
    Experiment
class pytorch_lightning.loggers.WandbLogger(name=None, save_dir=None, offline=False, id=None, anonymous=False, version=None, project=None, tags=None, log_model=False, experiment=None, entity=None, group=None)
Bases: pytorch_lightning.loggers.base.LightningLoggerBase

Log using Weights and Biases. Install it with pip:

pip install wandb
Parameters
    offline (bool) – Run offline (data can be streamed later to wandb servers).
    id (Optional[str]) – Sets the version, mainly used to resume a previous run.
    anonymous (bool) – Enables or explicitly disables anonymous logging.
    version (Optional[str]) – Sets the version, mainly used to resume a previous run.
    project (Optional[str]) – The name of the project to which this run will belong.
    tags (Optional[List[str]]) – Tags associated with this run.
    log_model (bool) – Save checkpoints in wandb dir to upload on W&B servers.
    experiment – WandB experiment object.
    entity – The team posting this run (default: your username or your default team).
    group (Optional[str]) – A unique string shared by all runs in a given group.
Example
>>> from pytorch_lightning.loggers import WandbLogger
>>> from pytorch_lightning import Trainer
>>> wandb_logger = WandbLogger()
>>> trainer = Trainer(logger=wandb_logger)
See also
Tutorial on how to use W&B with Pytorch Lightning.
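As with the other loggers, you can reach the underlying run from your LightningModule. A minimal sketch (assuming the experiment property returns the wandb Run object, whose log() method takes a dict of metric names to values):

>>> from pytorch_lightning import LightningModule
>>> class LitModel(LightningModule):
...     def training_step(self, batch, batch_idx):
...         # log a custom metric directly through wandb
...         self.logger.experiment.log({'custom_metric': 0.5})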
log_hyperparams(params)
Record hyperparameters.

log_metrics(metrics, step=None)
Records metrics. This method logs metrics as soon as it receives them. If you want to aggregate metrics for one specific step, use the agg_and_log_metrics() method.
property experiment
Actual wandb object. To use wandb features in your LightningModule do the following.

Example:
    self.logger.experiment.some_wandb_function()

Return type
    Run
class pytorch_lightning.loggers.TrainsLogger(project_name=None, task_name=None, task_type='training', reuse_last_task_id=True, output_uri=None, auto_connect_arg_parser=True, auto_connect_frameworks=True, auto_resource_monitoring=True)
Bases: pytorch_lightning.loggers.base.LightningLoggerBase

Log using allegro.ai TRAINS. Install it with pip:

pip install trains

Example
>>> from pytorch_lightning import Trainer
>>> from pytorch_lightning.loggers import TrainsLogger
>>> trains_logger = TrainsLogger(
...     project_name='pytorch lightning',
...     task_name='default',
...     output_uri='.',
... )
TRAINS Task: ...
TRAINS results page: ...
>>> trainer = Trainer(logger=trains_logger)

Use the logger anywhere in your LightningModule as follows:

>>> from pytorch_lightning import LightningModule
>>> class LitModel(LightningModule):
...     def training_step(self, batch, batch_idx):
...         # example
...         self.logger.experiment.whatever_trains_supports(...)
...
...     def any_lightning_module_function_or_hook(self):
...         self.logger.experiment.whatever_trains_supports(...)
Parameters
    project_name (Optional[str]) – The name of the experiment's project. Defaults to None.
    task_name (Optional[str]) – The name of the experiment. Defaults to None.
    task_type (str) – The type of the experiment. Defaults to 'training'.
    reuse_last_task_id (bool) – Start with the previously used task id. Defaults to True.
    output_uri (Optional[str]) – Default location for output models. Defaults to None.
    auto_connect_arg_parser (bool) – Automatically grab the ArgumentParser and connect it with the task. Defaults to True.
    auto_connect_frameworks (bool) – If True, automatically patch supported frameworks to log to the TRAINS backend. Defaults to True.
    auto_resource_monitoring (bool) – If True, machine vitals will be sent alongside the task scalars. Defaults to True.
Examples
>>> logger = TrainsLogger("pytorch lightning", "default", output_uri=".")
TRAINS Task: ...
TRAINS results page: ...
>>> logger.log_metrics({"val_loss": 1.23}, step=0)
>>> logger.log_text("sample test")
sample test
>>> import numpy as np
>>> logger.log_artifact("confusion matrix", np.ones((2, 3)))
>>> logger.log_image("passed", "Image 1", np.random.randint(0, 255, (200, 150, 3), dtype=np.uint8))
classmethod bypass_mode()
Returns the bypass mode state.

Note
The GITHUB_ACTIONS environment variable will automatically set bypass_mode to True unless overridden specifically with TrainsLogger.set_bypass_mode(False).

Returns
    If True, all outside communication is skipped.

finalize(status=None)
Do any processing that is necessary to finalize an experiment.
log_artifact(name, artifact, metadata=None, delete_after_upload=False)
Save an artifact (file/object) in TRAINS experiment storage.

Parameters
    name (str) – Artifact name. Note: it will override the previous artifact if the name already exists.
    artifact (Union[str, Path, Dict[str, Any], ndarray, Image]) – Artifact object to upload. Currently supports:
        - string / pathlib.Path are treated as a path to an artifact file to upload. If a wildcard or a folder is passed, a zip file containing the local files will be created and uploaded.
        - dict will be stored as a .json file and uploaded.
        - pandas.DataFrame will be stored as a .csv.gz (compressed CSV) file and uploaded.
        - numpy.ndarray will be stored as a .npz file and uploaded.
        - PIL.Image.Image will be stored as a .png file and uploaded.
    metadata (Optional[Dict[str, Any]]) – Simple key/value dictionary to store on the artifact. Defaults to None.
    delete_after_upload (bool) – If True, the local artifact will be deleted (only applies if artifact is a local file). Defaults to False.

Return type
    None
log_hyperparams(params)
Log hyperparameters (numeric values) in TRAINS experiments.

log_image(title, series, image, step=None)
Log a debug image in TRAINS experiment.

Parameters
    title (str) – The title of the debug image, i.e. "failed", "passed".
    series (str) – The series name of the debug image, i.e. "Image 0", "Image 1".
    image (Union[str, ndarray, Image, Tensor]) – Debug image to log. If numpy.ndarray or torch.Tensor, the image is assumed to have:
        - shape: CHW
        - color space: RGB
        - value range: [0., 1.] (float) or [0, 255] (uint8)
    step (Optional[int]) – Step number at which the metrics should be recorded. Defaults to None.

Return type
    None
log_metric(title, series, value, step=None)
Log metrics (numeric values) in TRAINS experiments. This method is called by the user.

Return type
    None

log_metrics(metrics, step=None)
Log metrics (numeric values) in TRAINS experiments. This method is called by the Trainer.

Return type
    None

log_text(text)
Log console text data in TRAINS experiment.
classmethod set_bypass_mode(bypass)
Will bypass all outside communication and will drop all logs. Should only be used in "standalone mode", when there is no access to the trains-server.

classmethod set_credentials(api_host=None, web_host=None, files_host=None, key=None, secret=None)
Set new default TRAINS-server host and credentials. These configurations could be overridden by either OS environment variables or the trains.conf configuration file.

Note
Credentials need to be set prior to Logger initialization.

Parameters
    api_host (Optional[str]) – Trains API server url, example: host='http://localhost:8008'
    web_host (Optional[str]) – Trains WEB server url, example: host='http://localhost:8080'
    files_host (Optional[str]) – Trains Files server url, example: host='http://localhost:8081'
    key (Optional[str]) – user key/secret pair, example: key='thisisakey123'
    secret (Optional[str]) – user key/secret pair, example: secret='thisisseceret123'

Return type
    None
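A minimal sketch tying these together (the host, key, and secret values are just the placeholder examples from the parameter list above):

>>> from pytorch_lightning.loggers import TrainsLogger
>>> # credentials must be set before the logger is created
>>> TrainsLogger.set_credentials(
...     api_host='http://localhost:8008',
...     key='thisisakey123',
...     secret='thisisseceret123',
... )
>>> trains_logger = TrainsLogger(project_name='pytorch lightning', task_name='default')
TRAINS Task: ...
TRAINS results page: ...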
property experiment
Actual TRAINS object. To use TRAINS features in your LightningModule do the following.

Example:
    self.logger.experiment.some_trains_function()

Return type
    Task

property id
ID is a uuid (string) representing this specific experiment in the entire system.

property name
Name is a human-readable, non-unique name (str) of the experiment.