|
LBANN
0.103.0
LivermoreBigArtificialNeuralNetworkToolkit
|
Namespaces | |
| dump_weights_internal | |
Classes | |
| class | adaptive_learning_rate |
| class | alternate_updates |
| class | autoencoder_strategy |
| Subclass of image_output_strategy to dump autoencoder images. More... | |
| class | categorical_accuracy_strategy |
| class | check_dataset |
| class | check_gradients |
| Gradient checking callback. More... | |
| class | check_init |
| class | check_metric |
| class | check_nan |
| class | check_small |
| class | checkpoint |
| Checkpoint at given interval in given directory. More... | |
| class | compute_model_size |
| Compute number of parameters in model. More... | |
| class | confusion_matrix |
| class | cosine_decay_learning_rate |
| class | debug |
| Phase specific "printf debugging". More... | |
| class | debug_io |
| class | drop_fixed_learning_rate |
| class | dump_error_signals |
| class | dump_gradients |
| Dump gradient matrices to files. More... | |
| class | dump_minibatch_sample_indices |
| Dump sample indices for each minibatch to files. More... | |
| class | dump_model_graph |
| Dump model graph callback. More... | |
| class | dump_outputs |
| Dump layer output tensors to files. More... | |
| class | dump_weights |
| Dump weights to files. More... | |
| class | early_stopping |
| class | export_onnx |
| Callback to export a trained model to onnx format. More... | |
| class | gpu_memory_usage |
| class | hang |
| class | image_output_strategy |
| Interface for strategies for determining which images to output to the summarizer. More... | |
| class | learning_rate |
| class | linear_growth_learning_rate |
| class | load_model |
| class | ltfb |
| Tournament training. More... | |
| class | minibatch_schedule |
| class | mixup |
| class | monitor_io |
| class | optimizerwise_adaptive_learning_rate |
| class | perturb_adam |
| Hyperparameter exploration with Adam optimizers. More... | |
| class | perturb_dropout |
| Hyperparameter exploration with dropouts. More... | |
| class | perturb_learning_rate |
| Hyperparameter exploration of optimizer learning rate. More... | |
| class | perturb_weights |
| Perturb values in a weights tensor. More... | |
| class | poly_learning_rate |
| class | print_model_description |
| Print human-readable description of model to standard input. More... | |
| class | print_statistics |
| class | profiler |
| class | progress_bar |
| prints a progress bar during training More... | |
| class | replace_weights |
| class | save_images |
| class | save_model |
| class | save_topk_models |
| Save the top K models for, e.g., inference and other analysis. More... | |
| class | set_learning_rate |
| class | set_weights_value |
| Set values in a weights object at a given training step. More... | |
| class | step_learning_rate |
| class | step_minibatch |
| class | summarize_images |
| Callback to dump images to event files based on strategy. More... | |
| class | summary |
| class | sync_layers |
| class | timeline |
| class | timer |
| class | variable_minibatch |
Enumerations | |
| enum | callback_phase { callback_phase::batch, callback_phase::epoch, callback_phase::validation, callback_phase::inference, callback_phase::invalid } |
Functions | |
| std::unique_ptr< callback_base > | build_alternate_updates_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| LBANN_ADD_DEFAULT_CALLBACK_BUILDER (check_dataset, build_check_dataset_callback_from_pbuf) | |
| std::unique_ptr< callback_base > | build_check_gradients_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| LBANN_ADD_DEFAULT_CALLBACK_BUILDER (check_init, build_check_init_callback_from_pbuf) | |
| std::unique_ptr< callback_base > | build_check_metric_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::string | get_trainer_checkpoint_dirname (const std::string &trainer_name, const std::string &dir) |
| std::string | get_last_shared_checkpoint_filename (const std::string &alg_name, const std::string &dir) |
| std::string | get_last_shared_checkpoint_filename (const std::string &trainer_name, const std::string &alg_name, const std::string &dir) |
| std::string | get_shared_checkpoint_dirname (const std::string &alg_name, const std::string &dir, visitor_hook hook, execution_mode mode, size_t epoch, size_t step) |
| std::string | get_shared_checkpoint_dirname (const std::string &trainer_name, const std::string &alg_name, const std::string &dir, visitor_hook hook, execution_mode mode, size_t epoch, size_t step) |
| std::string | get_last_distributed_checkpoint_filename (const std::string &alg_name, const std::string &dir) |
| std::string | get_last_distributed_checkpoint_filename (const std::string &trainer_name, const std::string &alg_name, const std::string &dir) |
| std::string | get_distributed_checkpoint_dirname (const std::string &alg_name, const int rank_in_trainer, const std::string &dir, visitor_hook hook, execution_mode mode, size_t epoch, size_t step) |
| std::string | get_distributed_checkpoint_dirname (const std::string &trainer_name, const std::string &alg_name, const int rank_in_trainer, const std::string &dir, visitor_hook hook, execution_mode mode, size_t epoch, size_t step) |
| bool | write_latest (std::string filename, visitor_hook hook, execution_mode mode, size_t epoch, size_t train) |
| bool | read_latest (std::string filename, visitor_hook *hook, execution_mode *mode, size_t *epochLast, size_t *trainLast) |
| Reads the "latest" file and returns the epoch number and sample offset for most recent checkpoint. More... | |
| std::unique_ptr< callback_base > | build_checkpoint_callback_from_pbuf (const google::protobuf::Message &) |
| std::unique_ptr< callback_base > | build_compute_model_size_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_confusion_matrix_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_debug_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_debug_io_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_dump_error_signals_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_dump_gradients_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_dump_mb_indices_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_dump_model_graph_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_dump_outputs_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_dump_weights_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_early_stopping_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_export_onnx_callback_from_pbuf (const google::protobuf::Message &proto_msg, const std::shared_ptr< lbann_summary > &) |
| LBANN_ADD_DEFAULT_CALLBACK_BUILDER (gpu_memory_usage, build_gpu_memory_usage_callback_from_pbuf) | |
| std::unique_ptr< callback_base > | build_hang_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_step_learning_rate_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_set_learning_rate_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_adaptive_learning_rate_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_drop_fixed_learning_rate_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_linear_growth_learning_rate_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_poly_learning_rate_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_optimizerwise_adaptive_learning_rate_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_cosine_decay_learning_rate_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_load_model_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_ltfb_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_mixup_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_monitor_io_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_perturb_adam_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_perturb_dropout_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_perturb_learning_rate_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_perturb_weights_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_print_model_description_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_print_statistics_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_profiler_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_progress_bar_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_replace_weights_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_save_images_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::string | get_save_model_dirname (const std::string &trainer_name, const std::string &model_name, const std::string &dir) |
| std::unique_ptr< callback_base > | build_save_model_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_save_topk_models_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_set_weights_value_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< image_output_strategy > | build_categorical_accuracy_strategy_from_pbuf (google::protobuf::Message const &) |
| std::unique_ptr< image_output_strategy > | build_track_sample_ids_strategy_from_pbuf (google::protobuf::Message const &) |
| Layer const & | get_layer_by_name (model const &m, std::string const &layer_name) |
| Get a layer from model based on name. More... | |
| std::unique_ptr< callback_base > | build_summarize_images_callback_from_pbuf (const google::protobuf::Message &, const std::shared_ptr< lbann_summary > &summarizer) |
| std::unique_ptr< callback_base > | build_summary_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_sync_layers_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_timeline_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_timer_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_step_minibatch_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
| std::unique_ptr< callback_base > | build_minibatch_schedule_callback_from_pbuf (const google::protobuf::Message &, std::shared_ptr< lbann_summary > const &) |
|
strong |
| Enumerator | |
|---|---|
| batch | |
| epoch | |
| validation | |
| inference | |
| invalid | |
Definition at line 42 of file checkpoint.hpp.
| std::unique_ptr<callback_base> lbann::callback::build_adaptive_learning_rate_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_alternate_updates_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<image_output_strategy> lbann::callback::build_categorical_accuracy_strategy_from_pbuf | ( | google::protobuf::Message const & | ) |
| std::unique_ptr<callback_base> lbann::callback::build_check_gradients_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_check_metric_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_checkpoint_callback_from_pbuf | ( | const google::protobuf::Message & | ) |
| std::unique_ptr<callback_base> lbann::callback::build_compute_model_size_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_confusion_matrix_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_cosine_decay_learning_rate_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_debug_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_debug_io_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_drop_fixed_learning_rate_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_dump_error_signals_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_dump_gradients_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_dump_mb_indices_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_dump_model_graph_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_dump_outputs_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_dump_weights_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_early_stopping_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_export_onnx_callback_from_pbuf | ( | const google::protobuf::Message & | proto_msg, |
| const std::shared_ptr< lbann_summary > & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_hang_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_linear_growth_learning_rate_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_load_model_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_ltfb_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_minibatch_schedule_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_mixup_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_monitor_io_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_optimizerwise_adaptive_learning_rate_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_perturb_adam_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_perturb_dropout_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_perturb_learning_rate_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_perturb_weights_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_poly_learning_rate_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_print_model_description_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_print_statistics_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_profiler_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_progress_bar_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_replace_weights_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_save_images_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_save_model_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_save_topk_models_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_set_learning_rate_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_set_weights_value_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_step_learning_rate_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_step_minibatch_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_summarize_images_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| const std::shared_ptr< lbann_summary > & | summarizer | ||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_summary_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_sync_layers_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_timeline_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<callback_base> lbann::callback::build_timer_callback_from_pbuf | ( | const google::protobuf::Message & | , |
| std::shared_ptr< lbann_summary > const & | |||
| ) |
| std::unique_ptr<image_output_strategy> lbann::callback::build_track_sample_ids_strategy_from_pbuf | ( | google::protobuf::Message const & | ) |
| std::string lbann::callback::get_distributed_checkpoint_dirname | ( | const std::string & | alg_name, |
| const int | rank_in_trainer, | ||
| const std::string & | dir, | ||
| visitor_hook | hook, | ||
| execution_mode | mode, | ||
| size_t | epoch, | ||
| size_t | step | ||
| ) |
| std::string lbann::callback::get_distributed_checkpoint_dirname | ( | const std::string & | trainer_name, |
| const std::string & | alg_name, | ||
| const int | rank_in_trainer, | ||
| const std::string & | dir, | ||
| visitor_hook | hook, | ||
| execution_mode | mode, | ||
| size_t | epoch, | ||
| size_t | step | ||
| ) |
| std::string lbann::callback::get_last_distributed_checkpoint_filename | ( | const std::string & | alg_name, |
| const std::string & | dir | ||
| ) |
| std::string lbann::callback::get_last_distributed_checkpoint_filename | ( | const std::string & | trainer_name, |
| const std::string & | alg_name, | ||
| const std::string & | dir | ||
| ) |
| std::string lbann::callback::get_last_shared_checkpoint_filename | ( | const std::string & | alg_name, |
| const std::string & | dir | ||
| ) |
| std::string lbann::callback::get_last_shared_checkpoint_filename | ( | const std::string & | trainer_name, |
| const std::string & | alg_name, | ||
| const std::string & | dir | ||
| ) |
Get a layer from model based on name.
| m | The model |
| layer_name | Name of layer |
|
inline |
| std::string lbann::callback::get_shared_checkpoint_dirname | ( | const std::string & | alg_name, |
| const std::string & | dir, | ||
| visitor_hook | hook, | ||
| execution_mode | mode, | ||
| size_t | epoch, | ||
| size_t | step | ||
| ) |
| std::string lbann::callback::get_shared_checkpoint_dirname | ( | const std::string & | trainer_name, |
| const std::string & | alg_name, | ||
| const std::string & | dir, | ||
| visitor_hook | hook, | ||
| execution_mode | mode, | ||
| size_t | epoch, | ||
| size_t | step | ||
| ) |
| std::string lbann::callback::get_trainer_checkpoint_dirname | ( | const std::string & | trainer_name, |
| const std::string & | dir | ||
| ) |
| lbann::callback::LBANN_ADD_DEFAULT_CALLBACK_BUILDER | ( | check_init | , |
| build_check_init_callback_from_pbuf | |||
| ) |
| lbann::callback::LBANN_ADD_DEFAULT_CALLBACK_BUILDER | ( | gpu_memory_usage | , |
| build_gpu_memory_usage_callback_from_pbuf | |||
| ) |
| lbann::callback::LBANN_ADD_DEFAULT_CALLBACK_BUILDER | ( | check_dataset | , |
| build_check_dataset_callback_from_pbuf | |||
| ) |
| bool lbann::callback::read_latest | ( | std::string | filename, |
| visitor_hook * | hook, | ||
| execution_mode * | mode, | ||
| size_t * | epochLast, | ||
| size_t * | trainLast | ||
| ) |
Reads the "latest" file and returns the epoch number and sample offset for most recent checkpoint.
| bool lbann::callback::write_latest | ( | std::string | filename, |
| visitor_hook | hook, | ||
| execution_mode | mode, | ||
| size_t | epoch, | ||
| size_t | train | ||
| ) |