An artificial intelligence inference and training system using SSD offloading, according to one embodiment of the present invention, comprises: storage servers for storing data and comprising a plurality of SSDs which each comprise a first computing device; a transfer learning server connected to the storage servers via a network and generating an artificial intelligence model which is updated through periodic training; an inference server for extracting metadata for the data by using the artificial intelligence model received from the transfer learning server; and a database for storing the extracted metadata, wherein the transfer learning server comprises a second computing device, and a first part of the training is carried out through the first computing devices, and a second part, including the remaining training parts other than the first part of the training, is carried out through the second computing device.