DataLoaders.jl


			
			
			



			DataLoader
	
			
			(
    
			
			data
			
			,
    
			
			
			
			batchsize
			
			 
			
			= 
			
			1;
    
			
			
			
			
			partial
			
			 
			
			= 
			
			true
			
			,
    
			
			
			



			collate
	
			
			 
			
			= 
			
			true
			
			,
    
			
			
			
			buffered
			
			 
			
			= 
			



			collate
	
			
			,
    
			
			
			
			parallel
			
			 
			
			= 
			
			
			
			
			Threads
			
			.
			
			
			nthreads
			
			(
			
			) 
			
			> 
			
			1
			
			,
    
			
			
			
			useprimary
			
			 
			
			= 
			
			false
			
			,

			
			)

Create an efficient iterator of batches over data container data .

Arguments

Positional

data : A data container supporting the LearnBase data access pattern
batchsize = 1 : Number of samples to batch together . Disable batching by setting to nothing .

Keyword

partial::Bool = true : Whether to include the last batch when nobs(dataset) is not divisible by batchsize . true ensures all batches have the same size, but some samples might be dropped .
buffered::Bool = collate : If buffered is true , loads data inplace using getobs! . See Data containers for details on buffered loading .
parallel::Bool = Threads.nthreads() > 1) : Whether to load data in parallel, keeping the primary thread is . Default is true if more than one thread is available .
useprimary::Bool = false : If false , keep the main thread free when loading data in parallel . Is ignored if parallel is false .

Examples

Creating a data loader with batch size 16 and iterating over it:

Creating a data loader that uses buffers to load batches:

Turning off collating :


			
			
			
			
			dataloader
			
			 
			
			= 
			
			



			DataLoader
	
			
			(
			
			data
			
			, 
			
			16
			
			, 
			
			



			collate
	
			
			=
			
			false
			
			)

# Batches are a vector of observations

			
			
			
			
			length
			
			(
			
			
			first
			
			(
			
			dataloader
			
			)
			
			) 
			
			== 
			
			16