DataFrame/sort
sort a DataFrame by values in a given column
Calling Sequence
Parameters
Options
Description
Examples
Compatibility
sort( DF, index )
sort( DF, index, ordering, options )
DF - a DataFrame object
index - name, string, or integer value; specifies the column index for the chosen column
ordering - (optional) symbol or Boolean function of two arguments; sort ordering
options - (optional) equation(s) of the form option=value
nonstrict: specifies that the supplied function is a non-strict comparison function
strict: specifies that the supplied function is a strict comparison function
key: specifies that the supplied function is a key function
output: sorted, permutation, or the list [sorted, permutation]; specifies whether to return the sorted DataFrame, the permutation of rows that sorts the DataFrame, or both
The sort command sorts the rows of a DataFrame according to the sorted values in a given column.
The index argument selects which column is used to sort the rows. As a special case, you can supply the value 0 to select sorting by the row labels.
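As a rough sketch of the different index forms, consider a small made-up DataFrame. The names DF, A, B, r1, r2, and r3 are hypothetical, and the sketch assumes that an integer index counts columns from the left, starting at 1.

```maple
# Hypothetical data, used only for illustration.
DF := DataFrame( < < 3, 1, 2 > | < 10, 30, 20 > >,
                 columns = [A, B], rows = [r3, r1, r2] ):

sort( DF, A );   # select the column by its label
sort( DF, 2 );   # assumed to select the second column, B, by position
sort( DF, 0 );   # special case: sort by the row labels
```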
If the ordering argument is given, it specifies the ordering for sorting elements. By default, elements are sorted in ascending order for numeric values and lexicographic order for strings and names.
`<`: If ordering is the symbol `<` or numeric, then the DataFrame is sorted in ascending numerical order of the values in the selected column. The selected column must contain elements of type {numeric, real_infinity}.
`>`: If ordering is the symbol `>`, then the DataFrame is sorted into descending numerical order.
address: If ordering is the symbol address, then the elements are sorted by address (a non-deterministic run-time specific property of the underlying data structure).
length: If ordering is the symbol length, then the elements are sorted by length where length is as determined by the length function.
lexorder: If ordering is the symbol lexorder or string, then the strings or symbols in the selected column are sorted into lexicographic order.
Otherwise, ordering must be a Boolean-valued function, F, of two arguments. Specifically, F(a, b) returns false if and only if b must precede a in the sorted output. That is, F(a, b) is a non-strict less than comparison function. In addition, F(a, b) must be defined for all pairs a, b in the input structure, and F must be transitive: if F(a, b) = true and F(b, c) = true, then F(a, c) = true.
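A minimal sketch of a custom non-strict comparison function follows. The DataFrame DF and the function name F are hypothetical. The call to evalb is used here so that the inequality evaluates to true or false.

```maple
# Hypothetical data, used only for illustration.
DF := DataFrame( < < 3, 1, 2 > | < 10, 30, 20 > >,
                 columns = [A, B], rows = [r1, r2, r3] ):

# Non-strict comparison: F(a, b) is false exactly when b must precede a.
F := (a, b) -> evalb( a <= b ):

# Sort the rows by column A using F as the ordering argument.
sort( DF, A, F );
```

With this choice of F, the result should agree with the default ascending ordering on column A.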
By specifying comptype=Function, different styles of comparison functions can be given to sort. The supported values for comptype are:
nonstrict: the comparison Function is a non-strict less than function, as described above.
strict: the comparison Function is a strict less than function. That is, F(a, b) returns true if and only if a must precede b in the sorted output. Function must still be defined for all pairs of inputs and be transitive, as described above. This option is necessary if you want to specify a strict less than comparison function and want stable sorting. Specifying a strict less than function without using the strict option will result in a non-stable sorted output.
key: the Function maps each element in the selected column to a key value. The DataFrame is sorted by sorting the corresponding keys. Using a key function is preferable to a comparison function because the key function is called O(n) times, whereas a comparison function is called O(n log n) times. This is generally faster. In addition, sorting the keys may be done in parallel, whereas this may not be possible with a comparison function.
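The following sketch contrasts a key function with a strict comparison function that express the same ordering (by absolute value). The DataFrame DF is hypothetical, and the column A deliberately contains values with equal absolute value so that tie handling is visible.

```maple
# Hypothetical data with ties in absolute value, used only for illustration.
DF := DataFrame( < < -2, 1, 2, -1 > | < 10, 20, 30, 40 > >,
                 columns = [A, B], rows = [r1, r2, r3, r4] ):

# key: abs is applied once per element, and the rows are ordered by the resulting keys.
sort( DF, A, key = abs );

# strict: the function returns true exactly when a must precede b.
sort( DF, A, strict = ( (a, b) -> evalb( abs(a) < abs(b) ) ) );
```

The key form is usually the simpler choice when the ordering can be expressed as a per-element value, since it avoids repeated pairwise calls.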
If an argument output = sorted is supplied, sort returns the sorted argument. This is the default behavior.
If an argument output = permutation is supplied, then sort does not return the sorted argument, but the permutation that would be applied to the argument in order to sort it. The permutation is given as a list of integers: the ith entry of the permutation is the integer j such that the jth row of the DataFrame occurs at the ith position in the sorted argument. This means that if a := sort(DataFrame, ByColumn, output = permutation), then sort(DataFrame, ByColumn) can be obtained as DataFrame[a, ..].
In order to obtain both the sorted argument and the permutation, one can supply the argument output = [sorted, permutation]. This will return a sequence of two elements, the first being the sorted argument, the second the permutation.
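As a short sketch of requesting both results at once, the following uses a hypothetical DataFrame DF and hypothetical names sorted_DF and perm for the two returned values.

```maple
# Hypothetical data, used only for illustration.
DF := DataFrame( < < 3, 1, 2 > | < 10, 30, 20 > >,
                 columns = [A, B], rows = [r1, r2, r3] ):

# The result is a sequence of two elements, assigned to two names at once.
sorted_DF, perm := sort( DF, A, output = [sorted, permutation] ):

# By the definition of the permutation, perm should be [2, 3, 1] for this data.
perm;

# Indexing the original rows with perm should reproduce sorted_DF.
DF[ perm, .. ];
```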
Consider some data on berries.
genus := <"Rubus", "Vitis", "Fragaria">:
energy := <220, 288, 136>:
carbohydrates := <11.94, 18.1, 7.68>:
top_producer := <Russia, China, USA>:
berries := DataFrame(<energy | carbohydrates | top_producer | genus>, columns = [Energy, Carbohydrates, `Top Producer`, Genus], rows = [Raspberry, Grape, Strawberry])
berries :=
              Energy  Carbohydrates  Top Producer  Genus
  Raspberry   220     11.94          Russia        "Rubus"
  Grape       288     18.1           China         "Vitis"
  Strawberry  136     7.68           USA           "Fragaria"
It is possible to sort by any column. Here the DataFrame is sorted in order of ascending energy level:
sort(berries, Energy)
              Energy  Carbohydrates  Top Producer  Genus
  Strawberry  136     7.68           USA           "Fragaria"
  Raspberry   220     11.94          Russia        "Rubus"
  Grape       288     18.1           China         "Vitis"
In order of descending carbohydrate level:
sort(berries, Carbohydrates, `>`)
              Energy  Carbohydrates  Top Producer  Genus
  Grape       288     18.1           China         "Vitis"
  Raspberry   220     11.94          Russia        "Rubus"
  Strawberry  136     7.68           USA           "Fragaria"
It is also possible to sort columns with string and named values:
sort(berries, Genus)
sort(berries, `Top Producer`, length)
              Energy  Carbohydrates  Top Producer  Genus
  Strawberry  136     7.68           USA           "Fragaria"
  Grape       288     18.1           China         "Vitis"
  Raspberry   220     11.94          Russia        "Rubus"
The output option controls the returned output for the sort command. Using permutation returns the list of numeric indices that sort the original DataFrame.
index_order := sort(berries, Energy, `>`, output = permutation)
index_order := [2, 1, 3]
berries[index_order, ..]
To obtain the DataFrame sorted by its row labels, use 0 as the index argument.
sort(berries, 0)
Several other sort options are also supported by the DataFrame sort command:
berries := Append(berries, DataSeries(<1, 3, -2>, labels = [Raspberry, Grape, Strawberry]), label = RandomValues)
berries :=
              Energy  Carbohydrates  Top Producer  Genus       RandomValues
  Raspberry   220     11.94          Russia        "Rubus"     1
  Grape       288     18.1           China         "Vitis"     3
  Strawberry  136     7.68           USA           "Fragaria"  -2
sort(berries, RandomValues, key = abs)
              Energy  Carbohydrates  Top Producer  Genus       RandomValues
  Raspberry   220     11.94          Russia        "Rubus"     1
  Strawberry  136     7.68           USA           "Fragaria"  -2
  Grape       288     18.1           China         "Vitis"     3
The DataFrame/sort command was introduced in Maple 2017.
For more information on Maple 2017 changes, see Updates in Maple 2017.
The DataFrame/sort command was updated in Maple 2023.
The index parameter was updated in Maple 2023.
See Also
DataSeries/sort
sort