Skip to contents

[Experimental] Attempt to list all columns in the database and group the ones that are common to some datasets. Useful to find keys to pivot or summarise data.

Usage

get_common_cols(lookup = edc_lookup(), min_datasets = 3)

# S3 method for class 'common_cols'
summary(object, ...)

Arguments

lookup

the lookup table, default to edc_lookup()

min_datasets

the minimal number of datasets to be considered

object

an object of class "common_cols"

...

unused

Value

a tibble of class "common_cols"

Examples

db = edc_example()
#> Warning: Option "edc_lookup" has been overwritten.
load_database(db)
x = get_common_cols(min_datasets=1)
x
#> # A tibble: 28 × 7
#>    column  name_in   datasets  n_datasets pct_datasets datasets_in  datasets_out
#>    <chr>   <list>    <list>         <int>        <dbl> <chr>        <chr>       
#>  1 crfname <lgl [8]> <chr [8]>          8        1     ae, long_pu… ""          
#>  2 subjid  <lgl [8]> <chr [8]>          8        1     ae, long_pu… ""          
#>  3 crfstat <lgl [8]> <chr [7]>          7        0.875 ae, data1, … "long_pure" 
#>  4 aegr    <lgl [8]> <chr [1]>          1        0.125 ae           "long_pure,…
#>  5 aesoc   <lgl [8]> <chr [1]>          1        0.125 ae           "long_pure,…
#>  6 age     <lgl [8]> <chr [1]>          1        0.125 enrol        "ae, long_p…
#>  7 arm     <lgl [8]> <chr [1]>          1        0.125 enrol        "ae, long_p…
#>  8 date1   <lgl [8]> <chr [1]>          1        0.125 data1        "ae, long_p…
#>  9 date10  <lgl [8]> <chr [1]>          1        0.125 data3        "ae, long_p…
#> 10 date2   <lgl [8]> <chr [1]>          1        0.125 data1        "ae, long_p…
#> # ℹ 18 more rows
summary(x)
#> # A tibble: 3 × 7
#>   pct_datasets n_datasets n_distinct_datasets n_columns columns    datasets   
#>   <chr>             <int>               <int>     <int> <list>     <list>     
#> 1 100%                  8                   1         2 <chr [2]>  <list [2]> 
#> 2 88%                   7                   1         1 <chr [1]>  <list [1]> 
#> 3 12%                   1                   8        25 <chr [25]> <list [25]>
#> # ℹ 1 more variable: columns_str <chr>