biexponential scale

Anja Mirenska

12 years ago

Hi everyone,

I wonder if there is a way to display biexponential/hyperlog scales in
ggplot2 or if you can suggest another way to display data that have a wide
range and quite a few negative values around zero (bell-shaped curve of the
"negative" population).

So basically I'd like to get scales similar to these ones:
http://flowjo.typepad.com/the_daily_dongle/images/2008/01/29/picture_10.png
http://flowjo.typepad.com/.a/6a00d8341c0ebb53ef0105365fb4fa970c-800wi
http://www.denovosoftware.com/images/whatIsBiex-BD-ScaleToP1.png

Best wishes

Anja

--
You received this message because you are subscribed to the ggplot2 mailing list.
Please provide a reproducible example: https://github.com/hadley/devtools/wiki/Reproducibility

To post: email ggplot2-/***@public.gmane.org
To unsubscribe: email ggplot2+unsubscribe-/***@public.gmane.org
More options: http://groups.google.com/group/ggplot2

Colin Gross

12 years ago

Permalink

For displaying and working with flow cytometry data, you might want to try
the Bioconductor flowCore and flowViz packages. I know there is also a
package for importing FlowJo workspaces.

...

Anja Mirenska

12 years ago

Permalink

Hi Colin,

Thanks for your reply. I've worked with these packages, but in this case
I'd really like to plot my data (which in fact aren't flow cytometry data
and which I analysed with other methods) with ggplot2 if this is possible.

Best wishes

Anja

...

Brandon Hurr

12 years ago

Permalink

Anja,

I think the point of the new scales package is that you can define the
scales in any way you want. How exactly you go about doing that is a
question I'm not skilled enough to answer. I would suggest digesting the
scales package documentation a bit, and/or hope that someone with the
skills can chime in and help.

require(scales)
?scales
?trans_new

log_trans
# prints the function, so you can see the guts of a trans

function (base = exp(1))
{
    trans <- function(x) log(x, base)
    inv <- function(x) base^x
    trans_new(str_c("log-", format(base)), trans, inv,
        log_breaks(base = base), domain = c(1e-100, Inf))
}
<environment: namespace:scales>
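
To make the pattern above concrete, here is a minimal sketch of a user-defined transformation built with trans_new. The cuberoot_trans name and the signed cube-root mapping are hypothetical, chosen only because the mapping is one-to-one and defined for negative values; newer releases of scales also offer new_transform as the name for the same constructor.

```r
library(scales)

# Hypothetical example: a signed cube-root transformation.
# It is one-to-one over the whole real line, so negative values are fine.
cuberoot_trans <- function() {
  trans_new(
    name      = "cuberoot",
    transform = function(x) sign(x) * abs(x)^(1 / 3),
    inverse   = function(x) x^3,
    domain    = c(-Inf, Inf)
  )
}

tr <- cuberoot_trans()
x <- c(-8, 0, 27)
tr$transform(x)               # positions used on the axis
tr$inverse(tr$transform(x))   # round-trips back to the data values
```

Because the function name ends in _trans, it should be usable as scale_y_continuous(trans = "cuberoot") or by passing cuberoot_trans() directly.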

Brandon

...

Anja Mirenska

12 years ago

Permalink

Hi Brandon,

Thanks for the pointer! I'll have a closer look at the scales package; it
hadn't occurred to me.

Of course I still would be happy if a "scales-expert" or simply a more
experienced R-user than me could offer additional help.

Best wishes

Anja

Dennis Murphy

12 years ago

Permalink

See http://docs.ggplot2.org/current/annotation_logticks.html

Dennis

Anja Mirenska

12 years ago

Permalink

Hi Dennis,

This only deals with annotations, not with the scales themselves. However,
I want the scale to be partly linear and partly logarithmic, i.e., the
region around 0 should be linear (e.g. -100 to +100, this window should be
adjustable) and the remaining scale log10, so it's about transformation of
the scale rather than its annotation.

Best wishes

Anja

...

Brian Diggs

12 years ago

Permalink

I wrote a couple of blog posts about how to create new transformations:

http://blog.ggplot2.org/post/25938265813/defining-a-new-transformation-for-ggplot2-scales

http://blog.ggplot2.org/post/29433173749/defining-a-new-transformation-for-ggplot2-scales-part

The first is probably more closely related to what you want to do, and
is hopefully laid out with an easy enough to follow example.

--
Brian S. Diggs, PhD
Senior Research Associate, Department of Surgery
Oregon Health & Science University

Anja Mirenska

12 years ago

Permalink

Hi Brian,

Thanks, the first blog post is very clear and helpful! I've tried to create
a biexponential transformation building upon your example. A conditional
transformation of the values did work, but the scale itself remained the
same, so that e.g. 1000 became 3 (log-transformed) and was plotted at the
break "3". The problem is that I want the original data values to remain
the same (so 1000 should still be 1000 rather than 3) while the scale
format should change from linear to logarithmic halfway through. So I guess
it's more a matter of the breaks and format functions rather than
transformation, right? However, I still don't have a clue how to define
this kind of breaks. Maybe someone could give me another hint?

Best wishes

Anja

...

Brian Diggs

12 years ago

Permalink

...

Can you show us what you have so far, and the mathematical definition of
the transformation? My guess is that the problem is in the breaks and
labels part, but it is hard to say without a (minimal) reproducible example.

...

Anja Mirenska

12 years ago

Permalink

Hi Brian,

Let's take this simple example:

library(ggplot2)
library(scales)

dat <- data.frame(x = seq(-100, 1000, by = 10), y = seq(-100, 1000, by = 10))
ggplot(dat, aes(x, y)) + geom_point()

Here I've got two equally spaced linear scales. Now I try to define a
transformation function (actually, I don't want to transform the data, but
rather to change the scale):

biexp_trans <- function(lim = 100){
  trans <- function(x){
    vec <- vector(mode = "numeric", length = length(x))
    for (i in seq_along(x)){
      if (x[i] <= lim){vec[i] <- x[i]} else {vec[i] <- log(x[i], 10)}
    }
    return(vec)
  }
  inv <- function(x) {
    vec <- vector(mode = "numeric", length = length(x))
    for (i in seq_along(x)){
      if (x[i] <= lim){vec[i] <- x[i]} else {vec[i] <- 10 ^ x[i]}
    }
    return(vec)
  }
  trans_new("biexp-", trans, inv)
}

ggplot(dat, aes(x,y)) + geom_point() + scale_y_continuous(trans="biexp")

The data transformation obviously works, but I don't want to plot the
logarithm of the data, I still want a 1000 to be a 1000. I only want to
change the way the data are displayed: low values (including negative
values) should be displayed on a linear scale, while values above a
particular limit should be plotted on a logarithmic scale. Do you have any
idea how I could achieve this?

Best wishes

Anja

...

Brian Diggs

12 years ago

Permalink

Thanks for the example; it gives me something to work from.

Your transformation is not quite right yet. In particular, you map x to
x when at or below the limit, but to log10(x) when above the limit.

> biexp_trans()$trans(dat$x)

[1] -100.000000 -90.000000 -80.000000 -70.000000 -60.000000
[6] -50.000000 -40.000000 -30.000000 -20.000000 -10.000000
[11] 0.000000 10.000000 20.000000 30.000000 40.000000
[16] 50.000000 60.000000 70.000000 80.000000 90.000000
[21] 100.000000 2.041393 2.079181 2.113943 2.146128
[26] 2.176091 2.204120 2.230449 2.255273 2.278754
[31] 2.301030 2.322219 2.342423 2.361728 2.380211
[36] 2.397940 2.414973 2.431364 2.447158 2.462398
[41] 2.477121 2.491362 2.505150 2.518514 2.531479
[46] 2.544068 2.556303 2.568202 2.579784 2.591065
[51] 2.602060 2.612784 2.623249 2.633468 2.643453
[56] 2.653213 2.662758 2.672098 2.681241 2.690196
[61] 2.698970 2.707570 2.716003 2.724276 2.732394
[66] 2.740363 2.748188 2.755875 2.763428 2.770852
[71] 2.778151 2.785330 2.792392 2.799341 2.806180
[76] 2.812913 2.819544 2.826075 2.832509 2.838849
[81] 2.845098 2.851258 2.857332 2.863323 2.869232
[86] 2.875061 2.880814 2.886491 2.892095 2.897627
[91] 2.903090 2.908485 2.913814 2.919078 2.924279
[96] 2.929419 2.934498 2.939519 2.944483 2.949390
[101] 2.954243 2.959041 2.963788 2.968483 2.973128
[106] 2.977724 2.982271 2.986772 2.991226 2.995635
[111] 3.000000

If the transformed value is 3, was the original value 3 or 1000? So what
you want is a scale that increases logarithmically above the limit, but
has unique values. This also points out another issue: the relative size
of the two parts of the scales. Numerically, right now, each decade on
the logarithmic scale is the same size as a single unit on the linear
scale. Looking at the example graphs you gave, this isn't the case. Each
decade is around the same size as the original limit (or bigger) [that
is, the space from 0 to 100 in your examples is about the same as the
space between 100 and 1000, 1000 and 10000, 10000 and 100000, etc.]
Adding in this scaling, putting in an offset, and making sure that the
transition around the limit is continuous gives the following
transformation and inverse functions:

trans <- function(x){
  ifelse(x <= lim, x, lim + decade.size *
           (suppressWarnings(log(x, 10)) - log(lim, 10)))
}
inv <- function(x) {
  ifelse(x <= lim, x, 10^(((x-lim)/decade.size) + log(lim,10)))
}
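
The ambiguity described above can be checked directly in base R. The following is only a sketch: naive_trans and revised_trans are hypothetical names for the first attempt and the revised mapping from this thread, with lim and decade.size both assumed to be 100.

```r
lim <- 100          # assumed switch point between linear and log regions
decade.size <- 100  # assumed axis length given to each decade above lim

# First attempt: identity up to lim, plain log10 above it.
naive_trans <- function(x) {
  ifelse(x <= lim, x, suppressWarnings(log10(x)))
}

# Revised mapping: the log part is offset and rescaled so that it
# continues from lim instead of dropping back to small values.
revised_trans <- function(x) {
  ifelse(x <= lim, x,
         lim + decade.size * (suppressWarnings(log10(x)) - log10(lim)))
}

x <- seq(-100, 1000, by = 10)

naive_trans(c(3, 1000))         # two different inputs, same position
is.unsorted(naive_trans(x))     # TRUE: the mapping is not monotone
is.unsorted(revised_trans(x))   # FALSE: increasing, hence invertible
```

A scale transformation has to be monotone for an inverse to exist, which is why the offset term matters more than the choice of decade.size.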

Note that I've also vectorized the functions rather than have an
explicit loop.

This trans on your data (with lim=100 and decade.size=100) gives

> trans(dat$x)

[1] -100.0000 -90.0000 -80.0000 -70.0000 -60.0000 -50.0000
[7] -40.0000 -30.0000 -20.0000 -10.0000 0.0000 10.0000
[13] 20.0000 30.0000 40.0000 50.0000 60.0000 70.0000
[19] 80.0000 90.0000 100.0000 104.1393 107.9181 111.3943
[25] 114.6128 117.6091 120.4120 123.0449 125.5273 127.8754
[31] 130.1030 132.2219 134.2423 136.1728 138.0211 139.7940
[37] 141.4973 143.1364 144.7158 146.2398 147.7121 149.1362
[43] 150.5150 151.8514 153.1479 154.4068 155.6303 156.8202
[49] 157.9784 159.1065 160.2060 161.2784 162.3249 163.3468
[55] 164.3453 165.3213 166.2758 167.2098 168.1241 169.0196
[61] 169.8970 170.7570 171.6003 172.4276 173.2394 174.0363
[67] 174.8188 175.5875 176.3428 177.0852 177.8151 178.5330
[73] 179.2392 179.9341 180.6180 181.2913 181.9544 182.6075
[79] 183.2509 183.8849 184.5098 185.1258 185.7332 186.3323
[85] 186.9232 187.5061 188.0814 188.6491 189.2095 189.7627
[91] 190.3090 190.8485 191.3814 191.9078 192.4279 192.9419
[97] 193.4498 193.9519 194.4483 194.9390 195.4243 195.9041
[103] 196.3788 196.8483 197.3128 197.7724 198.2271 198.6772
[109] 199.1226 199.5635 200.0000

> inv(trans(dat$x))

[1] -100 -90 -80 -70 -60 -50 -40 -30 -20 -10 0 10 20
[14] 30 40 50 60 70 80 90 100 110 120 130 140 150
[27] 160 170 180 190 200 210 220 230 240 250 260 270 280
[40] 290 300 310 320 330 340 350 360 370 380 390 400 410
[53] 420 430 440 450 460 470 480 490 500 510 520 530 540
[66] 550 560 570 580 590 600 610 620 630 640 650 660 670
[79] 680 690 700 710 720 730 740 750 760 770 780 790 800
[92] 810 820 830 840 850 860 870 880 890 900 910 920 930
[105] 940 950 960 970 980 990 1000

So inv really is the inverse of trans. The transition at the limit is
also continuous:

> trans(c(99.99, 100, 100.01))

[1] 99.9900 100.0000 100.0043

> inv(trans(c(99.99, 100, 100.01)))

[1] 99.99 100.00 100.01

For breaks, I just created a function which called pretty_breaks and/or
log_breaks as appropriate given the range of the data. Putting this all
together (I named it biexp2_trans so that I could have both versions at
once; you would drop the "2" part):

biexp2_trans <- function(lim = 100, decade.size = lim){
  # linear up to lim; above it, each decade spans decade.size axis units
  trans <- function(x){
    ifelse(x <= lim,
           x,
           lim + decade.size * (suppressWarnings(log(x, 10)) -
                                  log(lim, 10)))
  }
  # inverse of trans; trans(lim) == lim, so the same cutoff applies
  inv <- function(x) {
    ifelse(x <= lim,
           x,
           10^(((x-lim)/decade.size) + log(lim,10)))
  }
  # x holds the lower and upper limits of the scale, in data units
  breaks <- function(x) {
    if (all(x <= lim)) {
      pretty_breaks()(x)
    } else if (all(x > lim)) {
      log_breaks(10)(x)
    } else {
      unique(c(pretty_breaks()(c(x[1],lim)),
               log_breaks(10)(c(lim, x[2]))))
    }
  }
  trans_new(paste0("biexp-",format(lim)), trans, inv, breaks)
}
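
To see what a combined breaks function of this kind returns without loading any packages, here is a rough base-R sketch. The name biexp_breaks_sketch is hypothetical, and it uses pretty() and plain powers of ten in place of pretty_breaks and log_breaks, so the exact breaks may differ from those of the function above.

```r
# Hypothetical helper: breaks for a scale that is linear up to `lim`
# and log10 above it. `limits` is c(min, max) in data units; lim > 0.
biexp_breaks_sketch <- function(limits, lim = 100) {
  lo <- limits[1]
  hi <- limits[2]

  # Linear region: ordinary "pretty" breaks, kept at or below lim.
  linear_part <- numeric(0)
  if (lo < lim) {
    linear_part <- pretty(c(lo, min(hi, lim)))
    linear_part <- linear_part[linear_part <= lim]
  }

  # Log region: whole powers of ten between max(lo, lim) and hi.
  log_part <- numeric(0)
  if (hi > lim) {
    from <- ceiling(log10(max(lo, lim)))
    to <- floor(log10(hi))
    if (from <= to) log_part <- 10^seq(from, to)
  }

  sort(unique(c(linear_part, log_part)))
}

biexp_breaks_sketch(c(-100, 100000))  # spans both regions
biexp_breaks_sketch(c(-100, 50))      # linear region only
```

A function like this could be passed as the breaks argument of trans_new, since breaks that fall outside the plotted range are simply not drawn.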

And here are examples of use, including some with a larger range of data
and showing the effect of decade.size.

ggplot(dat, aes(x,y)) + geom_point() + scale_y_continuous(trans="biexp2")

ggplot(dat, aes(x,y)) + geom_point() +
  scale_y_continuous(trans=biexp2_trans(lim=100, decade.size=200))

ggplot(dat, aes(x,y)) + geom_point() +
  scale_y_continuous(trans=biexp2_trans(lim=100, decade.size=200)) +
  scale_x_continuous(trans=biexp2_trans(lim=100, decade.size=200))

dat2 <- data.frame(x=c(seq(-100, 1000, by=10), seq(1000, 100000, by=1000)),
                   y=c(seq(-100, 1000, by=10), seq(1000, 100000, by=1000)))

ggplot(dat2, aes(x,y)) + geom_point() + scale_y_continuous(trans="biexp2")

ggplot(dat2, aes(x,y)) + geom_point() +
  scale_y_continuous(trans=biexp2_trans(lim=100, decade.size=200))
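
As a possible alternative that was not proposed in this thread: later versions of scales (1.0.0 and up, as far as I know) ship pseudo_log_trans, a smooth asinh-based mapping that is roughly linear near zero and roughly logarithmic far from it, on both the positive and the negative side. The sketch below assumes such a version is installed; sigma = 50 and the explicit breaks are arbitrary choices for illustration.

```r
library(scales)

# Assumption: a scales version that provides pseudo_log_trans().
# sigma scales the near-linear region around zero; base sets the log base.
tr <- pseudo_log_trans(sigma = 50, base = 10)

x <- c(-1000, -100, 0, 100, 1000, 100000)
tr$transform(x)               # finite for zero and negative values
tr$inverse(tr$transform(x))   # recovers x up to floating-point error

# Plotting is optional here, so only do it when ggplot2 is available.
if (requireNamespace("ggplot2", quietly = TRUE)) {
  library(ggplot2)
  dat <- data.frame(x = seq(-100, 1000, by = 10),
                    y = seq(-100, 1000, by = 10))
  p <- ggplot(dat, aes(x, y)) + geom_point() +
    scale_y_continuous(trans = tr, breaks = c(-100, 0, 100, 1000))
}
```

Unlike the piecewise version, this mapping has no adjustable hard switch point, so it fits best when a smooth transition is acceptable. Recent ggplot2 releases also accept the same object through an argument named transform.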

Anja Mirenska

12 years ago

Permalink

Brian,

Thank you very much for the code and also for the lucid explanation!

...