KeepNotes blog

Listing Column Names in SAS - SASlearner

Posted on 2022-04-10 In Programming-Notes , SAS
Symbols count in article: 2.7k Reading time ≈ 2 mins.

Sometimes we would want to list column names of a SAS dataset, which can be useful in other programs, such as proc freq, proc transpose.

Output Log to Text file in SAS - SASlearner

Posted on 2022-04-10 In Programming-Notes , SAS
Symbols count in article: 1.6k Reading time ≈ 1 mins.

Standing up a flag here, I'm planning to keep a series of notes to minute my questions met in sas learning. Practice is the best teacher, practice makes perfect!

Hexo-自定义about页面

Posted on 2022-04-08 In Learning-Notes , Essay
Symbols count in article: 590 Reading time ≈ 1 mins.

由于Hexo博客中的about页面默认是通过markdown渲染的，假如想用自定义的页面，比如HTML/CSS方式，那么该如何实现呢？

R package development - as a beginner

Posted on 2022-01-25 Edited on 2022-04-11 In Programming-Notes , R
Symbols count in article: 3.8k Reading time ≈ 3 mins.

对团队R包开发的协作经验，做个小结；并为个人开发R包做个基础知识准备

Date Format (SAS & R)

Posted on 2021-11-14 In Programming-Notes , SAS
Symbols count in article: 4.4k Reading time ≈ 4 mins.

Assume that you input a character string 20211109 with a sas format like yymmdd8. that specifies how sas must interpret the character string, you will get a numeric number 22593.

Graphical displays of data (SAS & R)

Posted on 2021-11-10 In Programming-Notes , SAS
Symbols count in article: 8.2k Reading time ≈ 7 mins.

This is reference to the "Chapter 8 A graphical compendium" in <SAS and R: Data Management, Statistical Analysis, and Graphics (second edition)>.

I believe that the capability of data science is more than just building predictive models, data visualization is also an integral part, especially in a convincing way.

Statistical functions and procedures (SAS & R)

Posted on 2021-11-07 In Programming-Notes , SAS
Symbols count in article: 5.2k Reading time ≈ 5 mins.

This is reference to the "Chapter 3 Statistical and mathematical functions", "Chapter 4 Programming and operating system interface" and "Chapter 5 Common statistical procedures" in <SAS and R: Data Management, Statistical Analysis, and Graphics (second edition)>.

Dataset manipulation (SAS & R)

Posted on 2021-10-31 In Programming-Notes , SAS
Symbols count in article: 5.1k Reading time ≈ 5 mins.

This is reference to the 2.3 section of Data management and 2.4 Date and time variables in <SAS and R: Data Management, Statistical Analysis, and Graphics (second edition)>.

在R中常说的数据集操作是指处理数据框类型的数据，当然有时也会是其他的数据类型。在SAS中就是数据集，SAS相比其他编程方法来说数据类型还是太少了。。

处理数据，常见的不外乎combination, collation, and subsetting

Derived variables and string manipulation (SAS & R)

Posted on 2021-10-28 In Programming-Notes , SAS
Symbols count in article: 4.7k Reading time ≈ 4 mins.

This is reference to the 2.1 to 2.2 section of Data management in <SAS and R: Data Management, Statistical Analysis, and Graphics (second edition)>.

Data input and output (SAS & R)

Posted on 2021-10-25 In Programming-Notes , SAS
Symbols count in article: 2k Reading time ≈ 2 mins.

我一直想找一个合适的方法来记录SAS的学习笔记，最好能结合以往的编程经验（如R or Python）；我想到了当初学习Python的时候是根据实际需要，结合R/Perl的既往经验来互补学习，那么SAS也是可以这样。

How to connect to Gitlab Repository

Posted on 2021-08-31 Edited on 2021-11-10 In Learning-Notes , Essay
Symbols count in article: 3.3k Reading time ≈ 3 mins.

What is Git?

Git is a version control system used to track changes in computer files. Git's primary purpose is to manage any changes made in one or more projects over a given period of time. It helps coordinate work among members of a project team and tracks progress over time. Git also helps both programming professionals and non-technical users by monitoring their project files.

What is Gitlab?

GitLab is a web-based Git repository that provides free open and private repositories, issue-following capabilities, and wikis. It is a complete DevOps platform that enables professionals to perform all the tasks in a project—from project planning and source code management to monitoring and security. Furthermore, it allows teams to collaborate and build better software.

RStudio Shiny Publish - 中文编码

Posted on 2021-08-29 In Shiny
Symbols count in article: 4.2k Reading time ≈ 4 mins.

以下是我在shiny publish中遇到的问题以及解决方法，以供大家参考

Reading notes for SAS Programming in the Pharmaceutical Industry book

Posted on 2021-08-29 Edited on 2021-11-10 In Programming-Notes , SAS
Symbols count in article: 7.3k Reading time ≈ 7 mins.

This post is the reading notes for the e-book “SAS Programming in the Pharmaceutical Industry” to keep recording some knowledge points about pharma programming.

Alluvial plot and Sankey diagram in R

Posted on 2021-07-15 Edited on 2021-07-16 In Programming-Notes , R
Symbols count in article: 5.7k Reading time ≈ 5 mins.

I have a strong interest in data visualization. For me, the purpose to learn this skill is driven by having a good understanding and examination for the question one wants to solve.

Programming in pharma using SAS

Posted on 2021-07-08 In Programming-Notes , SAS
Symbols count in article: 4.4k Reading time ≈ 4 mins.

It is not to be denied that sas is an essential skill for statistical programmers in the pharma field. Of course sas is a programming language which can be derived to different using requirements in different fields. So I think we should follow the actual requirements in the pharmaceutical industry to learn SAS if you want to be a qualified statistical programmer. Therefore the purpose of this post is to record some actual applications by sas so that I can understand and remember sas syntax clearly.

Embed R images into RTF in SAS

Posted on 2021-06-24 In Programming-Notes , SAS
Symbols count in article: 2.9k Reading time ≈ 3 mins.

We all know that IML/SAS make us use R code in SAS by submit /R statement. A few months ago, I consulted with SAS support for how to import plots by R in IML into RTF templates directly as I could not find any useful information in google. Unfortunately SAS support told me if the plot was created in R, it would need to be saved within the submit block as well using R code. It means if you want to directly import R graphics to RTF, maybe you should use some R function to achieve it.

Logistic Regression in SAS

Posted on 2021-06-20 In Programming-Notes , SAS
Symbols count in article: 6.4k Reading time ≈ 6 mins.

I have kept a note about logistic regression for biomarkers using R, and mentioned that I’d like to compare the code of R and SAS. Therefore how to use SAS to estimate a logistic regression model?

Logistic Regression for biomarker

Posted on 2021-06-19 In Biometrics , BiomedicalStats
Symbols count in article: 7.2k Reading time ≈ 7 mins.

Logistic Regression is one of the machine learning(ML) used for solving classification problems. It is used to predict the result of a categorical dependent variable based on one or more continuous or categorical independent variables. I have summarized Its basic principle in one blog (https://www.bioinfo-scrounger.com/archives/750/) referring to the book of "Statistical Learning Method".

Owning a hex sticker

Posted on 2021-06-09 In Essay
Symbols count in article: 2k Reading time ≈ 2 mins.

It seems that making hex stickers have become popular for R packages with the range of packages associated with RStudio. Therefore If you would like to own it, please try these approaches as shown below.

Maximally Selected Rank Statistics

Posted on 2021-06-07 In Statistics
Symbols count in article: 3.4k Reading time ≈ 3 mins.

When we mention how to find the cutoff, the first response in our brain may be the ROC curve. Absolutely, ROC curve is a very common approach in the biomarker field to look for a fit cutoff to a reagent. However in the ROC curve, the dependent variable must be two categorical variables, which is not universal to different types of data, such as quantitative variables, survival(censored) variables.