兼职教授

当前位置:首页 >> 师资队伍 >> 兼职教授
  

王龙 兼职教授

  王龙,IBM T. J. Watson研究院高级研究员、北卡罗莱纳州立大学兼职教授和北京石油化工学院人工智能研究院兼职教授。

  负责IBM健康云(IBMWatson Health)的安全可靠性部门(下辖多个系统可靠性服务和系统安全性服务的组)。同时也是北卡罗莱纳州立大学兼职教授,还指导了包括爱荷华大学、中佛罗里达大学和北京大学的多名博士生的论文。在国际一流期刊和学术会议上发表了数十篇论文,申请了几十项专利,是国际电气电子工程师协会(IEEE)高级会员。担任多个国际顶级会议的组织委员会委员和程序委员会委员,是包括TDSC(IEEE Transactions on Dependable and Secure Computing)在内的多个国际顶级期刊和会议的审稿人,是国际上知名的复杂系统可靠性和 checkpoint/recovery 领域的专家学者。

  在IBM T. J. Watson研究院从事了近十年的研究工作,主持了多项IBM云系统相关的科研课题,包括IBM健康云的系统和医疗数据的安全性与可靠性(9.8亿元人民币的项目),IBM企业专有云的灾难重建(8800 万元),和IBM公有云的性能检测与矫正(3700万元)。这些科研项目的成果已经应用在了IBM的多个云计算平台上,包括SmartCloud Enterprise(一个IBM公有云),CloudManaged Services(面向企业的 managed services的专有云),和Watson Health Cloud(IBM健康云)。其中“企业专有云的灾难重建”课题获得了IBM的杰出成就奖。目前本人负责整个IBM健康云的安全和可靠部门。除了产品应用外,这些科研课题的成果还体现在数十篇国际顶级会议和期刊的论文以及数十项专利中。

 

教育经历

  •  2000年6月 本科 北京大学  计算机科学
  •  2002年5月 硕士 美国伊利诺伊州香槟市伊利诺伊大学(UIUC)
  •  2010年12月 博士 美国伊利诺伊州香槟市伊利诺伊大学 电气与计算机工程


Awards and Honors
 

  • IBM Outstanding Accomplishments on Inventions, 12/2018, 02/2017, 09/2015, 05/2014, 10/2012
  • IBM Award for Outstanding Accomplishments on Cloud Managed Service, 03/2015
  • Best Paper Nomination, IEEE International Symposium on Software Reliability Engineering (ISSRE) Industry Track, 2018
  • IEEE Senior Member, 2016
  • Boeing Scholarship Award to outstanding graduate student, 2007-2008
  • Best Paper, IEEE Pacific Rim International Symposium on Dependable Computing (PRDC) 2006 (recommended to Journal publication by the PRDC committee)
  • Student Travel Grant to attend DSN 2008, DSN 2005
  • Thesis with honors from Department of Computer Science, Beijing University, 200
     

Invited Talks

  • "Availability Architecture for Achieving Data Consistency in Composite Data Pipelines", Peer-reviewed and Selected talk at IBM Conference on Performance | Availability | Security (PERVAIL), Munich, Germany, October 2019.
     
  • "Designing Resiliency for Big Data Software-as-a-Service Systems,” Invited talk at the "Industry-Day" workshop in The University of British Columbia, Vancouver, BC, Canada, February 2019.
     
  • "Log-based Abnormal Task Detection and Root Cause Analysis for Spark,” Invited talk at a research seminar in University of Illinois at Urbana Champaign, Urbana, IL, USA, September 2017.
     
  • "Failure Diagnosis for Distributed Systems using Targeted Fault Injection,” Invited talk at Beijing University, Beijing, China, October 2016.
     
  • "A Common Approach for Providing Application-Aware Reliability through Operating System,” Invited talk at IBM China Research Laboratory (CRL), Beijing, China, August 2008

Refereed Journals and Book Chapters
* indicates I am the (co-)corresponding author or (co-)first author of the paper

1."Transparently Capturing Execution Path of Service Request Processing for Anomaly Detection," Yong Yang, Long Wang*, Jing Gu, Ying Li, submitted to IEEE Transactions on Parallel and Distributed Systems (TPDS).

2."LADRA: Log-based abnormal task detection and root-cause analysis in big data processing with Spark," Siyang Lu, Xiang Wei, Bingbing Rao, Byungchul Tak, Long Wang, Liqiang Wang, Future Generation Computer Systems Journal (FGCS), Volume 95, June 2019.

3."Failure Diagnosis for Distributed Systems using Targeted Fault Injection," Cuong Pham, Long Wang*, Byung Chul Tak, Salman Baset, Chunqiang Tang, Zbigniew Kalbarczyk, Ravishankar K. Iyer, IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume: 28, Issue: 2, Feb. 2017.

4."Public Cloud Service Agreements: What to Expect and What to Negotiate" (book), Claude Baudoin, Long Wang, Jordan Flynn, John Meegan, et al., Cloud Standards Customer CouncilAug. 2016.

5."A Methodology for Continuous Evaluation of Cloud Resiliency,"Xiaoyong Yuan, Long Wang*, Tiancheng Liu, Yue Zhang, American Journal of Engineering and Applied Sciences (AJEAS), 2016.

6."VM-μCheckpoint: Design, Modeling, and Assessment of Lightweight In-Memory VM Checkpointing," Long Wang*, Zbigniew Kalbarczyk, Ravishankar K. Iyer, Arun Iyengar, IEEE Transactions on Dependable and Secure Computing (TDSC), vol.12, no. 2, 2015.

7."Toward Achieving Operational Excellence in a Cloud," Salman A. Baset, Long Wang, Byung Chul Tak, Chuong Pham, Chunqiang Tang, IBM Journal of Research and Development, Volume 58, Issue 2, 2014.

8."Reliability MicroKernel: Providing Application-Aware Reliability in OS," Long Wang*, Zbigniew Kalbarczyk, Weining Gu, Ravishankar K. Iyer, IEEE Transactions on Reliability (TR), Vol. 56, No. 4, Dec. 2007 (invited paper).

9."Application Fault Tolerance Employing ARMOR Middleware," Zbigniew Kalbarczyk, Ravishankar K. Iyer, Long Wang, IEEE Internet Computing, Vol 9, Issue 2, 2005.

Proceedings of Refereed Conferences

1."Scheduling Physical Machine Maintenance on Qualified Clouds: What if Migration is not Allowed?”, Long Wang*, Harigovind V Ramasamy, Richard Harper, submitted to The IEEE Int’l Conference on Cloud Computing (CLOUD), 2020.

2."System Restore in a Multi-Cloud Data Pipeline Platform," Long Wang*, Harigovind V Ramasamy, Valentina Salapura, et al., The Int’l Conference on Dependable Systems and Networks (DSN), Industry Track, 2019.

3."Transparently Capturing Execution Path of Service/Job Request Processing," Yong Yang, Long Wang*, Jing Gu, Ying Li. International Conference on Service-Oriented Computing (ICSOC) 2018. Lecture Notes in Computer Science, vol 11236.

4."KEREP: Experience in Extracting Knowledge on Distributed System Behavior through Request Execution Path," Jing Gu, Long Wang*, Yong Yang and Ying Li, Best Paper Nominee, IEEE International Symposium on Software Reliability Engineering (ISSRE), Industry Track, 2018.

5."DevOps Practices for Building Secure and Resilient Cloud-Native Web Applications," Long Wang*, Harigovind V Ramasamy, Richard Harper, Ruchi Mahindru, The Int’l Conference on Dependable Systems and Networks (DSN), Tutorial, 2018.

6."Planning, Building, and Managing Resiliency on the Cloud," Harigovind V Ramasamy, Long Wang and Richard Harper, ACM Symposium on Operating Systems Principles (SOSP) Tutorial, 2017.

7."Log-based Abnormal Task Detection and Root Cause Analysis for Spark," Siyang Lu, Bingbing Rao, Xiang Wei, Byungchul Tak, Long Wang, Liqiang Wang, IEEE International Conference on Web Services (ICWS), 2017.

8."Providing Resiliency to Orchestration and Automation Engines in Hybrid Cloud,"Long Wang*, Harigovind V Ramasamy, Alexei Karve, Richard Harper, The Int’l Conference on Dependable Systems and Networks (DSN), Industry Track, 2017.

9."Predicting Misconfiguration-induced Unsuccessful Executions of Jobs in Big Data System," Hongyan Tang, Ying Li, Long Wang, Jing Gu, Zhonghai Wu, IEEE Computer Society Signature Conference on Computers, Software and Applications (COMPSAC), 2017.

10."Disaster Recovery for Cloud-Hosted Enterprise Applications," Long Wang*, Richard Harper, Ruchi Mahindru, Harigovind V Ramasamy, The IEEE Int’l Conference on Cloud Computing (CLOUD), San Francisco, USA, 2016.

11."Auto-tuning Performance of MPI Parallel Programs Using Resource Management in Container-Based Virtual Cloud," Hongyi Ma, Liqiang Wang, Byung Chul Tak, Long Wang, Chunqiang Tang, The IEEE Int’l Conference on Cloud Computing (CLOUD), San Francisco, USA, 2016.

12."Activating Protection and Exercising Recovery Against Large-Scale Outages on the Cloud," Long Wang*, Harigovind V Ramasamy, Richard Harper, Ruchi Mahindru, The Int’l Conference on Dependable Systems and Networks (DSN), Tutorial, Toulouse, France, 2016.

13."Designing Survivability for Big Data Software-as-a-Service Systems," Hari Ramasamy, Long Wang, Richard Harper, IEEE International Symposium on Software Reliability Engineering (ISSRE), Tutorial, 2016.

14."Building and Managing Business Resiliency on the Cloud," Long Wang*, Richard Harper, Harigovind V Ramasamy, Mahesh Viswanathan, ACM Middleware conference (MIDDLEWARE), Tutorial, Vancouver, Canada, 2015.

15."Experiences with Building Disaster Recovery for Enterprise-Class Clouds," Long Wang*, Harigovind V Ramasamy, Richard Harper, Mahesh Viswanathan, E. Plattier, The Int’l Conference on Dependable Systems and Networks (DSN), Rio de Janeiro, Brazil, 2015.

16."Disaster Recovery for Enterprise-Class Clouds," Long Wang*, Richard Harper, Harigovind V Ramasamy, MaheshViswanathan, The Int’l Conference on Dependable Systems and Networks (DSN), Tutorial, Rio de Janeiro, Brazil, 2015.

17."Approximate Fault Localization using Message Flow Reconstruction and Targeted Fault Injection," Cuong Pham, Long Wang*, Byung Chul Tak, Salman Baset, Chunqiang Tang, Zbigniew Kalbarczyk, Ravishankar K. Iyer, USENIX Annual Technical Conference (USENIX), Poster Session, 2014.

18."CAP3: A Cloud Auto-Provisioning Framework for Parallel Processing Using On-demand and Spot Instances," He Huang, Liqiang Wang, Byung Chul Tak, Long Wang, Chunqiang Tang, The IEEE Int’l Conference on Cloud Computing (CLOUD), Santa Clara, CA, USA, 2013.

19."Dissecting Open Source Cloud Evolution: An OpenStack Case Study," Salman A. Baset, Chunqiang Tang, Byung Chul Tak, Long Wang, 5th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud), San Jose, CA, USA, 2013.

20."PseudoApp: Performance Prediction for Application Migration to Cloud," Byung Chul Tak, Chunqiang Tang, Hai Huang, Long Wang, IEEE International Symposium on Integrated Network Management (IM), Ghent, Belgium, 2013.

21."Universal Script Wrapper – An Innovative Solution for Endpoint Management in Large and Heterogeneous Environments," Sai Zeng, Shang Guo, Fred Wu, Constantin Adam, Long Wang, Cashchakanithara Venugopal, Rajeev Puri, Ramesh Palakodeti, IEEE International Symposium on Integrated Network Management (IM), Ghent, Belgium, 2013.

22."Remediating Overload in Over-subscribed Computing Environments," Long Wang*, Rafah A. Hosn, Chunqiang Tang, The IEEE Int’l Conference on Cloud Computing (CLOUD), Honululu, Hawaii, USA, 2012.

23."Towards an understanding of oversubscription in cloud," Salman A. Baset, Long Wang*, Chunqiang Tang, 2nd USENIX Workshop on Hot Topics in Management of Internet, Cloud, and Enterprise Networks and Services (HotICE), San Jose, CA, USA, 2012.

24."Checkpointing Virtual Machines Against Transient Errors," Long Wang*, Zbigniew Kalbarczyk, Ravishankar K. Iyer, Arun Iyengar, Proc. Of Int’l On-Line Testing Symposium (IOLTS), Corfu Island, Greece, 2010.

25."Formalizing Operating System Behavior for Evaluating System Hang Detector,"Long Wang*, Zbigniew Kalbarczyk, Ravishankar K. Iyer, Proc. of Int'l Symp. on Reliable Distributed Systems (SRDS), Napoli, Italy, 2008.

26."Count&Check: Counting Instructions to Detect Incorrect Paths," Long Wang*, Ravishankar K. Iyer, the CATARS Workshop in The Int’l Conference on Dependable Systems and Networks (DSN), Anchorage, Alaska, USA, 2008.

27."A Model-based Simulation Approach to Error Analysis of IT Services,"Long Wang*, Akhil Sahai, James Pruyne, IFIP/IEEE International Symposium on Integrated Network Management (IM), Munich, Germany, 2007.

28."An OS-level Framework for Providing Application-Aware Reliability," Long Wang*, Zbigniew Kalbarczyk, Weining Gu, Ravishankar K. Iyer, Best Paper, IEEE Pacific Rim International Symposium on Dependable Computing (PRDC), Riverside, CA, USA, 2006.

29."A Self-checking and Reconfigurable Framework for Application Reliability Exploiting Execution Characteristics," Long Wang*, Zbigniew Kalbarczyk, Weining Gu, Ravishankar K. Iyer, The Int’l Conference on Dependable Systems and Networks (DSN), fast abstract, 2006, Philadelphia, PA, USA.

30."Modeling Coordinated Checkpointing for Large-Scale Supercomputers," Long Wang*, Karthik Pattabiraman, Zbigniew Kalbarczyk, Ravishankar K. Iyer, Lawrence Votta, Christopher Vick, Alan  Wood,The Int’l Conference on Dependable Systems and Networks (DSN), Yokohama, Japan, 2005.

31."Checkpointing of Control Structures in Main Memory Database Systems," Long Wang*, Zbigniew Kalbarczyk, Ravishankar K. Iyer, H. Vora, T. Chahande, The International Conference on Dependable Systems and Networks (DSN), Florence, Italy, 2004.

32."Group Communication Protocols under Errors," Claudio Basile, Long Wang, Zbigniew Kalbarczyk, Ravishankar K. Iyer, Proc. of Int'l Symp. on Reliable Distributed Systems (SRDS), Florence, Italy, 2003.