Accelerating NFS/CIFS to produce a tenfold performance improvement for Office/Database applications over the WAN/Grid

Lead Research Organisation: University of Kent
Department Name: Sch of Computing

Abstract

This proposal focuses on studying how applications can best take advantage of the increased transfer bandwidth. A model for end-to-end available bandwidth estimation will be established to close the growing gap between the utilized bandwidth and available bandwidth. In our lab, a core engine jetFS has been developed, which melds distributed file system technology with high performance data transfer techniques to meet the needs of WAN/Grid-based virtual organizations. In a real-world test, jetFS beats classic protocols by 2-25 times. The source code of the developed jetFS is of 40,000 lines in length and we have spent four plus man years in developing and revamping it within the context of an EPSRC/DTI project Grid-oriented Storage . This work has already been accepted for publication in IEEE Transactions on Computers. In this proposal, we will implement NFS/CIFS's alternative protocols (a-NFS/CIFS) by incorporating the developed multi-streamed jetFS engine to accelerate the file sharing traffic. With the accelerated and secure a-NFS/CIFS that remain fully compatible with the existing IT infrastructures, computer users can open and share remote files (such as vi, Emacs, MySQL, OpenOffice Writer/Impress/Calc, and MS Office Word/PowerPoint/Excel) on the Internet without having to install new software, upgrade to any new platform or change the way they work. The a-NFS/CIFS are believed to extend established Internet architectures and protocols to meet the immediate needs and are positioned to adapt to the future needs of Grid computing through the minor versioning provision of the existing data communication protocols. The resulting a-NFS/CIFS will be made available and supported as open-source under the GPL, strengthening the competitiveness of the UK-led open-source products in the global market.

Publications

10 25 50
publication icon
Chen H (2013) A Cost-Efficient and Reliable Resource Allocation Model Based on Cellular Automaton Entropy for Cloud Project Scheduling in International Journal of Advanced Computer Science and Applications

publication icon
Deng Y (2011) LAG: Achieving transparent access to legacy data by leveraging grid environment in Future Generation Computer Systems

publication icon
Wang F (2012) Delayed switching applied to memristor neural networks in Journal of Applied Physics

publication icon
Wang F (2013) A Triangular Periodic Table of Elementary Circuit Elements in IEEE Transactions on Circuits and Systems I: Regular Papers

publication icon
Wang FZ (2013) Adaptive Neuromorphic Architecture (ANA). in Neural networks : the official journal of the International Neural Network Society

publication icon
Zhang L (2015) Skewly replicating hot data to construct a power-efficient storage cluster in Journal of Network and Computer Applications

 
Description CloudJet has been developed to deal with long-distance, bulk data, cross-domain data operations - a capability lacking in the current Cloud solutions. The source code is of 20,000 lines in length.
Exploitation Route Our findings have been reported widely, eg at IEEE Big Data Conference in USA. Others might use our findings via open source.
Sectors Education

URL http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=6906835&tag=1
 
Description The GOS-FS has been used as an underlying engine in the EC-funded EuroAsiaGrid, the EC-funded QuickLinux and other testbeds.
First Year Of Impact 2013
Sector Education
Impact Types Cultural,Societal,Policy & public services

 
Description BT 
Organisation BT Group
Department BT Research
Country United Kingdom 
Sector Private 
PI Contribution CloudJet4BigData as a Big Data platform in Cloud environments.
Collaborator Contribution Fellowship by BT
Impact Frank Zhigang Wang, Theo Dimitrakos (BT), Na Helian, Sining Wu, Yuhui Deng, Ling Li, Rodric Yates, DIANA: Data Interface All-iN-A-place (DIANA) for Big Data, IEEE Big Data Science & Engineering, Beijing, 24-26 September, 2014
Start Year 2012
 
Description CISCO 
Organisation CISCO Systems
Country United States 
Sector Private 
PI Contribution CloudJet to be incorporated into OpenStack
Collaborator Contribution Elastic Networking
Impact A joint Lab & 20 students placed each year at CISCO, California.
Start Year 2010
 
Description IBM 
Organisation IBM
Department IBM UK Ltd
Country United Kingdom 
Sector Private 
PI Contribution CloudJet4BigData as a big data platform in the Cloud.
Collaborator Contribution Design, testing, real-world scenario, chairship of KITC (Kent IT Consultancy) to engage the School of Computing with local communities and Kent students with real-world projects to enrich their industrial experience.
Impact 1. Frank Zhigang Wang, Theo Dimitrakos, Na Helian, Sining Wu, Yuhui Deng, Ling Li, Rodric Yates (IBM), DIANA: Data Interface All-iN-A-place (DIANA) for Big Data, IEEE Big Data Science & Engineering, Beijing, 24-26 September, 2014 2. Frank Zhigang Wang, Sining Wu, Na Helian, Ling Li, Rodric Yates (IBM), "CloudJet4BigData: Streamlining Big Data via an accelerated socket interface", IEEE 3rd International Congress on Big Data, June 27 - July 2, 2014, Alaska, USA
Start Year 2013