Recently, the research and development group of the author needs to support high-performance HTTP upload of large files and HTTP breakpoint continuation. Here is a brief summary, convenient memory:
The server is realized by C language instead of Java and PHP. The server writes to the hard disk immediately, so there is no need to call move_UPLOaded_file, InputStreamReader and other caching technologies to avoid server memory usage and browser request timeouts. HTML5 and IFRAME support (for older browsers), and access to file upload progress. In order to better adapt to the current mobile Internet, the upload service is required to support breakpoint continuation and re-connection. Because the mobile Internet is not very stable; In addition, the possibility of an abnormal drop when uploading a large file is very high. In order to avoid re-uploading, it is necessary to support resumable breakpoints.
To support breakpoint continuation, run the following command:
The client (usually a browser) uploads a file to the server and records the upload progress continuously. If the connection goes offline or other exceptions occur, the client can query the uploaded status of a file to the server and upload the file from the position where the file was uploaded last time.
There are also masters on the Internet to upload large files in the fragmented file upload mode. The method is to cut the files into small pieces, such as a fragment of 4MB. The server receives a small piece of the file each time and saves it as a temporary file, and then merges it after all the fragments are transferred. In my opinion, this works if the original file is small enough, but once the file is hundreds of megabytes or a few gigabytes or tens of gigabytes, merging the file takes a very long time, often resulting in browser response times out or server blocking.
If you implement a separate client (or a browser’s ActiveX plug-in) to upload files yourself, supporting resumable breakpoints is a simple matter of logging file upload status on the client side. Supporting browser breakpoint uploads (without the need to install third-party plug-ins) is generally more difficult than doing your own standalone client uploads, but not too difficult. My implementation ideas are as follows:
1. When the browser uploads a file, it generates a HASH value for the file, which must be generated on the browser.
File upload records cannot be queried only based on file names. File names have great repeatability, and the value of file names + file sizes decreases repeatability. If the file modification time is added, the repeatability decreases further. The best method to calculate the HASH value is to use the content of the file for MD5 calculation, but the calculation is very large (in fact, there is no need to do this), too much time will affect the upload experience.
Based on the above reasons, my HASH calculation idea is as follows:
First, give the browser an ID, which is stored in a Cookie. The browser ID+ file modification time + file name + file size results are MD5 to calculate the HASH value of a file. The browser ID is automatically assigned to the browser when the browser accesses the file upload site.
Function setCookie(cname,cvalue,exdays) {var d = new Date(); d.setTime(d.getTime()+(exdays*24*60*60*1000)); var expires = "expires="+d.toGMTString(); document.cookie = cname + "=" + cvalue + "; " + expires; } function getCookie(cname) { var name = cname + "="; var ca = document.cookie.split('; '); for(var i=0; i<ca.length; i++) { var c = ca[i].trim(); if (c.indexOf(name)==0) return c.substring(name.length,c.length); } return ""; } // // simple file HASH calculation, if you are not very sophisticated, should be able to use the product. // Since multiple data are used to compute file HASH values, the possibility of HASH collisions within the HYFileUploader system should be very small and should be safe to use. // The file ID can be obtained using any algorithm, as long as the same file ID is guaranteed to be the same. // function getFileId (file) {// Give the browser a unique ID to distinguish between different browser instances (different machines or browsers from different manufacturers on the same machine) var clientid = getCookie("HUAYIUPLOAD"); If (clientid == "") {var rand = parseInt(math.random () * 1000); var t = (new Date()).getTime(); clientid =rand+'T'+t; setCookie("HUAYIUPLOAD",clientid,365); } var info = clientid; if (file.lastModified) info += file.lastModified; if (file.name) info += file.name; if (file.size) info += file.size; / / https://cdn.bootcss.com/blueimp-md5/2.10.0/js/md5.min.js var count = md5 (info); return fileid; }Copy the code
In my opinion, you don’t need to read the contents of the file to compute the HASH value, which would be very slow. If you really need to implement HTTP second uploads, you might want to do so so that if different people upload the same file content, you can avoid repeated uploads and simply return the result.
The reason for assigning an ID to the browser is to further avoid HASH conflicts for files of the same name and size on other computers.
2. Query the HASH value of the file
For file uploading, query the upload progress information from the upload server using the HASH value of the file, and upload the file from the upload progress position. The code is as follows:
var fileObj = currentfile; var fileid = getFileId(fileObj); var t = (new Date()).getTime(); Var URL = resume_info_URL + '? Var URL = resume_info_URL + '? fileid='+fileid + '&t='+t; var ajax = new XMLHttpRequest(); ajax.onreadystatechange = function () { if(this.readyState == 4){ if (this.status == 200){ var response = this.responseText; var result = JSON.parse(response); if (! Result) {alert(' server returned incorrect data, possibly incompatible server '); return; } var uploadedBytes = result.file && result.file. Size; if (! result.file.finished && uploadedBytes < fileObj.size) { upload_file(fileObj,uploadedBytes,fileid); ShowUploadedFile (result.file);} else {showUploadedFile(result.file); Var progressBar = document.getelementById (' progressBar '); //progressBar.value = 100; }}else {alert(' Failed to get file breakpoint message '); } } } ajax.open('get',url,true); ajax.send(null);Copy the code
The above is implemented through jquery-file-upload component. For the implementation code through raw Javascript, please refer to the h4Resume.html sample code in demos directory.
If the file has been uploaded before, the server will return the size of the file that has been uploaded. Then we can start uploading data from the size of the file that has been uploaded.
Html5’s Slice of a File object can be used to slice from a File for upload.
Definitions and Usage
The slice() method extracts a portion of a word file and returns the extracted portion as a new string.
grammar
File.slice(start,end)
Parameters to describe
Start Indicates the starting index of the fragment to be extracted. If it is negative, this parameter specifies the position from the end of the string. That is, -1 is the last character of the string, -2 is the second-to-last character, and so on.
End Is the index immediately following the end of the fragment to be extracted. If this parameter is not specified, the substring to be extracted includes the string from start to the end of the original string.
If the argument is negative, it specifies the position from the end of the string.
The code for uploading fragmented files is as follows:
/* fileObj: html5 File object start_offset: the starting position of the uploaded data relative to the File header fileid: */ function upload_file(fileObj,start_offset,fileid) {var XHR = new XMLHttpRequest(); */ function upload_file(fileObj,start_offset,fileid) { var formData = new FormData(); var blobfile; if(start_offset >= fileObj.size){ return false; } var bitrateDiv = document.getElementById("bitrate"); var finishDiv = document.getElementById("finish"); var progressBar = document.getElementById('progressbar'); var progressDiv = document.getElementById('percent-label'); var oldTimestamp = 0; var oldLoadsize = 0; var totalFilesize = fileObj.size; if (totalFilesize == 0) return; var uploadProgress = function (evt) { if (evt.lengthComputable) { var uploadedSize = evt.loaded + start_offset; var percentComplete = Math.round(uploadedSize * 100 / totalFilesize); var timestamp = (new Date()).valueOf(); var isFinish = evt.loaded == evt.total; if (timestamp > oldTimestamp || isFinish) { var duration = timestamp - oldTimestamp; if (duration > 500 || isFinish) { var size = evt.loaded - oldLoadsize; var bitrate = (size * 8 / duration /1024) * 1000; //kbps if (bitrate > 1000) bitrate = Math.round(bitrate / 1000) + 'Mbps'; else bitrate = Math.round(bitrate) + 'Kbps'; var finish = evt.loaded + start_offset; if (finish > 1048576) finish = (Math.round(finish / (1048576/100)) / 100).toString() + 'MB'; else finish = (Math.round(finish / (1024/100) ) / 100).toString() + 'KB'; progressBar.value = percentComplete; progressDiv.innerHTML = percentComplete.toString() + '%'; bitrateDiv.innerHTML = bitrate; finishDiv.innerHTML = finish; oldTimestamp = timestamp; oldLoadsize = evt.loaded; } } } else { progressDiv.innerHTML = 'N/A'; } } xhr.onreadystatechange = function(){ if ( xhr.readyState == 4 && xhr.status == 200 ) { console.log( xhr.responseText ); } else if (xhr.status == 400) { } }; var uploadComplete = function (evt) { progressDiv.innerHTML = '100%'; var result = JSON.parse(evt.target.responseText); if (result.result == 'success') { showUploadedFile(result.files[0]); } else { alert(result.msg); }} var uploadFailed = function (evt) {alert(" uploadFailed! ); } var uploadCanceled = function (evt) {alert(" The upload was canceled or the browser was disconnected!" ); //xhr.timeout = 20000; //xhr.timeout = 20000; //xhr.ontimeout = function(event){// alert(' File upload time is too long, server did not respond within the specified time! '); //} xhr.overrideMimeType("application/octet-stream"); var filesize = fileObj.size; var blob = fileObj.slice(start_offset,filesize); var fileOfBlob = new File([blob], fileObj.name); Formdata.append ('filename', fileobj.name); formData.append('filename', fileobj.name); Formdata.append (' fileID ', fileid); formData.append(' fileID ', fileid); Formdata.append ("file",blob, fileobj.name); // FormData.append ("file",blob, fileobj.name); formData.append('file', fileOfBlob); xhr.upload.addEventListener("progress", uploadProgress, false); xhr.addEventListener("load", uploadComplete, false); xhr.addEventListener("error", uploadFailed, false); xhr.addEventListener("abort", uploadCanceled, false); xhr.open('POST', upload_file_url); // xhr.send(formData); }Copy the code
In order to verify file breakpoint continuation, the author made a simple interface to display the status information in the process of file uploading. The interface is as follows:
Using HTML, you can calculate the upload progress, uploaded size, and uploaded bit rate of a file. If any exception occurs during the upload, you can upload the file again. The uploaded part does not need to be uploaded again.
To verify HTML5 breakpoint continuations, you can download the file from github to the upload server to test it.