M. Abadi, A. Agarwal, P. Barham, E. Brevdo, Z. Chen et al., TensorFlow: Large-scale machine learning on heterogeneous systems, 2015.

Traffic Surveillance: A Review of Vision Based Vehicle Detection, Recognition and Tracking, International Journal of Applied Engineering Research, vol.11, issue.1, pp.713-726, 2016.

YouTube-8M: A large-scale video classification benchmark, p.15, 2016.

K. Ali, D. Hasler, and F. Fleuret, FlowBoost - Appearance learning from sparsely annotated video, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.1433-1440, 2011.

M. Andriluka, S. Roth, and B. Schiele, People-tracking-by-detection and people-detection-by-tracking, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol.73, pp.1-8, 2008.

Y. Aytar and A. Zisserman, Tabula rasa: Model transfer for object category detection, 2011 International Conference on Computer Vision, vol.35, p.33, 2011.

Y. Aytar, Transfer learning for object category detection, p.29, 2014.

J. Badie and F. Bremond, Global tracker: an online evaluation framework to improve tracking quality, Advanced Video and Signal Based Surveillance (AVSS), p.65, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01062766

S.-H. Bae and K.-J. Yoon, Robust online multi-object tracking based on tracklet confidence and online discriminative appearance learning, Proceedings of the IEEE conference on computer vision and pattern recognition, vol.65, pp.1218-1225, 2014.

H. Bay, A. Ess, T. Tuytelaars, and L. Van Gool, Speeded-up robust features (SURF), vol.85, pp.346-359, 2008.

B. Benfold and I. Reid, Stable multi-target tracking in real-time surveillance video, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.3457-3464, 2011.

K. Bernardin and R. Stiefelhagen, Evaluating multiple object tracking performance: the CLEAR MOT metrics, EURASIP Journal on Image and Video Processing, vol.2008, issue.1, p.73, 2008.

J. Carreira and C. Sminchisescu, Constrained parametric min-cuts for automatic object segmentation, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.13, 2010.

V. Chari, S. Lacoste-julien, I. Laptev, and J. Sivic, On pairwise costs for network flow multi-object tracking, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, vol.74, p.66, 2015.

K. Chatfield, K. Simonyan, A. Vedaldi, and A. Zisserman, Return of the devil in the details: Delving deep into convolutional nets, vol.100, 2014.

H. Cheng, N. Zheng, and C. Sun, Boosted Gabor features applied to vehicle detection, ICPR, vol.85, pp.662-666, 2006.

D. Cheng, Y. Gong, S. Zhou, J. Wang, and N. Zheng, Person re-identification by multi-channel parts-based cnn with improved triplet loss function, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, p.66, 2016.

S. Chopra, R. Hadsell, and Y. Lecun, Learning a similarity metric discriminatively, with application to face verification, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol.1, p.66, 2005.

D. Clevert, T. Unterthiner, and S. Hochreiter, Fast and accurate deep network learning by exponential linear units (elus), p.22, 2015.

R. Collobert, S. Bengio, and J. Mariéthoz, Torch: a modular machine learning software library, Idiap, p.16, 2002.

M. Cordts, M. Omran, S. Ramos, T. Rehfeld, M. Enzweiler et al., The Cityscapes Dataset for Semantic Urban Scene Understanding, Proc. of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.15, 2016.

C. Cortes and V. Vapnik, Support-vector networks, Machine learning, vol.20, issue.3, pp.273-297, 1995.

I. J. Cox and S. L. Hingorani, An efficient implementation of Reid's multiple hypothesis tracking algorithm and its evaluation for the purpose of visual tracking, IEEE Transactions, vol.18, issue.2, p.66, 1996.

CS231n: Convolutional Neural Networks for Visual Recognition

W. Dai, Q. Yang, G. Xue, and Y. Yu, Boosting for transfer learning, International Conference on Machine learning (ICML), p.31, 2007.

W. Dai, Q. Yang, G. Xue, and Y. Yu, Self-taught clustering, International Conference on Machine learning (ICML), vol.30, pp.200-207, 2008.

N. Dalal and B. Triggs, Histograms of oriented gradients for human detection, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol.13, p.12, 2005.
URL : https://hal.archives-ouvertes.fr/inria-00548512

M. Danelljan, F. S. Khan, M. Felsberg, and J. van de Weijer, Adaptive color attributes for real-time visual tracking, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, p.66, 2014.

M. Danelljan, G. Häger, F. S. Khan, and M. Felsberg, Discriminative scale space tracking, IEEE transactions on pattern analysis and machine intelligence, vol.39, pp.1561-1575, 2017.

H. Daumé III and D. Marcu, Domain adaptation for statistical classifiers, Journal of Artificial Intelligence Research (JAIR), vol.26, pp.101-126, 2006.

A. Dehghan, Y. Tian, P. H. S. Torr, and M. Shah, Target identity-aware network flow for online multiple target tracking, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, p.66, 2015.

A. Dehghan and M. Shah, Binary Quadratic Programing for Online Tracking of Hundreds of People in Extremely Crowded Scenes, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.64, 2017.

J. Deng, W. Dong, R. Socher, L. Li, K. Li et al., Imagenet: A large-scale hierarchical image database, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.248-255, 2009.

J. J. DiCarlo and D. D. Cox, Untangling invariant object recognition, Trends in cognitive sciences, vol.11, issue.8, p.15, 2007.

J. Donahue, J. Hoffman, E. Rodner, K. Saenko, and T. Darrell, Semi-supervised domain adaptation with instance constraints, Conference on Computer Vision and Pattern Recognition (CVPR), p.35, 2013.

Y. Dorai, S. Gazzah, F. Chausse, and N. Amara, Tracking multi-object using tracklet and Faster R-CNN: PhD Forum, Proceedings of the 10th International Conference on Distributed Smart Camera, pp.222-223, 2016.

Y. Dorai, F. Chausse, S. Gazzah, and N. Amara, Multi Target Tracking by Linking Tracklets with a Convolutional Neural Network, VISIGRAPP (6: VISAPP), vol.74, pp.492-498, 2017.
URL : https://hal.archives-ouvertes.fr/hal-02121643

A. Doucet, N. de Freitas, and N. Gordon, Sequential Monte Carlo methods in practice, vol.90, 2001.

M. Douze, A. Ramisa, and C. Schmid, Combining attributes and fisher vectors for efficient image retrieval, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.35, 2011.
URL : https://hal.archives-ouvertes.fr/inria-00566293

I. Endres and D. Hoiem, Category independent object proposals, Computer Vision-ECCV 2010, p.13, 2010.

A. Ess, B. Leibe, K. Schindler, and L. Van-gool, A Mobile Vision System for Robust Multi-Person Tracking, IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08), p.73, 2008.

M. Everingham, L. Van Gool, C. K. I. Williams, J. Winn, and A. Zisserman, The PASCAL visual object classes (VOC) challenge, International journal of computer vision, vol.88, issue.2, pp.303-338, 2010.

L. Fei-fei, R. Fergus, and P. Perona, One-shot learning of object categories, Transactions on Pattern Analysis and Machine Intelligence (PAMI), vol.28, p.32, 2006.

P. F. Felzenszwalb and D. P. Huttenlocher, Efficient graph-based image segmentation, International journal of computer vision, vol.59, issue.2, p.13, 2004.

P. Felzenszwalb, R. Girshick, D. Mcallester, and D. Ramanan, Object detection with discriminatively trained part-based models, IEEE transactions on pattern analysis and machine intelligence, vol.32, pp.1627-1645, 2010.

V. Ferrari, F. Jurie, and C. Schmid, Accurate object detection with deformable shape models learnt from images, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.12, 2007.
URL : https://hal.archives-ouvertes.fr/hal-00203920

V. Ferrari, F. Jurie, and C. Schmid, From images to shape models for object detection, International journal of computer vision, vol.87, issue.3, p.12, 2010.
URL : https://hal.archives-ouvertes.fr/inria-00548643

J. Ferryman and A. Shahrokni, PETS2009: Dataset and challenge, Performance Evaluation of Tracking and Surveillance, Twelfth IEEE International Workshop on, vol.73, pp.1-6, 2009.

M. Fink, Object classification from a single example utilizing class relevance metrics, Advances in Neural Information Processing Systems (NIPS), vol.17, pp.449-456, 2005.

Y. Freund and R. E. Schapire, A desicion-theoretic generalization of on-line learning and an application to boosting, European conference on computational learning theory, p.12, 1995.

J. Gao, W. Fan, J. Jiang, and J. Han, Knowledge transfer via multiple model local structure mapping, ACM International Conference on Knowledge Discovery and Data Mining (ACM SIGKDD), p.33, 2008.

W. Gao, X. Zhang, L. Yang, and H. Liu, An improved Sobel edge detection, Computer Science and Information Technology (ICCSIT), vol.85, pp.67-71, 2010.

T. Gao, M. Stark, and D. Koller, What makes a good detector?-structured priors for learning from few examples, European Conference on Computer Vision (ECCV), vol.35, pp.354-367, 2012.

C. Garcia and M. Delakis, A neural architecture for fast and robust face detection, Proceedings of the 16th International Conference on Pattern Recognition, vol.2, p.13, 2002.

A. Geiger, P. Lenz, and R. Urtasun, Are we ready for autonomous driving? the kitti vision benchmark suite, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol.107, pp.3354-3361, 2012.

R. Girshick, J. Donahue, T. Darrell, and J. Malik, Rich feature hierarchies for accurate object detection and semantic segmentation, Proceedings of the IEEE conference on computer vision and pattern recognition, p.14, 2014.

R. Girshick, Fast r-cnn, Proceedings of the IEEE International Conference on Computer Vision, pp.1440-1448, 2015.

X. Glorot, A. Bordes, and Y. Bengio, Domain adaptation for large-scale sentiment classification: A deep learning approach, Proceedings of the 28th International Conference on Machine Learning (ICML-11), p.41, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00752091

I. J. Goodfellow, A. Courville, and Y. Bengio, Spike-and-slab sparse coding for unsupervised feature discovery, p.41, 2012.

J. Guerry, Reconnaissance visuelle robuste par reseaux de neurones dans des scenarios d'exploration robotique, p.22, 2017.
URL : https://hal.archives-ouvertes.fr/tel-01680372

I. Guyon, G. Dror, V. Lemaire, G. Taylor, and D. W. Aha, Unsupervised and transfer learning challenge, IJCNN, p.86, 2011.

M. Han, W. Xu, H. Tao, and Y. Gong, An algorithm for multiple object trajectory tracking, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol.1, p.66, 2004.

F. Han, Y. Shan, R. Cekander, H. S. Sawhney, and R. Kumar, A two-stage approach to people and vehicle detection with HOG-based SVM, PMIS, vol.85, pp.133-140, 2006.

K. He, X. Zhang, S. Ren, and J. Sun, Delving deep into rectifiers: Surpassing human-level performance on imagenet classification, Proceedings of the IEEE international conference on computer vision, p.22, 2015.

K. He, X. Zhang, S. Ren, and J. Sun, Deep residual learning for image recognition, Proceedings of the IEEE conference on computer vision and pattern recognition, pp.770-778, 2016.

K. He, G. Gkioxari, P. Dollár, and R. Girshick, Mask R-CNN, p.14, 2017.

E. Hoffer and N. Ailon, Deep metric learning using triplet network, International Workshop on Similarity-Based Pattern Recognition, p.66, 2015.

A. G. Howard, M. Zhu, B. Chen, D. Kalenichenko, W. Wang et al., MobileNets: Efficient convolutional neural networks for mobile vision applications, vol.100, 2017.

K. K. Htike and D. Hogg, Efficient non-iterative domain adaptation of pedestrian detectors to video scenes, 22nd International Conference on Pattern Recognition (ICPR), pp.654-659, 2014.

J. Hu, J. Lu, and Y. Tan, Deep Transfer Metric Learning, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.86, 2015.

C. Huang, H. Ai, Y. Li, and S. Lao, Vector boosting for rotation invariant multi-view face detection, ICCV, vol.85, pp.446-453, 2005.

J. Huang, A. J. Smola, A. Gretton, K. M. Borgwardt, and B. Schölkopf, Correcting sample selection bias by unlabeled data, Advances in neural information processing systems (NIPS), p.31, 2006.

G. B. Huang, H. Lee, and E. Learned-Miller, Learning hierarchical representations for face verification with convolutional deep belief networks, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol.86, pp.2518-2525, 2012.

C. Huang, C. C. Loy, and X. Tang, Local similarity-aware deep feature embedding, Advances in Neural Information Processing Systems, p.66, 2016.

J. Huang, V. Rathod, C. Sun, M. Zhu, A. Korattikara et al., Speed/accuracy trade-offs for modern convolutional object detectors, p.15, 2016.

S. Ioffe and C. Szegedy, Batch normalization: Accelerating deep network training by reducing internal covariate shift, International Conference on Machine Learning, p.22, 2015.

M. Jain, H. Jégou, and P. Gros, Asymmetric hamming embedding: taking the best of our bits for large scale image search, the 19th ACM International Conference on Multimedia (ICM), vol.30, pp.1441-1444, 2011.

Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long et al., Caffe: Convolutional architecture for fast feature embedding, ACM, pp.675-678, 2014.

J. Jiang and C. Zhai, Instance weighting for domain adaptation in NLP, 45th Annual Meeting of the Association for Computational Linguistics (ACL), vol.7, p.31, 2007.

X. Jin and C. H. Davis, Vehicle detection from high-resolution satellite imagery using morphological shared-weight neural networks, IVC, vol.85, pp.1422-1431, 2007.

E. Kalogerakis, M. Averkiou, S. Maji, and S. Chaudhuri, 3D Shape Segmentation With Projective Convolutional Networks, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.65, 2017.

H. Kataoka, K. Iwata, and Y. Satoh, Feature Evaluation of Deep Convolutional Neural Networks for Object Recognition and Detection, 2015.

C. Kim, F. Li, A. Ciptadi, and J. M. Rehg, Multiple hypothesis tracking revisited, Proceedings of the IEEE International Conference on Computer Vision, vol.65, pp.4696-4704, 2015.

T. Kong, A. Yao, Y. Chen, and F. Sun, Hypernet: Towards accurate region proposal generation and joint object detection, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, p.14, 2016.

A. Krizhevsky, I. Sutskever, and G. E. Hinton, ImageNet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems, vol.24, pp.1097-1105, 2012.

B. G. V. Kumar, G. Carneiro, and I. Reid, Learning local image descriptors with deep siamese and triplet convolutional networks by minimising global loss functions, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, p.66, 2016.

I. Kuzborskij, F. Orabona, and B. Caputo, From n to n+ 1: Multiclass transfer incremental learning, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.35, 2013.

L. Leal-Taixé, C. Canton-Ferrer, and K. Schindler, Learning by tracking: Siamese CNN for robust target association, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, vol.66, pp.33-40, 2016.

Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard et al., Backpropagation applied to handwritten zip code recognition, Neural computation, vol.1, issue.4, pp.541-551, 1989.

Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner, Gradient-based learning applied to document recognition, Proceedings of the IEEE, p.21, 1998.

B. J. Lee, J. B. Park, Y. H. Joo, and S. H. Jin, Intelligent Kalman filter for tracking a manoeuvring target, IEE Proceedings-Radar, Sonar and Navigation, vol.151, issue.6, pp.344-350, 2004.

A. Levin, P. Viola, and Y. Freund, Unsupervised Improvement of Visual Detectors using Co-Training, ICCV, vol.1, pp.626-633, 2003.

J. Li, X. Liang, S. Shen, T. Xu, J. Feng et al., Scale-aware fast R-CNN for pedestrian detection, vol.85, 2015.

X. Li, M. Ye, M. Fu, P. Xu, and T. Li, Domain adaption of vehicle detector based on convolutional neural networks, International Journal of Control, Automation and Systems, vol.13, issue.4, pp.1020-1031, 2015.

J. J. Lim, R. Salakhutdinov, and A. Torralba, Transfer learning by borrowing examples for multiclass object detection, Advances in neural information processing systems, vol.35, pp.118-126, 2011.

T. Lin, M. Maire, S. Belongie, J. Hays, P. Perona et al., Microsoft coco: Common objects in context, European Conference on Computer Vision, vol.15, pp.740-755, 2014.

W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed et al., Ssd: Single shot multibox detector, European conference on computer vision, pp.21-37, 2016.

J. Long, E. Shelhamer, and T. Darrell, Fully convolutional networks for semantic segmentation, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, vol.10, pp.3431-3440, 2015.

D. G. Lowe, Object recognition from local scale-invariant features, International Conference on Computer vision (ICCV), vol.2, pp.1150-1157, 1999.

H. Maâmatou, T. Chateau, S. Gazzah, Y. Goyat, and N. Essoukri Ben Amara, Sequential Monte Carlo filter based on multiple strategies for a scene specialization classifier, EURASIP Journal on Image and Video Processing, vol.2016, issue.1, p.40, 2016.

H. Maâmatou, T. Chateau, S. Gazzah, Y. Goyat, and N. Essoukri Ben Amara, Transductive Transfer Learning to Specialize a Generic Classifier Towards a Specific Scene, VISAPP, vol.40, pp.96-97, 2016.

H. Maâmatou, T. Chateau, S. Gazzah, Y. Goyat, and N. Amara, Transductive Transfer Learning to Specialize a Generic Classifier Towards a Specific Scene, International Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, vol.4, pp.411-422, 2016.

A. L. Maas, A. Y. Hannun, and A. Y. Ng, Rectifier nonlinearities improve neural network acoustic models, Proc. ICML, vol.30, p.22, 2013.

T. Malisiewicz, A. Gupta, and A. A. Efros, Ensemble of exemplar-svms for object detection and beyond, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, p.17, 2011.

Y. Mao and Z. Yin, Training a scene-specific pedestrian detector using tracklets, Applications of Computer Vision (WACV), 2015 IEEE Winter Conference on, vol.40, p.37, 2015.

D. Matti, H. K. Ekenel, and J.-P. Thiran, Combining LiDAR space clustering and convolutional neural networks for pedestrian detection, Advanced Video and Signal Based Surveillance (AVSS), vol.82, pp.1-6, 2017.

W. S. McCulloch and W. Pitts, A logical calculus of the ideas immanent in nervous activity, The bulletin of mathematical biophysics, vol.5, issue.4, p.15, 1943.

N. McLaughlin, J. Martinez Del Rincon, and P. Miller, Enhancing linear programming with motion modeling for multi-target tracking, Applications of Computer Vision (WACV), 2015 IEEE Winter Conference on, p.66, 2015.

X. Mei and H. Ling, Robust visual tracking and vehicle classification via sparse representation, IEEE transactions on pattern analysis and machine intelligence, vol.33, pp.2259-2272, 2011.

G. Mesnil, Y. Dauphin, X. Glorot, S. Rifai, Y. Bengio et al., Unsupervised and Transfer Learning Challenge: a Deep Learning Approach, ICML Unsupervised and Transfer Learning, p.41, 2012.

A. Mhalla, T. Chateau, S. Gazzah, and N. Amara, Scene-Specific Pedestrian Detector Using Monte Carlo Framework and Faster R-CNN Deep Model: PhD Forum, Proceedings of the 10th International Conference on Distributed Smart Camera, p.84, 2016.

A. Mhalla, H. Maâmatou, T. Chateau, S. Gazzah, and N. Amara, Faster R-CNN Scene Specialization with a Sequential Monte-Carlo Framework, International Conference on Digital Image Computing: Techniques and Applications (DICTA), pp.1-7, 2016.
URL : https://hal.archives-ouvertes.fr/hal-02077856

A. Mhalla, T. Chateau, H. Maamatou, S. Gazzah, and N. Amara, SMC faster R-CNN: Toward a scene-specialized multi-object detector, Computer Vision and Image Understanding, vol.164, pp.3-15, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01653430

A. Milan, K. Schindler, and S. Roth, Detection-and trajectory-level exclusion in multiple object tracking, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, vol.76, pp.3682-3689, 2013.

A. Milan, S. Roth, and K. Schindler, Continuous energy minimization for multitarget tracking, IEEE transactions on pattern analysis and machine intelligence, vol.36, pp.58-72, 2014.

A. Milan, S. H. Rezatofighi, A. R. Dick, I. D. Reid, and K. Schindler, Online Multi-Target Tracking Using Recurrent Neural Networks, AAAI, pp.4225-4232, 2017.

M. A. Naiel, M. O. Ahmad, M. N. S. Swamy, Y. Wu, and M. Yang, Online multi-person tracking via robust collaborative model, IEEE International Conference on Image Processing (ICIP), p.64, 2014.

V. Nair and J. J. Clark, An unsupervised, online learning framework for moving object detection, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol.2, 2004.

H. Nam and B. Han, Learning multi-domain convolutional neural networks for visual tracking, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, vol.65, pp.4293-4302, 2016.

D. Neelima and G. Mamidisetti, A Computer Vision Model for Vehicle Detection in Traffic Surveillance, International Journal of Engineering Science & Advanced Technology (IJESAT), vol.2, issue.5, pp.1203-1209, 2012.

H.-W. Ng, V. D. Nguyen, V. Vonikakis, and S. Winkler, Deep learning for emotion recognition on small datasets using transfer learning, Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, p.41, 2015.

H. Noh, S. Hong, and B. Han, Learning deconvolution network for semantic segmentation, Proceedings of the IEEE International Conference on Computer Vision, vol.10, pp.1520-1528, 2015.

E. Ohn-Bar, A. Tawari, S. Martin, and M. M. Trivedi, On surveillance for safety critical events: In-vehicle video networks for predictive driver assistance systems, Computer Vision and Image Understanding, vol.134, pp.130-140, 2015.

T. Ojala, M. Pietikäinen, and D. Harwood, A comparative study of texture measures with classification based on featured distributions, Pattern recognition, vol.29, issue.1, p.17, 1996.

M. Oquab, L. Bottou, I. Laptev, and J. Sivic, Learning and transferring mid-level image representations using convolutional neural networks, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.41, 2014.
URL : https://hal.archives-ouvertes.fr/hal-00911179

S. J. Pan, J. T. Kwok, and Q. Yang, Transfer Learning via Dimensionality Reduction, Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence (AAAI), vol.8, p.34, 2008.

S. J. Pan and Q. Yang, A survey on transfer learning, KDE, vol.29, p.28, 2010.

X. Pan, Y. Guo, and A. Men, Traffic surveillance system for vehicle flow detection, Computer Modeling and Simulation, 2010. ICCMS'10. Second International Conference on, vol.1, p.27, 2010.

S. J. Pan, I. W. Tsang, J. T. Kwok, and Q. Yang, Domain adaptation via transfer component analysis, IEEE Transactions on Neural Networks, vol.22, issue.2, pp.199-210, 2011.

B. Quanz, J. Huan, and M. Mishra, Knowledge transfer with low-quality data: A feature extraction issue, IEEE Transactions on Knowledge and Data Engineering, vol.24, issue.10, pp.1789-1802, 2012.

A. Quattoni, M. Collins, and T. Darrell, Transfer learning for image classification with sparse prototype representations, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.34, 2008.

R. Raina, A. Battle, H. Lee, B. Packer, and A. Y. Ng, Self-taught learning: transfer learning from unlabeled data, International conference on Machine learning (ICML), vol.33, pp.759-766, 2007.

J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, You only look once: Unified, real-time object detection, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, vol.27, pp.779-788, 2016.

J. Redmon and A. Farhadi, YOLO9000: better, faster, stronger, vol.82, 2016.

S. Ren, K. He, R. Girshick, and J. Sun, Faster R-CNN: Towards real-time object detection with region proposal networks, Advances in Neural Information Processing Systems (NIPS), vol.73, pp.91-99, 2015.

C. Rosenberg, M. Hebert, and H. Schneiderman, Semi-supervised self-training of object detection models, Seventh Workshop on Applications of Computer Vision (WACV), 2005.

S. Rota Bulò, G. Neuhold, and P. Kontschieder, Loss Max-Pooling for Semantic Image Segmentation, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.65, 2017.

K. Saenko, B. Kulis, M. Fritz, and T. Darrell, Adapting visual category models to new domains, European conference on computer vision (ECCV), p.34, 2010.

P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, R. Fergus et al., Overfeat: Integrated recognition, localization and detection using convolutional networks, 2013.

Z. Shen, Z. Liu, J. Li, Y. Jiang, Y. Chen et al., DSOD: Learning Deeply Supervised Object Detectors From Scratch, The IEEE International Conference on Computer Vision (ICCV), vol.64, 2017.

A. Shrivastava, A. Gupta, and R. Girshick, Training region-based object detectors with online hard example mining, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, p.27, 2016.

G. Shu, A. Dehghan, O. Oreifej, E. Hand, and M. Shah, Part-based multiple-person tracking with partial occlusion handling, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.64, 2012.

K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale image recognition, 2014.

S. Sivaraman and M. Trivedi, Looking at vehicles on the road: A survey of vision-based vehicle detection, tracking, and behavior analysis, IEEE Transactions on Intelligent Transportation Systems, vol.14, issue.4, pp.1773-1795, 2013.

I. Smal, W. Niessen, and E. Meijering, Advanced particle filtering for multiple object tracking in dynamic fluorescence microscopy images, BIFNM, vol.46, pp.1048-1051, 2007.

A. Smith, A. Doucet, N. De-freitas, and N. Gordon, Sequential monte carlo methods in practice, vol.46, 2013.

Z. Song, Q. Chen, Z. Huang, Y. Hua, and S. Yan, Contextualizing object detection and classification, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.35, 2011.

N. Srivastava, G. E. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, Dropout: a simple way to prevent neural networks from overfitting, Journal of machine learning research, vol.15, issue.1, p.22, 2014.

M. Stark, M. Goesele, and B. Schiele, A shape-based object class model for knowledge transfer, International Conference on Computer Vision (ICCV), vol.33, pp.373-380, 2009.

M. Sugiyama, S. Nakajima, H. Kashima, P. von Bünau, and M. Kawanabe, Direct importance estimation with model selection and its application to covariate shift adaptation, Advances in neural information processing systems (NIPS), p.31, 2008.

A. Suleiman, Z. Zhang, and V. Sze, A 58.6 mW 30 Frames/s Real-Time Programmable Multiobject Detection Accelerator With Deformable Parts Models on Full HD 1920 × 1080 Videos, IEEE Journal of Solid-State Circuits, vol.52, 2017.

Y. Sun, Y. Chen, X. Wang, and X. Tang, Deep learning face representation by joint identification-verification, Advances in neural information processing systems, p.66, 2014.

C. Szegedy, A. Toshev, and D. Erhan, Deep neural networks for object detection, Advances in neural information processing systems, vol.13, pp.2553-2561, 2013.

C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed et al., Going deeper with convolutions, Proceedings of the IEEE conference on computer vision and pattern recognition, p.65, 2015.

Y. Taigman, M. Yang, M. A. Ranzato, and L. Wolf, DeepFace: Closing the gap to human-level performance in face verification, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol.66, pp.1701-1708, 2014.

K. Tang, V. Ramanathan, L. Fei-fei, and D. Koller, Shifting weights: Adapting object detectors from image to video, Advances in Neural Information Processing Systems (NIPS), p.36, 2012.

S. Tang, M. Andriluka, B. Andres, and B. Schiele, Multiple People Tracking by Lifted Multicut and Person Re-identification, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, vol.65, pp.3539-3548, 2017.

R. Tao, E. Gavves, and A. W. M. Smeulders, Siamese instance search for tracking, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.66, 2016.

Y. Tian, P. Luo, X. Wang, and X. Tang, Pedestrian detection aided by deep learning semantic tasks, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, vol.85, pp.5079-5087, 2015.

T. Tommasi, Learning to learn by exploiting prior knowledge, vol.35, 2013.

A. Torralba, K. Murphy, and W. Freeman, Sharing features: efficient boosting procedures for multiclass object detection, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol.2, p.13, 2004.

A. Torralba, K. P. Murphy, and W. T. Freeman, Sharing visual features for multiclass and multiview object detection, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.29, issue.5, p.13, 2007.

J. R. R. Uijlings, K. E. A. van de Sande, T. Gevers, and A. W. M. Smeulders, Selective search for object recognition, International journal of computer vision (IJCV), vol.104, issue.2, pp.154-171, 2013.

J. Valmadre, L. Bertinetto, J. Henriques, A. Vedaldi, and P. H. S. Torr, End-To-End Representation Learning for Correlation Filter Based Tracking, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.65, 2017.

J. Vermaak, S. J. Godsill, and P. Pérez, Monte Carlo filtering for multi target tracking and data association, IEEE Transactions on Aerospace and Electronic systems, vol.41, issue.1, p.65, 2005.

P. Viola and M. Jones, Rapid object detection using a boosted cascade of simple features, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages I-I, p.12, 2001.

P. Viola and M. Jones, Robust real-time object detection, International Journal of Computer Vision (IJCV), vol.4, pp.51-52, 2001.

Z. Wang, Y. Song, and C. Zhang, Transferred dimensionality reduction, vol.34, pp.550-565, 2008.

X. Wang, X. Ma, and W. E. L. Grimson, Unsupervised activity perception in crowded and complicated scenes using hierarchical Bayesian models, p.53, 2009.

G. Wang, D. Forsyth, and D. Hoiem, Comparative object similarity for improved recognition with few or no examples, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.32, 2010.

M. Wang and X. Wang, Automatic adaptation of a generic pedestrian detector to a specific traffic scene, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol.36, pp.3401-3408, 2011.

M. Wang, W. Li, and X. Wang, Transferring a generic pedestrian detector towards specific scenes, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.3274-3281, 2012.

X. Wang, G. Hua, and T. X. Han, Detection by detections: Non-parametric detector adaptation for a video, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.36, 2012.

J. Wang, Y. Song, T. Leung, C. Rosenberg, J. Wang et al., Learning fine-grained image similarity with deep ranking, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.66, 2014.

X. Wang, M. Wang, and W. Li, Scene-specific pedestrian detection for static video surveillance, pp.361-362, 2014.

X. Wang, E. Türetken, F. Fleuret, and P. Fua, Tracking interacting objects optimally using integer programming, European Conference on Computer Vision, vol.65, pp.17-32, 2014.

B. Wang, L. Wang, B. Shuai, Z. Zuo, T. Liu et al., Joint learning of convolutional neural networks and temporally constrained metrics for tracklet association, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp.1-8, 2016.

L. Wang, W. Ouyang, X. Wang, and H. Lu, STCT: Sequentially training convolutional networks for visual tracking, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.1373-1381, 2016.

M. Wang, Y. Liu, and Z. Huang, Large Margin Object Tracking With Circulant Feature Maps, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017.

W. Zheng, H. Chang, L. Liang et al., Strip Features for Fast Object Detection, vol.85, 2013.

B. Wu and R. Nevatia, Cluster boosted tree classifier for multi-view, multi-pose object detection, ICCV, vol.85, pp.1-8, 2007.

Z. Wu, J. Zhang, and M. Betke, Online Motion Agreement Tracking, BMVC, p.65, 2013.

Y. Xiang, R. Mottaghi, and S. Savarese, Beyond PASCAL: A benchmark for 3D object detection in the wild, Applications of Computer Vision (WACV), p.107, 2014.

Y. Xiang, W. Choi, Y. Lin, and S. Savarese, Subcategory-aware convolutional neural networks for object proposals and detection, Applications of Computer Vision (WACV), 2017 IEEE Winter Conference on, p.14, 2017.

H. Xie, Q. Wu, B. Chen, Y. Chen, and S. Hong, Vehicle Detection in Open Parks Using a Convolutional Neural Network, Sixth International Conference on Intelligent Systems Design and Engineering Applications (ISDEA), p.37, 2015.

G. Xue, W. Dai, Q. Yang, and Y. Yu, Topic-bridged PLSA for cross-domain text classification, the 31st annual international ACM SIGIR conference on Research and Development in Information Retrieval (RDIR), p.34, 2008.

J. Yang, R. Yan, and A. G. Hauptmann, Adapting SVM classifiers to data with shifted distributions, Seventh International Conference on Data Mining Workshops (ICDMW), p.33, 2007.

F. Yang, W. Choi, and Y. Lin, Exploit all the layers: Fast and accurate CNN object detector with scale dependent pooling and cascaded rejection classifiers, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, p.14, 2016.

B. Yao, X. Jiang, A. Khosla, A. L. Lin, L. Guibas et al., Human action recognition by learning bases of action attributes and parts, International Conference on Computer Vision (ICCV), p.34, 2011.

Q. Ye, T. Zhang, W. Ke, Q. Qiu, J. Chen et al., Self-Learning Scene-Specific Pedestrian Detectors Using a Progressive Latent Model, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol.85, 2017.

H. Yoo, K. Kim, M. Byeon, Y. Jeon, and J. Y. Choi, Online Scheme for Multiple Camera Multiple Target Tracking Based on Multiple Hypothesis Tracking, IEEE Transactions on Circuits and Systems for Video Technology, vol.27, p.65, 2017.

M. D. Zeiler and R. Fergus, Visualizing and understanding convolutional networks, European Conference on Computer Vision, vol.100, pp.818-833, 2014.

X. Zeng, W. Ouyang, M. Wang, and X. Wang, Deep learning of scene-specific classifier for pedestrian detection, ECCV, vol.55, p.41, 2014.

C. Zhang, R. Hamid, and Z. Zhang, Taylor expansion based classifier adaptation: Application to person detection, Computer Vision and Pattern Recognition, p.35, 2008.

X. Zhang, F. Zhou, Y. Lin, and S. Zhang, Embedding label structures for fine-grained feature representation, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, p.66, 2016.

Z. Zivkovic and F. van der Heijden, Efficient adaptive density estimation per image pixel for the task of background subtraction, Pattern Recognition Letters, vol.27, issue.7, p.49, 2006.

A. Zweig and D. Weinshall, Exploiting object hierarchy: Combining models from different category levels, International Conference on Computer Vision (ICCV), vol.32, pp.1-8, 2007.