地球信息科学学报 2016 , 18 (11): 1476-1784 https://doi.org/10.3724/SP.J.1047.2016.01476

特约稿件

基于众包数据的室内定位方法和平台

黄正宇¹^,²^,³^,, 陈益强¹^,²^,^3*^,^*^,, 刘军发²^,³, 蒋鑫龙²^,³^,⁴, 胡春雨²^,³^,⁴

1. 湘潭大学信息工程学院,湘潭 411105
2. 中国科学院计算技术研究所,北京 100190
3. 北京市移动计算与新型终端重点实验室,北京 100190
4. 中国科学院大学,北京 100190

Indoor Localization Method and Platform Based on Crowdsourcing Data

HUANG Zhengyu¹^,²^,³^,, CHEN Yiqiang¹^,²^,³^,^*^,, LIU Junfa²^,³, JIANG Xinlong²^,³^,⁴, HU Chunyu²^,³^,⁴

1. College of Information Engineering, Xiangtan University, Xiangtan Hunan 411105, China
2. Institute of Computing Technology Chinese Academy of Sciences, Beijing 100190, China
3.Beijing Key Laboratory of Mobile Computing and Pervasive Device, Beijing 100190, China
4. University of Chinese Academy of Sciences, Beijing 100190, China

通讯作者: *Corresponding author: CHEN Yiqiang, E-mail: yqchen@ict.ac.cn

收稿日期: 2016-07-25

修回日期: 2016-09-29

网络出版日期: 2016-11-20

基金资助: 国家自然科学基金项目(61572004、61472399)中国科学院科研装备研制项目“面向可穿戴行为感知的精准模型测试仪研制”(YZ201527)

作者简介:

作者简介：黄正宇(1991-),男,湖南岳阳人,硕士生,主要从事室内定位和信息检索方面研究。E-mail: huangzhengyu@ict.ac.cn

展开

摘要

随着WLAN的普及,基于Wi-Fi的室内定位方法逐渐成为研究与应用的热点。虽然,其中基于位置指纹的定位算法研究相对广泛,应用效果较好,然而现有的指纹定位方法或系统仍存在以下3个问题：① 离线阶段的数据标定和定位模型的训练需要耗费大量人力物力,以及时间消耗,使系统很难得到实际应用;② 真实环境中WLAN信号波动呈现高动态性,采集的数据存在显著的时效性,无法提供长时间的有效定位保证;③ 实际环境中AP设备变动频繁,导致训练数据与定位数据特征维度不等长,造成模型失效。针对上述问题,本文提出了一种基于众包数据的模型更新方法,通过不断融合增量数据,使定位模型保持实时有效。该方法主要包括半监督极速学习机(SELM)、具有时效机制的增量式定位方法(TMELM)和特征自适应的在线极速学习机(FA-OSELM)3部分。基于上述方法,本文设计并实现了基于众包数据的室内定位平台系统。实际应用表明,本文提出的方法能够显著降低模型训练阶段的数据采集工作量,有效提升模型训练速度,并且长时间保持较高的定位精度。

关键词： 室内定位 ; 增量学习 ; 极速学习机 ; 特征自适应 ; 时效机制 ; 众包数据

Abstract

As WLAN getting more and more popular and pervasive, Wi-Fi based indoor localization is becoming a hot issue in research and application fields. Among various kinds of up-to-date indoor localization methods, fingerprint based methods are most widely used because of the good performance. However, the existing fingerprint based methods still have following three common problems: Firstly, fingerprint based methods require a vast amount of calibration work, which need huge human and time consumption both in offline and online phases. It makes the systems difficult to be applied in the practical applications. Secondly, the Wi-Fi signals in the environment change frequently, bringing the significant timeliness in collected data. So it cannot guarantee to provide a long term effective localization. Thirdly, the Wi-Fi access points change frequently in real scene. Thus, the feature dimensions of training data and testing data are unequal. The traditional algorithms cannot well handle the feature dimension changing problem caused by increase or decrease in APs’ number. To solve these problems mentioned above, we proposed a crowdsourcing based indoor localization method, including Semi-supervised ELM, Timeliness Managing ELM and Feature Adaptive Online Sequential ELM. We also developed an indoor localization platform. Applications show that our method can reduce human effort in data calibration and improve the model training speed. Moreover, our method can maintain the high location accuracy for a long time.

Keywords： indoor localization ; incremental learning ; extreme learning machine ; feature adaptive ; timeline managing ; crowdsourcing data

PDF (6155KB) 元数据多维度评价相关文章收藏文章

本文引用格式导出 EndNote Ris Bibtex

黄正宇, 陈益强, 刘军发, 蒋鑫龙, 胡春雨. 基于众包数据的室内定位方法和平台[J]. , 2016, 18(11): 1476-1784 https://doi.org/10.3724/SP.J.1047.2016.01476

HUANG Zhengyu, CHEN Yiqiang, LIU Junfa, JIANG Xinlong, HU Chunyu. Indoor Localization Method and Platform Based on Crowdsourcing Data[J]. 地球信息科学学报, 2016, 18(11): 1476-1784 https://doi.org/10.3724/SP.J.1047.2016.01476

1 引言

基于位置的服务(Location Based Services, LBS)已逐渐成为移动互联网的研究内容热点,基于LBS的应用也随着移动互联网的发展而越来越丰富。近年来,许多定位解决方案不断涌现,如GPS定位、基站定位、地磁定位、超声波定位、射频(RFID)定位、Wi-Fi定位以及基于iBeacon技术的蓝牙定位解决方案等。然而上述定位解决方案均存在其局限性：GPS由于易受到建筑或其他障碍物的阻挡,只适用于室外环境;超声波易受物体遮挡,存在传输受限问题;基于射频的定位方法探测范围较小;iBeacon信号覆盖范围小,需大面积布设,定位局限性较大,成本较高;地磁信号易受电磁干扰而难以精准采集。Wi-Fi具有覆盖面积广、抗干扰能力强、信号稳定以及数据传输速度快和质量高等特点。此外随着智慧城市建设的逐步推进,具备无线通信模块的智能终端逐年普及,日常环境中Wi-Fi信号日渐丰富,无线接入点(Access Point, AP)越来越多,这些都为基于Wi-Fi定位方法的普及实施提供了有利条件。

2 基于Wi-Fi的定位方法

目前,基于Wi-Fi的定位方法主要包含以下4类：基于信号到达时间TOA(Time of Arrival)的定位方法;基于信号到达时间差TDOA(Time difference of Arrival)的定位方法;基于信号到达角度AOA(Angle of Arrive)的定位方法和基于信号强度RSS(Received Signal Strength)的定位方法。由于室内环境复杂,Wi-Fi信号易受障碍物反射,存在严重的多径效应,基于AOA的方法定位精度较低。此外,由于时间精准控制在近距离情况下难以实现,基于TOA和TDOA等定位方法误差过大,实用性差。基于RSS的定位方法因为信号检测装置简单、信号稳定性好以及定位方法简易等特点而受到广泛关注。基于RSS的Wi-Fi室内定位方法又称为位置指纹算法^[1],该方法包括离线训练阶段和在线预测阶段,如图1所示。

显示原图| 下载原图ZIP| 生成PPT

图1 指纹模型定位方法框架

Fig. 1 The fingerprint model localization method framework

离线阶段的任务就是大量采集定位区域中各个参考点(Reference Point, RP)的标定指纹数据,然后利用机器学习方法挖掘Wi-Fi信号向量 $\begin{matrix} X \end{matrix}$ 与真实物理位置 $\begin{matrix} Y \end{matrix}$ 之间的映射关系 $\begin{matrix} Y = f (X) \end{matrix}$ ;最终在线阶段针对输入的Wi-Fi信号向量 $\begin{matrix} X^{*} \end{matrix}$ 利用模型估计出当前位置 $\begin{matrix} Y^{*} \end{matrix}$ 。已有的研究通过不同的方法对模型精度进行提升,马里兰大学的HORUS^[2]定位系统通过建立信号空间并引入概率模型,同时采用联合分簇法和三角测量2种方式进行定位。Sjoberg等^[3]利用SVM结合计算机视觉技术取得了较好的定位效果。Wu等^[4]首次引入信道状态信息(Channel State Information, CSI),通过提取Wi-Fi信号中的稳定信道作为特征来进行定位,取得了良好的效果。Sen等^[5]也通过细粒度的CSI特征实现了波长级别的定位效果,同时降低了系统功耗。在高动态信号的优化方面,Fang等^[6]通过主成份分析(Principal Component Analysis, PCA)来提取Wi-Fi信号中最主要的成分作为特征训练定位模型,最终使平均定位误差降低了33.75%。Zheng等^[7]通过迁移学习来处理静态Radio Map在不同时间段的适用性,提出基于迁移的隐马尔科夫模型来动态调整Radio Map。Sun等^[8]将源数据和部分标定数据进行结合,并通过流形对齐(Manifold Alignment)来处理时间和设备的差异,从而保证了定位的精度。

然而,为了保证定位精度,基于位置指纹的定位方法需要实时更新定位模型,而定位模型的更新需要不断融入新的样本,进而产生了以下3个问题：

（1）样本数据集采集工作量巨大。位置指纹定位算法为保证一定的预测精确度而依赖于大量的训练样本,当样本数据较少时,训练得到的模型无法满足定位精度需求。因此为保证模型预测达到较高的精度,需要耗费大量的人力和物力采集标定数据。在定位系统规模扩大时,大批量的指纹采集将成为一项十分繁重的工作,而采集得到的大量样本也给模型训练和实时预测带来困难。当模型无法在有限时间内快速完成数据训练和预测时,用户体验将会显著下降。

（2）无线信号波动较大,具有明显的时效性特点。基于指纹的定位方法要求用于模型训练的样本数据和在线实时位置估计的测试数据分布具有一致性,当训练数据和测试数据不满足同一分布时,模型的预测精度会出现很大程度的下降。然而在真实环境中,受到时间、空气温湿度、室内环境布局等因素变动的影响,Wi-Fi信号会出现显著的波动,呈现高动态特性。不同因素条件下,扫描终端接收到的AP信号强度和AP个数都会出现明显的变化,从而导致基于前一时间段采集的训练数据得到的预测模型在一段时间后的在线预测阶段适应程度大幅下降。

（3）环境AP呈现高动态性,学习样本和定位样本特征维度不等长。室内环境下Wi-Fi呈现高动态性,一方面体现在Wi-Fi信号强度值的高动态变化,另一方面体现在AP的数量上。因为AP的基本功能是提供智能终端到Internet的桥接,真实环境中部分AP会因为某些原因被移除,新的AP也有可能会部署到环境中。当部分已有AP被移除,信号强度值检测不到或环境中出现新增AP时,均会使特征维度发生变化,即特征维度不等长。传统的指纹定位模型无法对这类情况进行有效处理,只能选择重新训练模型或者忽略新的AP。

针对位置指纹算法在更新模型时遇到的上述问题,本文提出了基于众包(Crowdsourcing)的室内定位方法,即大量引入来自用户的众包增量数据,采用半监督极速学习机(SELM)、具有时效机制的增量式定位方法(TMELM)和特征自适应的在线极速学习机(FA-OSELM),分别从非标定样本应用于模型学习、训练样本时效性以及模型对具有不同特征维度样本的自适应3个角度对问题予以解决,并实现了基于众包数据的室内定位平台系统,取得了较好的应用效果。

3 基于众包数据的室内定位方法

本文所引入的半监督极速学习机(SELM)^[9]、具有时效机制的增量式定位方法(TMELM)^[10]和特征自适应的在线极速学习机(FA-OSELM)^[11]均基于极速学习机(Extreme Learning Machine, ELM),本节首先对ELM模型进行简要介绍,接着对SELM、TMELM、FA-OSELM算法模型进行阐述。

3.1 极速学习机(ELM)

ELM(Extreme Learning Machine)^[12-14]属于人工神经网络范畴,是一种单隐层前馈神经网络(Single Hidden Layer Feedforward Neural Network, SLFN)。,该神经网络训练时间短,网络结构较为简洁（图2）。对于一个输入向量 $\begin{matrix} x \in R^{n} \end{matrix}$ 和具有 $\begin{matrix} L \end{matrix}$ 个隐层节点的SLFN,其输出形式定义如式（1）所示。

$\begin{matrix} f_{L} (x) = \overset{L}{\sum_{i = 1}} β_{i} G (a_{i}, b_{i}, x) a_{i} \in R^{L}, b_{i} \in R, β_{i} \in R^{m} \end{matrix}$ （1）

式中： $\begin{matrix} a_{i} = [a_{i 1}, a_{i 2}, \dots, a_{in}]^{T} \end{matrix}$ 表示连接输入向量与第 $\begin{matrix} i \end{matrix}$ 个隐层节点的权值; $\begin{matrix} b_{i} \end{matrix}$ 表示第 $\begin{matrix} i \end{matrix}$ 个隐层节点的偏置参数; $\begin{matrix} G (a_{i}, b_{i}, x) \end{matrix}$ 是第 $\begin{matrix} i \end{matrix}$ 个隐层节点的输出; $\begin{matrix} β_{i} = \end{matrix} \begin{matrix} [β_{i 1}, β_{i 2}, \dots β_{im}]^{T} \end{matrix}$ 表示连接第 $\begin{matrix} i \end{matrix}$ 个隐层节点与输出节点的权值。

显示原图| 下载原图ZIP| 生成PPT

图2 ELM网络结构(单隐层前馈神经网络SLFN)

Fig. 2 Structure of ELM(Single-hidden layerfeedforward neural networks)

假设 $\begin{matrix} g (x) \end{matrix}$ 为隐层节点的激励函数,则：

$\begin{matrix} G (a_{i}, b_{i}, x) = g (a_{i} ∙ x + b_{i}) a_{i} \in R^{n}, b_{i} \in R \end{matrix}$ （2）

对于输入的 $\begin{matrix} N \end{matrix}$ 个训练样本集 $\begin{matrix} {(x_{j}, t_{j}) | j = 1,2, \dots, \end{matrix} \begin{matrix} N}, x_{j} = [x_{j 1}, x_{j 2}, \dots, x_{jn}]^{T} \end{matrix}$ ,其中 $\begin{matrix} x_{j} \in R^{n} \end{matrix}$ 代表输入的信号特征向量, $\begin{matrix} t_{j} \in R^{m} \end{matrix}$ 表示样本点的物理位置,则有

$\begin{matrix} G (a_{i}, b_{i}, x_{j}) = g (a_{i} ∙ x_{j} + b_{i}) a_{i} \in R^{_{n}}, b_{i} \in R, j = 1,2, \dots, N \end{matrix}$ （3）

可用矩阵表达为：

$\begin{matrix} \{\begin{matrix} H = {[\begin{matrix} G (a_{1}, b_{1}, x_{1}) & \dots & G (a_{L}, b_{L}, x_{1}) \\ ⋮ & ⋱ & ⋮ \\ G (a_{1}, b_{1}, x_{N}) & \dots & G (a_{L}, b_{L}, x_{N}) \end{matrix}]}_{N \times L}, \\ β = {[\begin{matrix} β_{1}^{T} \\ ⋮ \\ β_{L}^{T} \end{matrix}]}_{L \times m}, T = {[\begin{matrix} t_{1}^{T} \\ ⋮ \\ t_{N}^{T} \end{matrix}]}_{N \times m} \end{matrix} \end{matrix}$ （4）

根据式（1）可得 $\begin{matrix} Hβ = T ， \end{matrix}$ 即期望获得具有 $\begin{matrix} L \end{matrix}$ 个隐层节点的SLFN零误差逼近这 $\begin{matrix} N \end{matrix}$ 个样本。

根据文献[12]-[14],参数 $\begin{matrix} a_{i} \end{matrix}$ 和 $\begin{matrix} b_{i} \end{matrix}$ 无需在训练过程中进行调整,只需在初始阶段随机指定即可。因此 $\begin{matrix} H \end{matrix}$ 与 $\begin{matrix} T \end{matrix}$ 均为已知, $\begin{matrix} H^{†} \end{matrix}$ 为 $\begin{matrix} H \end{matrix}$ 的伪逆矩阵^[15-16],则 $\begin{matrix} β \end{matrix}$ 的求解公式如式（5）所示。

$\begin{matrix} β = H^{†} T = (H^{T} {H)}^{- 1} H^{T} T \end{matrix}$ （5）

针对式（4）进行优化,即等价于求解式（6）。

$\begin{matrix} \arg \min_{β} (‖Hβ - T‖) \end{matrix}$ （6）

3.2 半监督极速学习机(SELM)

为了保证位置指纹定位算法预测精确度,需要采集大量的标定样本进行模型训练,当样本数据不够时,模型在定位精度上无法满足需求,而人工采集大批量的标定数据工作量巨大。故本文提出众包的采集思路,即用户在使用定位系统的过程中会进行指纹数据采集,通过获取该数据来作为模型更新时的增量训练数据,但这种方式采集到的训练数据为无标定数据。针对该问题,通过引入半监督极速学习机(Semi-supervised ELM, SELM)^[9]进行解决。使用SELM能充分利用非标定数据来训练模型,极大地减少标定数据采集的工作量。为了使SELM相比ELM能取得更好的位置预测效果,同时具有很好的泛化能力,根据结构风险最小化理论,模型需要在经验风险和学习函数 $\begin{matrix} f \end{matrix}$ 的复杂度之间取得平衡。本文通过引入图的拉普拉斯(Laplacian)算子^[17]来达到预期目标。在文献[18]中,使用图的平滑度函数 $\begin{matrix} S (f) = \end{matrix} \begin{matrix} \sum^{} W_{ij} (f_{i} - f_{j})^{2} = f^{T} L_{a} f \end{matrix}$ 来表示模型复杂性,其中 $\begin{matrix} L_{a} \end{matrix}$ 代表图的拉普拉斯算子。文献[19]已经讨论, $\begin{matrix} L_{a} = D - W \end{matrix}$ ,其中 $\begin{matrix} D \end{matrix}$ 表示一个对角矩阵,由 $\begin{matrix} D_{ii} = \end{matrix} \begin{matrix} \sum_{j}^{=} W_{ij} \end{matrix}$ 给出, $\begin{matrix} W_{ij} = e^{\frac{- | x_{i} - x_{j} |^{2}}{2 δ^{2}}} \end{matrix}$ 。由此,式（6）可通过最小化均方损失函数加上平滑度惩罚而转化为式（7）。

$\begin{matrix} \arg \min_{f} \frac{1}{2} ({‖f - T‖}^{2} + λ f^{T} L_{a} f) \end{matrix}$ （7）

由 $\begin{matrix} Hβ = f \end{matrix}$ 可得式（8）：

$\begin{matrix} \arg \min_{β} l (β) = \arg \min_{β} l \frac{1}{2} ({‖JHβ - T‖}^{2} + λ {(Hβ)}^{T} L_{a} Hβ) \end{matrix}$ （8）

根据上述, $\begin{matrix} H \end{matrix}$ 已知,且包含所有的标定数据集和非标定数据集,维度变为 $\begin{matrix} (l + u) \times L \end{matrix}$ 。

为了计算方便,可令 $\begin{matrix} J = diag (1,1, \dots, 0,0) \end{matrix}$ , $\begin{matrix} l \end{matrix}$ 个1,剩下的全部填充0。通过对式（8）求导得到其最优解如式（9）、（10）所示：

$\begin{matrix} \frac{\partial l}{\partial β} = 0 \Rightarrow {(JHβ - T)}^{T} JH + λ {(Hβ)}^{T} L_{a} H = 0 \end{matrix}$ （9）

进一步可得：

$\begin{matrix} β = ((J + λ {L_{a}}^{T} {) H)}^{†} JT \end{matrix}$ （10）

式（7）和式（10）中 $\begin{matrix} λ \end{matrix}$ 被置为0,意味着非标定样本数据被忽略,则 $\begin{matrix} β \end{matrix}$ 退化为式（5）的结果。

对于标定数据集： $\begin{matrix} \{(x_{i}, t_{i}) | x_{i} \in R^{n}, t_{i} \in R^{m}, i = \end{matrix} \begin{matrix} 1,2, \dots, l} \end{matrix}$ ,非标定数据集： $\begin{matrix} {x_{i} | x_{i} \in R^{n}, i = 1,2, \dots, u} \end{matrix}$ ,隐层节点数量为 $\begin{matrix} L \end{matrix}$ ,权值 $\begin{matrix} λ \end{matrix}$ , $\begin{matrix} G (a_{i}, b_{i}, x) = \end{matrix} \begin{matrix} g (b_{i} || x - a_{i} ||) \end{matrix}$ ,则整个位置估计过程可表示如下：

（1）通过随机方式来给 $\begin{matrix} a_{i} \end{matrix}$ 和 $\begin{matrix} b_{i} \end{matrix}$ 赋值,设置隐层节点数量为 $\begin{matrix} L \end{matrix}$ ;

（2）计算矩阵H( $\begin{matrix} H \end{matrix}$ 的维度为 $\begin{matrix} m \times L \end{matrix}$ ), $\begin{matrix} m = l + u \end{matrix}$ ;

（3）计算图的拉普拉斯算子 $\begin{matrix} L_{a} \end{matrix}$ , $\begin{matrix} L_{a} \end{matrix}$ 、 $\begin{matrix} D \end{matrix}$ 和 $\begin{matrix} W \end{matrix}$ 的维数均为 $\begin{matrix} m \times m \end{matrix}$ ;

（4）计算 $\begin{matrix} β \end{matrix}$ ;

（5）利用训练得到的模型 $\begin{matrix} f = Hβ \end{matrix}$ 进行预测。

在真实环境中,大批量标定数据采集非常困难,而非标定数据的获取比标定数据容易,通过无监督的众包方式,让用户在使用定位系统的过程中实时采集并上传收集到的Wi-Fi指纹数据,进而实时更新训练模型,此方法极大减少了标定数据采集的工作量,训练出来的模型具有很好的泛化能力,在标定数据十分稀疏的情况下也表现良好。

3.3 具有时效机制的增量式极速学习机模型 (TMELM)

指纹定位模型的预测精度很大程度上由离线阶段的标定数据和在线测试数据是否满足同一分布模型决定,在真实环境中,由于室内的Wi-Fi信号具有高动态性,室内建筑布置格局、空气温湿度以及人流等因素都会引起Wi-Fi信号不同程度的波动。在此情况下,依据采集数据训练出来的模型适应性会显著下降。为了解决模型分布不一致的问题,可引入增量学习框架^[20],其特点在于能将最新训练样本实时融入训练模型以达到模型适应环境变化的要求。然而传统增量学习模型框架未对训练样本的时效性加以考虑,采取同等对待方式,导致新模型的适应效果不佳。在3.2中通过半监督学习的方式充分利用了众包方式采集到非标定数据来训练更新位置指纹模型,然而用户不断提交的众包数据具有明显时效差异,不同时间段采集到的无标定数据对当前位置指纹模型的贡献度不一样。通过对新训练样本加以权重^[21]考虑,并引入了一种基于时效机制的增量式无线定位方法(Timeliness Managing Extreme Learning Machine, TMELM)^[10],可对实时加入的训练数据进行在线增量学习,同时融入时效机制来提高新训练数据对定位模型的贡献,从而保持定位模型的精度。

在增量学习框架中,对于新加入的训练数据 $\begin{matrix} X^{*} \end{matrix}$ ,一种处理方式是将其和以前的训练数据混合在一起,重新训练得到新的定位模型,另一种处理方式就是考虑新训练数据的贡献 $\begin{matrix} Δβ \end{matrix}$ ,将其与现有模型参数 $\begin{matrix} β \end{matrix}$ 相融合得到 $\begin{matrix} β^{*} \end{matrix}$ ,如式（11）所示,从而达到修正模型的效果,显然后者的计算量较前者减少很多。

$\begin{matrix} β^{*} = β + Δβ (X^{*}) \end{matrix}$ （11）

令 $\begin{matrix} K_{0} = H_{0}^{T} H_{0} \end{matrix}$ ,根据式（5）有 $\begin{matrix} β_{0} = K_{0}^{- 1} H_{0}^{T} T_{0} \end{matrix}$ , 新增加 $\begin{matrix} N_{1} \end{matrix}$ 个样本点的数据集为 $\begin{matrix} {\{(x_{i}, t_{i})\}}_{i = N + 1}^{N + N_{1}} \end{matrix}$ ,则有式（12）。

$\begin{matrix} K_{1} = {[\begin{matrix} H_{0} \\ H_{1} \end{matrix}]}^{T} [\begin{matrix} H_{0} \\ H_{1} \end{matrix}] = [H_{0}^{T} H_{1}^{T}] [\begin{matrix} H_{0} \\ H_{1} \end{matrix}] = K_{0} + H_{1}^{T} H_{1} \end{matrix}$ （12）

从而得到的新模型参数为式（13）：

$\begin{matrix} β_{1} = K_{1}^{- 1} {[\begin{matrix} H_{0} \\ H_{1} \end{matrix}]}^{T} [\begin{matrix} T_{1} \\ T_{1} \end{matrix}] \\ = K_{1}^{- 1} (K_{1} β_{0} - H_{1}^{T} H_{1} β_{0} + H_{1}^{T} T_{1}) \\ = β_{0} + K_{1}^{- 1} H_{1}^{T} (T_{1} - H_{1} β_{0}) \end{matrix}$ （13）

从式（13）可知 $\begin{matrix} β_{1} \end{matrix}$ 的计算是基于 $\begin{matrix} β_{0} \end{matrix}$ 进行的,不需要将所有原始数据混合新训练样本全部重新计算,从而大大减少了模型训练的计算量。同时式（13）中 $\begin{matrix} K_{1}^{- 1} H_{1}^{T} (T_{1} - H_{1} β_{0}) \end{matrix}$ 也体现了新加入样本对已有模型的修正。但该式（13）未考虑新增样本的时效性,根据上文分析,在真实环境中Wi-Fi信号具有高动态性,因此考虑新增数据的时效性,需建立一种时效机制,在该机制中越久远的数据对位置预测的贡献越少,越新的数据对位置预测的贡献越大,通过引入权重惩罚来实现该机制,即对式（13）做如下修改：

$\begin{matrix} β_{1} = β_{0} + ω ∙ K_{1}^{- 1} H_{1}^{T} (T_{1} - H_{1} β_{0}) \end{matrix}$ （14）

惩罚权重 $\begin{matrix} ω \end{matrix}$ 加强了新增数据对预测模型的影响程度,减少了旧数据对预测模型的贡献, $\begin{matrix} ω \end{matrix}$ 的取值通过实验获得,且在增量学习过程中保持不变。通过对 $\begin{matrix} ω \end{matrix}$ 进行牛顿迭代来使其收敛,即对式（14）进行迭代计算直到 $\begin{matrix} | β_{k + 1} - β_{k} |< ε \end{matrix}$ ( $\begin{matrix} ε \end{matrix}$ 为经验阈值)。

$\begin{matrix} β_{k + 1} = β_{k} + ω ∙ K_{k + 1}^{- 1} H_{K + 1}^{T} (T_{k + 1} - H_{k + 1} β_{k}) \end{matrix}$ （15）

根据3.2讨论,在得到通过用户进行的众包采集无标定数据后,需要将标定数据和非标定数据混合重新训练模型,本节通过引入增量学习方法,只需将非标定数据用于模型训练,从而大大提高了模型的训练速度,同时加入权重考虑,使更新后的模型具有更高的适应度,定位精度长时间保持在较高的水平。

3.4 特征自适应的在线连续增量极速学习机模型(FA-OSELM)

在3.3节中,通过引入时效机制和权重因子解决了众包采集数据因为时间段不同从而贡献度不一致,导致位置指纹模型适应度下降的问题。然而在真实环境中用户会移除部分AP,同时也有可能部署新的AP,由于不同的用户使用定位系统的时间不一致,这样在不同时间段采集到众包数据特征维数不等长,从而导致采集的训练数据、用户提交的无标定增量数据以及实时定位请求数据会出现维度不一致的问题。针对这一问题,本文进一步引入了特征自适应的在线增量极速学习机(Feature Adaptive Online Sequential Extreme Learning Machine, FA-OSELM)^[11],当真实环境中AP发生变化导致特征维度改变时,该方法能针对已训练模型的特征维度和新增样本数据特征维度不等长的情况进行自适应调整,从而有效利用新样本数据更新模型,提升了模型的适应度。

首先,给定N个具有区分度的样本 $\begin{matrix} \{(x_{i}, t_{i}) | x_{i} \in \end{matrix} \begin{matrix} R^{n}, \end{matrix} \begin{matrix} t_{i} \in R^{m}, i = 1,2, \dots, N} \end{matrix}$ , $\begin{matrix} x_{i} \end{matrix}$ 为 $\begin{matrix} n \times 1 \end{matrix}$ 向量, $\begin{matrix} x_{i} = \end{matrix} \begin{matrix} [x_{i 1}, x_{i 2}, \dots, x_{in}]^{T} \end{matrix}$ ,再给定 $\begin{matrix} N_{1} \end{matrix}$ 个同样具有区分度的样本集合 $\begin{matrix} {(x_{i}^{'}, t_{i}) | x_{i} \in R^{\overset{′}{n}}, t_{i} \in R^{m}, i = 1,2, \dots, N_{1}} \end{matrix}$ , $\begin{matrix} x_{i}^{'} \end{matrix}$ 为 $\begin{matrix} \overset{′}{n} \times 1 \end{matrix}$ 向量, $\begin{matrix} x_{i}^{'} = [x_{i 1}^{'}, x_{i 2}^{'}, \dots, x_{in'}^{'}]^{T} \end{matrix}$ ,当真实环境中AP数量发生变化时,根据式（6）均可转化为求式（16）的最小值。

$\begin{matrix} ‖[\begin{matrix} H_{0} \\ H_{1} \end{matrix}] β - [\begin{matrix} T_{0} \\ T_{1} \end{matrix}]‖ \end{matrix}$ （16）

$\begin{matrix} \{\begin{matrix} H_{0} = {[\begin{matrix} G (a_{1}, b_{1}, x_{1}) & \dots & G (a_{L}, b_{L}, x_{1}) \\ ⋮ & ⋱ & ⋮ \\ G (a_{1}, b_{1}, x_{N}) & \dots & G (a_{L}, b_{L}, x_{N}) \end{matrix}]}_{N \times L}, \\ T_{0} = {[\begin{matrix} t_{1}^{T} \\ ⋮ \\ t_{N}^{T} \end{matrix}]}_{N \times m} \end{matrix} \end{matrix}$ （17）

$\begin{matrix} \{\begin{matrix} H_{1} = {[\begin{matrix} G ({\overset{′}{a}}_{1}, {\overset{′}{b}}_{1}, x_{N + 1}^{'}) & \dots & G ({\overset{′}{a}}_{L}, b_{L}^{'}, x_{N + 1}^{'}) \\ ⋮ & ⋱ & ⋮ \\ G ({\overset{′}{a}}_{1}, {\overset{′}{b}}_{1}, x_{N + N_{1}}^{'}) & \dots & G ({\overset{′}{a}}_{L}, b_{L}^{'}, x_{N + N_{1}}^{'}) \end{matrix}]}_{N_{1} \times L}, \\ T_{1} = {[\begin{matrix} t_{N + 1}^{T} \\ ⋮ \\ t_{N + N_{1}}^{T} \end{matrix}]}_{N_{1} \times m} \end{matrix} \end{matrix}$ （18）

其中：

$\begin{matrix} {\{a_{i} = \{a_{1}, a_{2}, \dots a_{n}\}\}}_{i = 1}^{L}, {\{x_{i} = \{x_{1}, x_{2}, \dots x_{n}\}\}}_{i = 1}^{N} \end{matrix}$ （19）

$\begin{matrix} {\{{\overset{′}{a}}_{i} = \{{\overset{′}{a}}_{1}, {\overset{′}{a}}_{２}, \dots a_{\overset{′}{n}}^{'}\}\}}_{i = 1}^{L}, {\{{\overset{′}{x}}_{i} = \{{\overset{′}{x}}_{1}, {\overset{′}{x}}_{2}, \dots x_{\overset{′}{n}}^{'}\}\}}_{i = 1}^{N_{1}} \end{matrix}$ （20）

根据式（17）和式（18）, $\begin{matrix} a_{i} \end{matrix}$ 和 $\begin{matrix} x_{i} \end{matrix}$ 维度相同, $\begin{matrix} {\overset{′}{a}}_{i}^{} \end{matrix}$ 和 $\begin{matrix} {\overset{′}{x}}_{i} \end{matrix}$ 维度相同,而 $\begin{matrix} x_{i} \end{matrix}$ 和 $\begin{matrix} {\overset{′}{x}}_{i} \end{matrix}$ 维度不一致,因此,当特征维度发生改变时仅需要对 $\begin{matrix} a_{i} \end{matrix}$ 进行调整。本文引入输入权重转换矩阵 $\begin{matrix} P \end{matrix}$ 和输入权重增补向量 $\begin{matrix} Q_{i} \end{matrix}$ ,然后根据式（21）来生成 $\begin{matrix} {\overset{′}{a}}_{i}^{} \end{matrix}$ 。

$\begin{matrix} {\{a_{i}^{'} = a_{i} ∙ P + Q_{i}\}}_{i = 1}^{L} \end{matrix}$ （21）

其中,

$\begin{matrix} P = {[\begin{matrix} P_{11} & \dots & P_{1 \overset{′}{n}} \\ ⋮ & ⋱ & ⋮ \\ P_{n 1} & \dots & P_{n \overset{′}{n}} \end{matrix}]}_{n \times \overset{′}{n}} \end{matrix}$ （22）

$\begin{matrix} Q_{i} = {[Q_{1}, \dots, Q_{1 \overset{′}{n}}]}_{1 \times \overset{′}{n}} \end{matrix}$ （23）

矩阵 $\begin{matrix} P \end{matrix}$ 的规则如下：

（1）每行只有一维为1,其余维为0;

（2）每列最多有一维为1,其余维为0;

（3） $\begin{matrix} P_{ij} = 1 \end{matrix}$ 表示第 $\begin{matrix} i \end{matrix}$ 维特征转换成了第 $\begin{matrix} j \end{matrix}$ 维特征。

向量 $\begin{matrix} Q_{i} \end{matrix}$ 的规则如下：

（1）当特征维度减少时, $\begin{matrix} Q_{i} \end{matrix}$ 是一个全0向量;

（2）当特征维度增加时,如果 $\begin{matrix} {\overset{′}{a}}_{i}^{} \end{matrix}$ 是新的特征,则根据 $\begin{matrix} a_{i} \end{matrix}$ 的分布随机生成 $\begin{matrix} Q_{i} \end{matrix}$ 的第 $\begin{matrix} i \end{matrix}$ 维。

在新样本数据的特征维度发生变化时,无法根据式（15）来计算 $\begin{matrix} β^{*} \end{matrix}$ ,所以需要利用FA-OSELM方法重新生成 $\begin{matrix} {\overset{′}{a}}_{i}^{} \end{matrix}$ ,然后利用调整后的模型参数来迭代生成新的 $\begin{matrix} β^{*} \end{matrix}$ ,最后用更新后的模型做位置估计,从而在任意特征维度下本文模型都能自适应地进行调整,实现在线连续增量学习。

4 基于众包数据的室内定位平台

基于本文提出的室内定位方法,解决了基于位置指纹模型的室内定位方法在更新定位模型时遇到的3个难点问题,并实现了基于众包数据的室内定位平台,系统整体架构如图3所示。

显示原图| 下载原图ZIP| 生成PPT

图3 基于众包数据的室内定位平台架构图

Fig. 3 Whole architecture of localization platform

4.1 数据层

4.1.1 室内地图

在基础数据采集阶段和实际定位阶段,都需要用到室内地图。借助绘制准确的室内地图,基础采集阶段能对采集区域进行准确的位置映射标定,实际定位阶段能将定位结果直观、实时地呈现给用户。

4.1.2 指纹数据

指纹数据用于训练指纹定位模型,通过基础采集和众包采集2种数据采集方式,能得到大量训练样本,从而进行模型训练和更新。

（1）基础采集

对于位置指纹算法,初始指纹模型必须通过监督学习的方式得到,即初始训练时应采用有标定样本数据,而基础采集则用于得到相应的标定训练样本。在本定位平台中,基础采集部分由专门的数据采集客户端实现,对于从未采集过位置指纹数据的室内环境,通过制作相应的环境地图并结合数据采集客户端采集足够多的标定样本数据,训练出一个初始的位置指纹模型并应用于定位计算。

（2）众包采集

基础采集只用做训练初始定位模型,而在后续模型更新过程中,如果每次都通过人工的方式大批量采集标定样本数据,其工作量巨大,而众包采集则较好地解决了该问题。客户端在实时定位的过程中会自动进行无标定指纹数据采集,同时每一次位置估计都采集了相应的无标定样本,而用户使用定位平台的次数越多,时间越长,采集到的无标定样本数据也就越丰富。通过众包采集的方式能够得到海量的非标定样本,充分利用这些无标定样本来实时增量更新定位模型,一方面减少了人工采集标定数据的工作量,另一方面也能使指纹模型的定位精度长时间内维持在较高水平。

4.2 接口层

接口层提供定位引擎调用SDK,该SDK实现了定位核心算法,并封装为应用接口以供第三方应用程序调用。SDK的整体架构包含接口模块、数据加载模块和定位计算模块3个模块（图4）。

显示原图| 下载原图ZIP| 生成PPT

图4 SDK整体架构图

Fig. 4 Whole architecture of SDK

（1）接口模块

接口模块是对SDK整体框架的抽象封装,通过暴露简单的调用接口并结合回调机制来实现SDK底层和应用之间的数据通信和交互,第三方应用无需了解SDK的内部实现原理,便可轻松在应用中集成室内定位功能。

（2）数据加载模块

在执行定位计算前,SDK需将指纹数据加载至系统内存,然后才能进行位置计算。由于无法一次性将所有指纹数据加载到内存,在定位SDK中实现了基于新鲜度的缓存更新机制。通过该机制,SDK能减少从外部存储读取指纹的次数,有效地规避了IO瓶颈,提升了SDK的性能。

（3）定位计算模块

SDK加载完指纹数据后先进行AP扫描来采集定位数据,然后调用核心定位算法进行位置计算,最后将定位结果以接口回调的方式返回给应用,最终应用将定位结果以地图坐标的形式呈献给用户,同时SDK根据当前定位结果调用缓存更新算法进行异步指纹更新,保证当前缓存中指纹数据的有效性。

4.3 应用层

在定位SDK提供的定位和导航服务的基础上,根据实际的应用场景以及用户需求,能够实现不同的应用。如在陌生环境中,通过定位平台提供的实时导航功能,用户能快速到达目的地;商超可通过定位平台对目标用户推送促销活动信息,进行精准位置营销;通过在小区、智慧楼宇搭建定位平台,能极大提高社区的安全;基于位置的定位服务结合儿童手表打造智能防丢器,有效地保障了儿童在户外的安全性等。由此可见,基于众包数据的定位平台在步行导航、位置营销、安防监控和近场社交等领域均有丰富的应用场景。

5 定位平台系统应用案例

5.1 智能导购系统或者智能购物导航系统

大型商场的商家为了吸引消费者,会不定期推出各种优惠活动进行促销,但是在购物O-2-O闭环系统形成前,商家无法有效地把自身的促销信息及时推送给消费者。对消费者而言,他们在购物时无法有效地获知他们所在位置的周围商铺提供商品信息,无法获知是否有商家正在进行商品促销。对商场而言,他们希望对进入商场的消费者进行消费行为分析,以便于他们对商场进行改进,吸引更多商家入驻,吸引更多消费者进行消费。通过将自身的定位平台与大型商场、超市进行结合,实现了完整的购物导航系统（图5）,实现了O-2-O闭环的关键一步。用户只需要安装集成了定位导航功能的商场APP,进入商场后能实时收到周围商铺的推送信息,即使陌生的环境,他们也能通过APP的导航和路径规划顺利到达目的商铺。另外,用户可以在任何时间地点打开对应商场的APP,挑选喜爱的商品,然后直接到商场购买;同时,APP能记录消费者的常去的店铺,记录消费习惯,在积累一定数据后,商场便能从数据中挖掘潜在的商业价值。

显示原图| 下载原图ZIP| 生成PPT

图5 购物闭环系统应用APP

Fig. 5 The application of shopping closed-loop system

5.2 博览会导航

与商场、超市的导航需求不同,博览会导航是一种偶发性、短时性的应用场景。随着智慧城市的不断推进,相关技术与应用如雨后春笋层出不穷,博览会作为科技产业化的展示平台,承担着技术普及和推广的重大责任,如何及时有效地向客户推荐展品信息成为博览会的首要需求。当身处产品博览会时,由于对环境缺乏了解,面对着宽阔的展厅和数量繁多的展品,迫切希望短时间内获取丰富信息的客户往往会无所适从。利用定位平台的技术特点结合博览会需求,打造了面向展会的导航应用,如图6所示。该应用能提供展会的整体位置示意图,详细标识所有参展产品的位置,当用户对某一产品感兴趣时,只需通过简单操作,应用便能规划出当前位置到目标点的路径,指引用户到达。同时在用户走近某一展品区域时,应用能自动推送该产品的相关信息,既方便又快捷,真正实现了智慧博览。

5.3 智能停车服务或者辅助找车服务

另一个基于位置服务的典型需求是停车导航。大型停车场找车一直是一个难题,多个大型商场的地下停车场动辄上千个车位,如果用户不熟悉停车场的布局而停车场又缺乏简单明了的指示牌时,车主在找车上往往要花上很长时间。为了帮车主解决该难题,推出了停车场智能停车,如图7所示。车主只需要在停车时打开定位APP,记录当前位置,然后在用户想取车时,在任意位置再次打开APP,选择找车,APP便能根据当前位置和汽车所在位置自动规划路径,引导车主快速找到自己的汽车,为车主节省了大量找车时间。

显示原图| 下载原图ZIP| 生成PPT

图6 智慧展览应用

Fig. 6 The application of wisdom exhibition

显示原图| 下载原图ZIP| 生成PPT

图7 智慧停车应用

Fig. 7 The application of smart parking

6 结束语

针对基于指纹模型的Wi-Fi室内定位方法存在的3个核心问题：数据增量问题、增量数据的时效问题和数据特征维度不等长问题,本文分别引入了SELM.TMELM.FA-OSELM方法,从3个层面进行解决,有效提升了定位精度和稳定性,同时实现了基于众包数据的室内定位平台,将Wi-Fi定位与实际应用相结合,取得良好效果。下一步的工作将侧重Wi-Fi定位的更多具体应用,增强定位平台的稳定性,提升用户体验。

The authors have declared that no competing interests exist.

参考文献

原文顺序

文献年度倒序

文中引用次数倒序

被引期刊影响因子

[1]	Torteeka P, Chundi X I U. Indoor positioning based on wi-fi fingerprint technique using fuzzy k-nearest neighbor [C]. Applied Sciences and Technology (IBCAST), 2014 11th International Bhurban Conference on. IEEE, 2014:461-465. [本文引用: 1]
[2]	Youssef M, Agrawala A. The Horus WLAN location determination system [C]. Proceedings of the 3^rd international conference on Mobile systems, applications, and services. ACM, 2005:205-218. [本文引用: 1]
[3]	Sjöberg M, Koskela M, Viitaniemi V, et al. Indoor location recognition using fusion of SVM-based visual classifiers [C]. Machine Learning for Signal Processing (MLSP), 2010 IEEE International Workshop on. IEEE, 2010:343-348. [本文引用: 1]
[4]	Wu K, Xiao J, Yi Y, et al. Fila: Fine-grained indoor localization [C]. INFOCOM, 2012 Proceedings IEEE. IEEE, 2012:2210-2218. [本文引用: 1]
[5]	Sen S, Radunovic B, Choudhury R R, et al. You are facing the Mona Lisa: spot localization using PHY layer information [C]. Proceedings of the 10^th international conference on Mobile systems, applications, and services. ACM, 2012:183-196. [本文引用: 1]
[6]	Fang S H, Lin T N. Principal component localization in indoor WLAN environments [J]. Mobile Computing, IEEE Transactions on, 2012,11(1):100-110. https://doi.org/10.1109/TMC.2011.30 URL [本文引用: 1] 摘要 This paper presents a novel approach to building a WLAN-based location fingerprinting system. Our algorithm intelligently transforms received signal strength (RSS) into principal components (PCs) such that the information of all access points (APs) is more efficiently utilized. Instead of selecting APs, the proposed technique replaces the elements with a subset of PCs to simultaneously improve the accuracy and reduce the online computation. Our experiments are conducted in a realistic WLAN environment. The results show that the mean error is reduced by 33.75 percent, and the complexity by 40 percent, as compared to the existing methods. Moreover, several benefits of our algorithm are demonstrated, such as requiring fewer training samples and enhancing the robustness to RSS anomalies.
[7]	Zheng V W, Xiang E W, Yang Q, et al. Transferring Localization Models over Time [C]. AAAI. 2008:1421-1426. [本文引用: 1]
[8]	Sun Z, Chen Y Q, Qi J, et al. Adaptive localization through transfer learning in indoor wi-fi environment [C]. Machine Learning and Applications, 2008. ICMLA'08. Seventh International Conference on. IEEE, 2008:331-336. [本文引用: 1]
[9]	Liu J F, Chen Y Q, Liu M J, et al. SELM: semi-supervisedELM with application in sparse calibrated location estimation [J]. Neurocomputing, 2011,74(16):2566-2572. https://doi.org/10.1016/j.neucom.2010.12.043 URL [本文引用: 2] 摘要 Indoor location estimation based on Wi-Fi has attracted more and more attention from both research and industry fields. It brings two significant challenges. One is requiring a vast amount of labeled calibration data. The other is real-time training and testing for location estimation task. Traditional machine learning methods cannot get high performance in both aspects. This paper proposed a novel semi-supervised learning method SELM (semi-supervised extreme learning machine) and applied it to sparse calibrated location estimation. There are two advantages of the proposed SELM. First, it employs graph Laplacian regularization to import large number of unlabeled samples which can dramatically reduce labeled calibration samples. Second, it inherits the good property of ELM on extreme training and testing speed. Comparative experiments show that with same number of labeled samples, our method outperforms original ELM and back propagation (BP) network, especially in the case that the calibration data is very sparse.
[10]	刘军发,谷洋,陈益强,等. 具有时效机制的增量式无线定位方法 [J].计算机学报,2013,36(7):1448-1455. https://doi.org/10.3724/SP.J.1016.2013.01448 URL [本文引用: 2] 摘要基于WLAN（Wireless Local Area Networks）的无线定位是移动互联网领域的重要研究内容之一.其中,指纹定位方法已成为主流,此类方法的特点之一在于需要离线训练数据与在线测试数据具有严格的一致性.但在实际环境中,无线信号数据波动较大,存在显著的时效性问题.这导致一定时间后,定位模型的预测精度不断下降.文中提出一种具有时效机制的增量式定位方法（Timeliness Managing Extreme Learning Machine,TMELM）,一方面满足实际系统的应用需求,可随时加入新的训练数据进行在线增量式学习,另一方面融入时效机制,以最大化新增训练数据对定位模型的贡献,保持定位模型的精度.实验表明,在实际WLAN定位数据集上,文中方法相比于传统的几种增量式学习方法,具有明显的时效优势,能获得更好的定位精度. [ Liu J F, Gu Y, Chen Y Q, et al. Incremental Localization in WLAN Environment with Timeliness Management [J]. Chinese Journal of Computers, 2013,36(7):1448-1455. ] https://doi.org/10.3724/SP.J.1016.2013.01448 URL [本文引用: 2] 摘要基于WLAN（Wireless Local Area Networks）的无线定位是移动互联网领域的重要研究内容之一.其中,指纹定位方法已成为主流,此类方法的特点之一在于需要离线训练数据与在线测试数据具有严格的一致性.但在实际环境中,无线信号数据波动较大,存在显著的时效性问题.这导致一定时间后,定位模型的预测精度不断下降.文中提出一种具有时效机制的增量式定位方法（Timeliness Managing Extreme Learning Machine,TMELM）,一方面满足实际系统的应用需求,可随时加入新的训练数据进行在线增量式学习,另一方面融入时效机制,以最大化新增训练数据对定位模型的贡献,保持定位模型的精度.实验表明,在实际WLAN定位数据集上,文中方法相比于传统的几种增量式学习方法,具有明显的时效优势,能获得更好的定位精度.
[11]	Jiang X L, Liu J F, Chen Y Q, et al. Feature adaptive online sequential extreme learning machine for lifelong indoor localization [J]. Neural Computing and Applications,2016,27(1):215-225. URL [本文引用: 2]
[12]	Huang G B, Zhu Q Y, Siew C K. Extreme learning machine: a new learning scheme of feedforward neural networks [C]. Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on. IEEE, 2004,2:985-990. [本文引用: 1]
[13]	Huang G B, Zhu Q Y, Siew C K. Extreme learning machine: theory and applications [J]. Neurocomputing, 2006,70(1):489-501. https://doi.org/10.1016/j.neucom.2005.12.126 URL 摘要 It is clear that the learning speed of feedforward neural networks is in general far slower than required and it has been a major bottleneck in their applications for past decades. Two key reasons behind may be: (1) the slow gradient-based learning algorithms are extensively used to train neural networks, and (2) all the parameters of the networks are tuned iteratively by using such learning algorithms. Unlike these conventional implementations, this paper proposes a new learning algorithm called e xtreme l earning m achine (ELM) for s ingle-hidden l ayer f eedforward neural n etworks (SLFNs) which randomly chooses hidden nodes and analytically determines the output weights of SLFNs. In theory, this algorithm tends to provide good generalization performance at extremely fast learning speed. The experimental results based on a few artificial and real benchmark function approximation and classification problems including very large complex applications show that the new algorithm can produce good generalization performance in most cases and can learn thousands of times faster than conventional popular learning algorithms for feedforward neural networks. 1
[14]	Huang G B, Chen L, Siew C K. Universal approximation using incremental constructive feedforward networks with random hidden nodes [J]. Neural Networks, IEEE Transactions on, 2006,17(4):879-892. https://doi.org/10.1109/TNN.2006.875977 URL PMID: 16856652 [本文引用: 1] 摘要 According to conventional neural network theories, single-hidden-layer feedforward networks (SLFNs) with additive or radial basis function (RBF) hidden nodes are universal approximators when all the parameters of the networks are allowed adjustable. However, as observed in most neural network implementations, tuning all the parameters of the networks may cause learning complicated and inefficient, and it may be difficult to train networks with nondifferential activation functions such as threshold networks. Unlike conventional neural network theories, this paper proves in an incremental constructive method that in order to let SLFNs work as universal approximators, one may simply randomly choose hidden nodes and then only need to adjust the output weights linking the hidden layer and the output layer. In such SLFNs implementations, the activation functions for additive nodes can be any bounded nonconstant piecewise continuous functions g : R --> R and the activation functions for RBF nodes can be any integrable piecewise continuous functions g : R --> R and integral of R g(x)dx not equal to 0. The proposed incremental method is efficient not only for SFLNs with continuous (including nondifferentiable) activation functions but also for SLFNs with piecewise continuous (such as threshold) activation functions. Compared to other popular methods such a new network is fully automatic and users need not intervene the learning process by manually tuning control parameters.
[15]	Rao C R, Mitra S K. Generalized inverse of matrices and its applications [M]. New York: Wiley, 1971. [本文引用: 1]
[16]	Serre D, Matrices: Theory and Applications [M]. New York: Springer, 2002. [本文引用: 1]
[17]	Chung, F R. Spectral graph theory [C]. Vol. 92. American Mathematical Soc, 1997. [本文引用: 1]
[18]	Belkin M, Matveeva I, Niyogi P. Regularization and semi-supervised learning on large graphs [C]. Proceedings of 17^th Conference on Learning Theory(COLT), 2004,3120:624-638.
[19]	Belkin M, Niyogi P, Sindhwani V. Manifold regularization: A geometric framework for learning from labeled and unlabeled examples [J]. Journal of machine learning research, 2006,7(11):2399-434. https://doi.org/10.1007/s10846-006-9077-x URL 摘要 We propose a family of learning algorithms based on a new form of regularization that allows us to exploit the geometry of the marginal distribution. We focus on a semi-supervised framework that incorporates labeled and unlabeled data in a general-purpose learner. Some transductive graph learning algorithms and standard methods including support vector machines and regularized least squares can be obtained as special cases. We use properties of reproducing kernel Hilbert spaces to prove new Representer theorems that provide theoretical basis for the algorithms. As a result (in contrast to purely graph-based approaches) we obtain a natural out-of-sample extension to novel examples and so are able to handle both transductive and truly semi-supervised settings. We present experimental evidence suggesting that our semi-supervised algorithms are able to use unlabeled data effectively. Finally we have a brief discussion of unsupervised and fully supervised learning within our general framework.
[20]	Reddi S J, Sra S, Poczos B, et al. Fast Incremental Method for nonconvex optimization [J]. arxiv preprint arxiv:2016:1603.06159. URL [本文引用: 1] 摘要 We analyze a fast incremental aggregated gradient method for optimizing nonconvex problems of the form $\min_x \sum_i f_i(x)$. Specifically, we analyze the SAGA algorithm within an Incremental First-order Oracle framework, and show that it converges to a stationary point provably faster than both gradient descent and stochastic gradient descent. We also discuss a Polyak's special class of nonconvex problems for which SAGA converges at a linear rate to the global optimum. Finally, we analyze the practically valuable regularized and minibatch variants of SAGA. To our knowledge, this paper presents the first analysis of fast convergence for an incremental aggregated gradient method for nonconvex problems.
[21]	Ma R, Guo Q, Hu C, et al. An improved wi-fi indoor positioning algorithm by weighted fusion [J]. Sensors, 2015,15(9):1824-21843. https://doi.org/10.3390/s150921824 URL PMID: 26334278 [本文引用: 1] 摘要 The rapid development of mobile Internet has offered the opportunity for WiFi indoor positioning to come under the spotlight due to its low cost. However, nowadays the accuracy of WiFi indoor positioning cannot meet the demands of practical applications. To solve this problem, this paper proposes an improved WiFi indoor positioning algorithm by weighted fusion. The proposed algorithm is based on traditional location fingerprinting algorithms and consists of two stages: the offline acquisition and the online positioning. The offline acquisition process selects optimal parameters to complete the signal acquisition, and it forms a database of fingerprints by error classification and handling. To further improve the accuracy of positioning, the online positioning process first uses a pre-match method to select the candidate fingerprints to shorten the positioning time. After that, it uses the improved Euclidean distance and the improved joint probability to calculate two intermediate results, and further calculates the final result from these two intermediate results by weighted fusion. The improved Euclidean distance introduces the standard deviation of WiFi signal strength to smooth the WiFi signal fluctuation and the improved joint probability introduces the logarithmic calculation to reduce the difference between probability values. Comparing the proposed algorithm, the Euclidean distance based WKNN algorithm and the joint probability algorithm, the experimental results indicate that the proposed algorithm has higher positioning accuracy.

Chundi X I U. Indoor positioning based on wi-fi fingerprint technique using fuzzy k-nearest neighbor

2014

... 目前,基于Wi-Fi的定位方法主要包含以下4类：基于信号到达时间TOA(Time of Arrival)的定位方法;基于信号到达时间差TDOA(Time difference of Arrival)的定位方法;基于信号到达角度AOA(Angle of Arrive)的定位方法和基于信号强度RSS(Received Signal Strength)的定位方法.由于室内环境复杂,Wi-Fi信号易受障碍物反射,存在严重的多径效应,基于AOA的方法定位精度较低.此外,由于时间精准控制在近距离情况下难以实现,基于TOA和TDOA等定位方法误差过大,实用性差.基于RSS的定位方法因为信号检测装置简单、信号稳定性好以及定位方法简易等特点而受到广泛关注.基于RSS的Wi-Fi室内定位方法又称为位置指纹算法^[1],该方法包括离线训练阶段和在线预测阶段,如图1所示. ...

The Horus WLAN location determination system

2005

... 离线阶段的任务就是大量采集定位区域中各个参考点(Reference Point, RP)的标定指纹数据,然后利用机器学习方法挖掘Wi-Fi信号向量

\begin{matrix} X \end{matrix}

与真实物理位置

\begin{matrix} Y \end{matrix}

之间的映射关系

\begin{matrix} Y = f (X) \end{matrix}

;最终在线阶段针对输入的Wi-Fi信号向量

\begin{matrix} X^{*} \end{matrix}

利用模型估计出当前位置

\begin{matrix} Y^{*} \end{matrix}

.已有的研究通过不同的方法对模型精度进行提升,马里兰大学的HORUS^[2]定位系统通过建立信号空间并引入概率模型,同时采用联合分簇法和三角测量2种方式进行定位.Sjoberg等^[3]利用SVM结合计算机视觉技术取得了较好的定位效果.Wu等^[4]首次引入信道状态信息(Channel State Information, CSI),通过提取Wi-Fi信号中的稳定信道作为特征来进行定位,取得了良好的效果.Sen等^[5]也通过细粒度的CSI特征实现了波长级别的定位效果,同时降低了系统功耗.在高动态信号的优化方面,Fang等^[6]通过主成份分析(Principal Component Analysis, PCA)来提取Wi-Fi信号中最主要的成分作为特征训练定位模型,最终使平均定位误差降低了33.75%.Zheng等^[7]通过迁移学习来处理静态Radio Map在不同时间段的适用性,提出基于迁移的隐马尔科夫模型来动态调整Radio Map.Sun等^[8]将源数据和部分标定数据进行结合,并通过流形对齐(Manifold Alignment)来处理时间和设备的差异,从而保证了定位的精度. ...

et al. Indoor location recognition using fusion of SVM-based visual classifiers

2010

... 离线阶段的任务就是大量采集定位区域中各个参考点(Reference Point, RP)的标定指纹数据,然后利用机器学习方法挖掘Wi-Fi信号向量

\begin{matrix} X \end{matrix}

与真实物理位置

\begin{matrix} Y \end{matrix}

之间的映射关系

\begin{matrix} Y = f (X) \end{matrix}

;最终在线阶段针对输入的Wi-Fi信号向量

\begin{matrix} X^{*} \end{matrix}

利用模型估计出当前位置

\begin{matrix} Y^{*} \end{matrix}

et al. Fila: Fine-grained indoor localization

2012

... 离线阶段的任务就是大量采集定位区域中各个参考点(Reference Point, RP)的标定指纹数据,然后利用机器学习方法挖掘Wi-Fi信号向量

\begin{matrix} X \end{matrix}

与真实物理位置

\begin{matrix} Y \end{matrix}

之间的映射关系

\begin{matrix} Y = f (X) \end{matrix}

;最终在线阶段针对输入的Wi-Fi信号向量

\begin{matrix} X^{*} \end{matrix}

利用模型估计出当前位置

\begin{matrix} Y^{*} \end{matrix}

et al. You are facing the Mona Lisa: spot localization using PHY layer information

2012

... 离线阶段的任务就是大量采集定位区域中各个参考点(Reference Point, RP)的标定指纹数据,然后利用机器学习方法挖掘Wi-Fi信号向量

\begin{matrix} X \end{matrix}

与真实物理位置

\begin{matrix} Y \end{matrix}

之间的映射关系

\begin{matrix} Y = f (X) \end{matrix}

;最终在线阶段针对输入的Wi-Fi信号向量

\begin{matrix} X^{*} \end{matrix}

利用模型估计出当前位置

\begin{matrix} Y^{*} \end{matrix}

Principal component localization in indoor WLAN environments

2012

... 离线阶段的任务就是大量采集定位区域中各个参考点(Reference Point, RP)的标定指纹数据,然后利用机器学习方法挖掘Wi-Fi信号向量

\begin{matrix} X \end{matrix}

与真实物理位置

\begin{matrix} Y \end{matrix}

之间的映射关系

\begin{matrix} Y = f (X) \end{matrix}

;最终在线阶段针对输入的Wi-Fi信号向量

\begin{matrix} X^{*} \end{matrix}

利用模型估计出当前位置

\begin{matrix} Y^{*} \end{matrix}

et al. Transferring Localization Models over Time

2008

... 离线阶段的任务就是大量采集定位区域中各个参考点(Reference Point, RP)的标定指纹数据,然后利用机器学习方法挖掘Wi-Fi信号向量

\begin{matrix} X \end{matrix}

与真实物理位置

\begin{matrix} Y \end{matrix}

之间的映射关系

\begin{matrix} Y = f (X) \end{matrix}

;最终在线阶段针对输入的Wi-Fi信号向量

\begin{matrix} X^{*} \end{matrix}

利用模型估计出当前位置

\begin{matrix} Y^{*} \end{matrix}

et al. Adaptive localization through transfer learning in indoor wi-fi environment

2008

... 离线阶段的任务就是大量采集定位区域中各个参考点(Reference Point, RP)的标定指纹数据,然后利用机器学习方法挖掘Wi-Fi信号向量

\begin{matrix} X \end{matrix}

与真实物理位置

\begin{matrix} Y \end{matrix}

之间的映射关系

\begin{matrix} Y = f (X) \end{matrix}

;最终在线阶段针对输入的Wi-Fi信号向量

\begin{matrix} X^{*} \end{matrix}

利用模型估计出当前位置

\begin{matrix} Y^{*} \end{matrix}

et al. SELM: semi-supervisedELM with application in sparse calibrated location estimation

2011

... 本文所引入的半监督极速学习机(SELM)^[9]、具有时效机制的增量式定位方法(TMELM)^[10]和特征自适应的在线极速学习机(FA-OSELM)^[11]均基于极速学习机(Extreme Learning Machine, ELM),本节首先对ELM模型进行简要介绍,接着对SELM、TMELM、FA-OSELM算法模型进行阐述. ...

... 为了保证位置指纹定位算法预测精确度,需要采集大量的标定样本进行模型训练,当样本数据不够时,模型在定位精度上无法满足需求,而人工采集大批量的标定数据工作量巨大.故本文提出众包的采集思路,即用户在使用定位系统的过程中会进行指纹数据采集,通过获取该数据来作为模型更新时的增量训练数据,但这种方式采集到的训练数据为无标定数据.针对该问题,通过引入半监督极速学习机(Semi-supervised ELM, SELM)^[9]进行解决.使用SELM能充分利用非标定数据来训练模型,极大地减少标定数据采集的工作量.为了使SELM相比ELM能取得更好的位置预测效果,同时具有很好的泛化能力,根据结构风险最小化理论,模型需要在经验风险和学习函数

\begin{matrix} f \end{matrix}

的复杂度之间取得平衡.本文通过引入图的拉普拉斯(Laplacian)算子^[17]来达到预期目标.在文献[18]中,使用图的平滑度函数

\begin{matrix} S (f) = \end{matrix} \begin{matrix} \sum^{} W_{ij} (f_{i} - f_{j})^{2} = f^{T} L_{a} f \end{matrix}

来表示模型复杂性,其中

\begin{matrix} L_{a} \end{matrix}

代表图的拉普拉斯算子.文献[19]已经讨论,

\begin{matrix} L_{a} = D - W \end{matrix}

,其中

\begin{matrix} D \end{matrix}

表示一个对角矩阵,由

\begin{matrix} D_{ii} = \end{matrix} \begin{matrix} \sum_{j}^{=} W_{ij} \end{matrix}

给出,

\begin{matrix} W_{ij} = e^{\frac{- | x_{i} - x_{j} |^{2}}{2 δ^{2}}} \end{matrix}

.由此,式（6）可通过最小化均方损失函数加上平滑度惩罚而转化为式（7）. ...

具有时效机制的增量式无线定位方法

2013

... 指纹定位模型的预测精度很大程度上由离线阶段的标定数据和在线测试数据是否满足同一分布模型决定,在真实环境中,由于室内的Wi-Fi信号具有高动态性,室内建筑布置格局、空气温湿度以及人流等因素都会引起Wi-Fi信号不同程度的波动.在此情况下,依据采集数据训练出来的模型适应性会显著下降.为了解决模型分布不一致的问题,可引入增量学习框架^[20],其特点在于能将最新训练样本实时融入训练模型以达到模型适应环境变化的要求.然而传统增量学习模型框架未对训练样本的时效性加以考虑,采取同等对待方式,导致新模型的适应效果不佳.在3.2中通过半监督学习的方式充分利用了众包方式采集到非标定数据来训练更新位置指纹模型,然而用户不断提交的众包数据具有明显时效差异,不同时间段采集到的无标定数据对当前位置指纹模型的贡献度不一样.通过对新训练样本加以权重^[21]考虑,并引入了一种基于时效机制的增量式无线定位方法(Timeliness Managing Extreme Learning Machine, TMELM)^[10],可对实时加入的训练数据进行在线增量学习,同时融入时效机制来提高新训练数据对定位模型的贡献,从而保持定位模型的精度. ...

具有时效机制的增量式无线定位方法

2013

et al. Feature adaptive online sequential extreme learning machine for lifelong indoor localization

2016

... 在3.3节中,通过引入时效机制和权重因子解决了众包采集数据因为时间段不同从而贡献度不一致,导致位置指纹模型适应度下降的问题.然而在真实环境中用户会移除部分AP,同时也有可能部署新的AP,由于不同的用户使用定位系统的时间不一致,这样在不同时间段采集到众包数据特征维数不等长,从而导致采集的训练数据、用户提交的无标定增量数据以及实时定位请求数据会出现维度不一致的问题.针对这一问题,本文进一步引入了特征自适应的在线增量极速学习机(Feature Adaptive Online Sequential Extreme Learning Machine, FA-OSELM)^[11],当真实环境中AP发生变化导致特征维度改变时,该方法能针对已训练模型的特征维度和新增样本数据特征维度不等长的情况进行自适应调整,从而有效利用新样本数据更新模型,提升了模型的适应度. ...

Extreme learning machine: a new learning scheme of feedforward neural networks

2004

... ELM(Extreme Learning Machine)^[12-14]属于人工神经网络范畴,是一种单隐层前馈神经网络(Single Hidden Layer Feedforward Neural Network, SLFN).,该神经网络训练时间短,网络结构较为简洁（图2）.对于一个输入向量

\begin{matrix} x \in R^{n} \end{matrix}

和具有

\begin{matrix} L \end{matrix}

个隐层节点的SLFN,其输出形式定义如式（1）所示. ...

Extreme learning machine: theory and applications

2006

Universal approximation using incremental constructive feedforward networks with random hidden nodes

2006

\begin{matrix} x \in R^{n} \end{matrix}

和具有

\begin{matrix} L \end{matrix}

个隐层节点的SLFN,其输出形式定义如式（1）所示. ...

Generalized inverse of matrices and its applications

1971

... 根据文献[12]-[14],参数

\begin{matrix} a_{i} \end{matrix}

和

\begin{matrix} b_{i} \end{matrix}

无需在训练过程中进行调整,只需在初始阶段随机指定即可.因此

\begin{matrix} H \end{matrix}

与

\begin{matrix} T \end{matrix}

均为已知,

\begin{matrix} H^{†} \end{matrix}

为

\begin{matrix} H \end{matrix}

的伪逆矩阵^[15-16],则

\begin{matrix} β \end{matrix}

的求解公式如式（5）所示. ...

Matrices: Theory and Applications

2002

... 根据文献[12]-[14],参数

\begin{matrix} a_{i} \end{matrix}

和

\begin{matrix} b_{i} \end{matrix}

无需在训练过程中进行调整,只需在初始阶段随机指定即可.因此

\begin{matrix} H \end{matrix}

与

\begin{matrix} T \end{matrix}

均为已知,

\begin{matrix} H^{†} \end{matrix}

为

\begin{matrix} H \end{matrix}

的伪逆矩阵^[15-16],则

\begin{matrix} β \end{matrix}

的求解公式如式（5）所示. ...

Spectral graph theory

1997

\begin{matrix} f \end{matrix}

的复杂度之间取得平衡.本文通过引入图的拉普拉斯(Laplacian)算子^[17]来达到预期目标.在文献[18]中,使用图的平滑度函数

\begin{matrix} S (f) = \end{matrix} \begin{matrix} \sum^{} W_{ij} (f_{i} - f_{j})^{2} = f^{T} L_{a} f \end{matrix}

来表示模型复杂性,其中

\begin{matrix} L_{a} \end{matrix}

代表图的拉普拉斯算子.文献[19]已经讨论,

\begin{matrix} L_{a} = D - W \end{matrix}

,其中

\begin{matrix} D \end{matrix}

表示一个对角矩阵,由

\begin{matrix} D_{ii} = \end{matrix} \begin{matrix} \sum_{j}^{=} W_{ij} \end{matrix}

给出,

\begin{matrix} W_{ij} = e^{\frac{- | x_{i} - x_{j} |^{2}}{2 δ^{2}}} \end{matrix}

.由此,式（6）可通过最小化均方损失函数加上平滑度惩罚而转化为式（7）. ...

Regularization and semi-supervised learning on large graphs

2004

Manifold regularization: A geometric framework for learning from labeled and unlabeled examples

2006

et al. Fast Incremental Method for nonconvex optimization

2016

et al. An improved wi-fi indoor positioning algorithm by weighted fusion

2015

〈

〉

基于众包数据的室内定位方法和平台

Indoor Localization Method and Platform Based on Crowdsourcing Data

1 引言

2 基于Wi-Fi的定位方法

3 基于众包数据的室内定位方法

3.1 极速学习机(ELM)

3.2 半监督极速学习机(SELM)

3.3 具有时效机制的增量式极速学习机模型 (TMELM)

3.4 特征自适应的在线连续增量极速学习机模型(FA-OSELM)

4 基于众包数据的室内定位平台

4.1 数据层

4.2 接口层

4.3 应用层

5 定位平台系统应用案例

5.1 智能导购系统或者智能购物导航系统

5.2 博览会导航

5.3 智能停车服务或者辅助找车服务

6 结束语

参考文献 文献选项 原文顺序 文献年度倒序 文中引用次数倒序 被引期刊影响因子

参考文献

原文顺序

文献年度倒序

文中引用次数倒序

被引期刊影响因子