Abstract
We develop efficient coordination techniques that support inelastic traffic in large-scale distributed dynamic spectrum access (\osa) networks. By means of any learning algorithm, the proposed techniques enable \osa users to locate and exploit spectrum opportunities effectively, thereby increasing their achieved throughput (orrewards to be more general). Basically, learning algorithms allow \osa users to learn by interacting with the environment, and use their acquired knowledge to select the proper actions that maximize their own objectives, thereby hopefully maximizing their long-term cumulative received reward. However, when \osa users' objectives are not carefully coordinated, learning algorithms can lead to poor overall system performance, resulting in lesser per-user average achieved rewards. In this paper, we derive efficient objective functions that \osa users can aim to maximize, and that by doing so, users' collective behavior also leads to good overall system performance, thus maximizing each user's long-term cumulative received rewards. We show that the proposed techniques are: (i) efficient by enabling users to achieve high rewards, (ii) scalable by performing well in systems with a small as well as a large number of users, (iii) learnable by allowing users to reach up high rewards very quickly, and (iv) distributive by being implementable in a decentralized manner.
Original language | English |
---|---|
Article number | 6264039 |
Pages (from-to) | 501-513 |
Number of pages | 13 |
Journal | IEEE Transactions on Network and Service Management |
Volume | 9 |
Issue number | 4 |
DOIs | |
Publication status | Published - 2012 |
Externally published | Yes |
Keywords
- Distributed resource allocation and management
- cooperative and coordinated learning
- dynamic and opportunistic spectrum access