Influence maximization is the problem of finding a small set of seed nodes in
a social network that maximizes the spread of influence under a certain
diffusion model. The Greedy algorithm for influence maximization first proposed
by Kempe, later improved by Leskovec suffers from two sources of computational
deficiency: 1) the need to evaluate many candidate nodes before selecting a new
seed in each round, and 2) the calculation of the influence spread of any seed
set relies on Monte-Carlo simulations. In this work, we tackle both problems by
devising efficient algorithms to compute influence spread and determine the
best candidate for seed selection. The fundamental insight behind the proposed
algorithms is the linkage between influence spread determination and belief
propagation on a directed acyclic graph (DAG). Experiments using real-world
social network graphs with scales ranging from thousands to millions of edges
demonstrate the superior performance of the proposed algorithms with moderate
computation costs.
We introduce a new measure of activity of financial markets that provides a
direct access to their level of endogeneity. This measure quantifies how much
of price changes are due to endogenous feedback processes, as opposed to
exogenous news. For this, we calibrate the self-excited conditional Poisson
Hawkes model, which combines in a natural and parsimonious way exogenous
influences with self-excited dynamics, to the E-mini S&P 500 futures contracts
traded in the Chicago Mercantile Exchange from 1998 to 2010. We find that the
level of endogeneity has increased significantly from 1998 to 2010, with only
70% in 1998 to less than 30% since 2007 of the price changes resulting from
some revealed exogenous information. Analogous to nuclear plant safety
concerned with avoiding "criticality", our measure provides a direct
quantification of the distance of the financial market to a critical state
defined precisely as the limit of diverging trading activity in absence of any
external driving.
How far and how fast does information spre
95e
ad in social media? Researchers
have recently examined a number of factors that affect information diffusion in
online social networks, including: the novelty of information, users' activity
levels, who they pay attention to, and how they respond to friends'
recommendations. Using URLs as markers of information, we carry out a detailed
study of retweeting, the primary mechanism by which information spreads on the
Twitter follower graph. Our empirical study examines how users respond to an
incoming stimulus, i.e., a tweet (message) from a friend, and reveals that
%retweeting behavior is constrained by a few simple principles. the "principle
of least effort" combined with limited attention plays a dominant role in
retweeting behavior. Specifically, we observe that users retweet information
when it is most visible, such as when it near the top of their Twitter stream.
Moreover, our measurements quantify how a user's limited attention is divided
among incoming tweets, providing novel evidence that highly connected
individuals are less likely to propagate an arbitrary tweet. Our study
indicates that the finite ability to process incoming information constrains
social contagion, and we conclude that rapid decay of visibility is the primary
barrier to information propagation online.
A limit order book provides information on available limit order prices and
their volumes. Based on these quantities, we give an empirical result on the
relationship between the bid-ask liquidity balance and trade sign and we show
that liquidity balance on best bid/best ask is quite informative for predicting
the future market order's direction. Moreover, we define price jump as a
83b
sell
(buy) market order arrival which is executed at a price which is smaller
(larger) than the best bid (best ask) price at the moment just after the
precedent market order arrival. Features are then extracted related to limit
order volumes, limit order price gaps, market order information and limit order
event information. Logistic regression is applied to predict the price jump
from the limit order book's feature. LASSO logistic regression is introduced to
help us make variable selection from which we are capable to highlight the
importance of different features in predicting the future price jump. In order
to get rid of the intraday data seasonality, the analysis is based on two
separated datasets: morning dataset and afternoon dataset. Based on an analysis
on forty largest French stocks of CAC40, we find that trade sign and market
order size as well as the liquidity on the best bid (best ask) are consistently
informative for predicting the incoming price jump.
Modularity is an important concept in evolutionary theorizing but lack of a consistent definition renders study difficult. Using the generalized NK-model of fitness landscapes, we differentiate modularity from decomposability. Modular and decomposable systems are both composed of subsystems, but in the former, these subsystems are connected via interface standards, while in the latter, subsystems are completely isolated. We derive the optimal level of modularity, which minimizes the time required to globally optimize a system, both for the case of two-layered systems and for the general case of multi-layered hierarchical systems containing modules within modules. This derivation supports the hypothesis of modularity as a mechanism to increase the speed of evolution. Our formal definition clarifies the concept of modularity and provides a framework and an analytical baseline for further research.