Message ID | 1358251363-14385-1-git-send-email-sven@open-mesh.com (mailing list archive) |
---|---|
State | Not Applicable, archived |
Headers | show |
On Tue, Jan 15, 2013 at 01:02:43PM +0100, Sven Eckelmann wrote: > The sample data received through the spectral scan can be either in big or > little endian byteorder. This information isn't stored in the output file. > Therefore it is not possible for the analyzer software to find the correct byte > order. > > It is relative common to get the data from a low end AP in big endian mode and > transfer it to another computer in little endian mode to analyze it. Therefore, > it would be better to store it in network (big endian) byte order. I'd agree that changing the byteorder to the "general" network order format is a good idea, although we will break compatibility again - which should hopefully be no problem as the original patch is pretty new anyway. I was pondering about performance loss when we run spectral on the same machine (i.e. maybe adding two useless byteswaps per value). OTOH the byteswaps are pretty cheap and we do some computations anyway, so the performance difference should be quite small. Therefore, Acked-by: Simon Wunderlich <siwu@hrz.tu-chemnitz.de> (I've CCd some more people in the reply so they can scream if they don't like it. ;]) Thanks, Simon > > Signed-off-by: Sven Eckelmann <sven@open-mesh.com> > Cc: Simon Wunderlich <siwu@hrz.tu-chemnitz.de> > --- > A patch for FFT_eval was also sent to the author. > > drivers/net/wireless/ath/ath9k/recv.c | 10 +++++++--- > 1 file changed, 7 insertions(+), 3 deletions(-) > > diff --git a/drivers/net/wireless/ath/ath9k/recv.c b/drivers/net/wireless/ath/ath9k/recv.c > index d7c129b..2fac787 100644 > --- a/drivers/net/wireless/ath/ath9k/recv.c > +++ b/drivers/net/wireless/ath/ath9k/recv.c > @@ -1064,8 +1064,9 @@ static void ath_process_fft(struct ath_softc *sc, struct ieee80211_hdr *hdr, > > fft_sample.tlv.type = ATH_FFT_SAMPLE_HT20; > fft_sample.tlv.length = sizeof(fft_sample) - sizeof(fft_sample.tlv); > + fft_sample.tlv.length = __cpu_to_be16(fft_sample.tlv.length); > > - fft_sample.freq = ah->curchan->chan->center_freq; > + fft_sample.freq = __cpu_to_be16(ah->curchan->chan->center_freq); > fft_sample.rssi = fix_rssi_inv_only(rs->rs_rssi_ctl0); > fft_sample.noise = ah->noise; > > @@ -1106,13 +1107,16 @@ static void ath_process_fft(struct ath_softc *sc, struct ieee80211_hdr *hdr, > mag_info = ((struct ath_ht20_mag_info *)radar_info) - 1; > > /* Apply exponent and grab further auxiliary information. */ > - for (i = 0; i < SPECTRAL_HT20_NUM_BINS; i++) > + for (i = 0; i < SPECTRAL_HT20_NUM_BINS; i++) { > fft_sample.data[i] = bins[i] << mag_info->max_exp; > + fft_sample.data[i] = __cpu_to_be16(fft_sample.data[i]); > + } > > fft_sample.max_magnitude = spectral_max_magnitude(mag_info->all_bins); > + fft_sample.max_magnitude = __cpu_to_be16(fft_sample.max_magnitude); > fft_sample.max_index = spectral_max_index(mag_info->all_bins); > fft_sample.bitmap_weight = spectral_bitmap_weight(mag_info->all_bins); > - fft_sample.tsf = tsf; > + fft_sample.tsf = __cpu_to_be64(tsf); > > ath_debug_send_fft_sample(sc, &fft_sample.tlv); > #endif > -- > 1.7.10.4 > > -- > To unsubscribe from this list: send the line "unsubscribe linux-wireless" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html
On 01/16/2013 09:59 AM, Simon Wunderlich wrote: > On Tue, Jan 15, 2013 at 01:02:43PM +0100, Sven Eckelmann wrote: >> The sample data received through the spectral scan can be either in big or >> little endian byteorder. This information isn't stored in the output file. >> Therefore it is not possible for the analyzer software to find the correct byte >> order. >> >> It is relative common to get the data from a low end AP in big endian mode and >> transfer it to another computer in little endian mode to analyze it. Therefore, >> it would be better to store it in network (big endian) byte order. > > I'd agree that changing the byteorder to the "general" network order format is > a good idea, although we will break compatibility again - which should hopefully > be no problem as the original patch is pretty new anyway. > > I was pondering about performance loss when we run spectral on the same machine > (i.e. maybe adding two useless byteswaps per value). OTOH the byteswaps are pretty > cheap and we do some computations anyway, so the performance difference should be quite > small. > > Therefore, > > Acked-by: Simon Wunderlich <siwu@hrz.tu-chemnitz.de> > > (I've CCd some more people in the reply so they can scream if they don't like it. ;]) > > Thanks, > Simon > Hi Simon, at this point, I'd like to advocate the approach I am currently using that does the conversion fully in user space. There, each sample provided consists of the fft data as bytes plus the values required to scale the magnitude values back to absolute power numbers (i.e. the 'bug-fixed' raw data). Since for the scaling an user space component is mandatory (FP calculations), I miss to see the benefit of splitting post-processing in the kernel (shifting bins) and user space. This step doubles fft data size and introduces endianess issues for no (at least to me) evident reason. Maybe you have a host application in mind where it does not really matter where the processing is done. Whereas, if you operate an AP as continuous spectral scanner, shifting the post-processing to the point where it is evaluated / displayed (usually your fat PC) significantly impacts your AP's load and its capability to provide higher sampling rates. I'd assume Adrian's proposal to just push out the raw spectral data to user-space is also targeting at that, which I fully support. The only thing I would keep in the driver is fixing the fft data corruption caused by known chip bugs. Cheers, Zefir >> >> Signed-off-by: Sven Eckelmann <sven@open-mesh.com> >> Cc: Simon Wunderlich <siwu@hrz.tu-chemnitz.de> >> --- >> A patch for FFT_eval was also sent to the author. >> >> drivers/net/wireless/ath/ath9k/recv.c | 10 +++++++--- >> 1 file changed, 7 insertions(+), 3 deletions(-) >> >> diff --git a/drivers/net/wireless/ath/ath9k/recv.c b/drivers/net/wireless/ath/ath9k/recv.c >> index d7c129b..2fac787 100644 >> --- a/drivers/net/wireless/ath/ath9k/recv.c >> +++ b/drivers/net/wireless/ath/ath9k/recv.c >> @@ -1064,8 +1064,9 @@ static void ath_process_fft(struct ath_softc *sc, struct ieee80211_hdr *hdr, >> >> fft_sample.tlv.type = ATH_FFT_SAMPLE_HT20; >> fft_sample.tlv.length = sizeof(fft_sample) - sizeof(fft_sample.tlv); >> + fft_sample.tlv.length = __cpu_to_be16(fft_sample.tlv.length); >> >> - fft_sample.freq = ah->curchan->chan->center_freq; >> + fft_sample.freq = __cpu_to_be16(ah->curchan->chan->center_freq); >> fft_sample.rssi = fix_rssi_inv_only(rs->rs_rssi_ctl0); >> fft_sample.noise = ah->noise; >> >> @@ -1106,13 +1107,16 @@ static void ath_process_fft(struct ath_softc *sc, struct ieee80211_hdr *hdr, >> mag_info = ((struct ath_ht20_mag_info *)radar_info) - 1; >> >> /* Apply exponent and grab further auxiliary information. */ >> - for (i = 0; i < SPECTRAL_HT20_NUM_BINS; i++) >> + for (i = 0; i < SPECTRAL_HT20_NUM_BINS; i++) { >> fft_sample.data[i] = bins[i] << mag_info->max_exp; >> + fft_sample.data[i] = __cpu_to_be16(fft_sample.data[i]); >> + } >> >> fft_sample.max_magnitude = spectral_max_magnitude(mag_info->all_bins); >> + fft_sample.max_magnitude = __cpu_to_be16(fft_sample.max_magnitude); >> fft_sample.max_index = spectral_max_index(mag_info->all_bins); >> fft_sample.bitmap_weight = spectral_bitmap_weight(mag_info->all_bins); >> - fft_sample.tsf = tsf; >> + fft_sample.tsf = __cpu_to_be64(tsf); >> >> ath_debug_send_fft_sample(sc, &fft_sample.tlv); >> #endif >> -- >> 1.7.10.4 >> >> -- >> To unsubscribe from this list: send the line "unsubscribe linux-wireless" in >> the body of a message to majordomo@vger.kernel.org >> More majordomo info at http://vger.kernel.org/majordomo-info.html -- To unsubscribe from this list: send the line "unsubscribe linux-wireless" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
Hey Zefir, On Wed, Jan 16, 2013 at 10:53:15AM +0100, Zefir Kurtisi wrote: > On 01/16/2013 09:59 AM, Simon Wunderlich wrote: > > On Tue, Jan 15, 2013 at 01:02:43PM +0100, Sven Eckelmann wrote: > >> The sample data received through the spectral scan can be either in big or > >> little endian byteorder. This information isn't stored in the output file. > >> Therefore it is not possible for the analyzer software to find the correct byte > >> order. > >> > >> It is relative common to get the data from a low end AP in big endian mode and > >> transfer it to another computer in little endian mode to analyze it. Therefore, > >> it would be better to store it in network (big endian) byte order. > > > > I'd agree that changing the byteorder to the "general" network order format is > > a good idea, although we will break compatibility again - which should hopefully > > be no problem as the original patch is pretty new anyway. > > > > I was pondering about performance loss when we run spectral on the same machine > > (i.e. maybe adding two useless byteswaps per value). OTOH the byteswaps are pretty > > cheap and we do some computations anyway, so the performance difference should be quite > > small. > > > > Therefore, > > > > Acked-by: Simon Wunderlich <siwu@hrz.tu-chemnitz.de> > > > > (I've CCd some more people in the reply so they can scream if they don't like it. ;]) > > > > Thanks, > > Simon > > > Hi Simon, > > at this point, I'd like to advocate the approach I am currently using that does > the conversion fully in user space. There, each sample provided consists of the > fft data as bytes plus the values required to scale the magnitude values back to > absolute power numbers (i.e. the 'bug-fixed' raw data). > > Since for the scaling an user space component is mandatory (FP calculations), I > miss to see the benefit of splitting post-processing in the kernel (shifting bins) > and user space. This step doubles fft data size and introduces endianess issues > for no (at least to me) evident reason. Well, that's right. The idea was to not send so "obscure" data, but I guess providing sample code which shifts the bins etc should be easy enough for people integrating this in userspace. > > Maybe you have a host application in mind where it does not really matter where > the processing is done. Whereas, if you operate an AP as continuous spectral > scanner, shifting the post-processing to the point where it is evaluated / > displayed (usually your fat PC) significantly impacts your AP's load and its > capability to provide higher sampling rates. I didn't see any performance limits yet. Shifting the data is probably not so expensive (what we do right now in the kernel), but doubled sample size and other aspects will certainly have an impact for this kind of "streaming" application for (low-end) APs. > > I'd assume Adrian's proposal to just push out the raw spectral data to user-space > is also targeting at that, which I fully support. The only thing I would keep in > the driver is fixing the fft data corruption caused by known chip bugs. As far as I understood, Adrian returns the whole packet with all flags and bugs, but I might be wrong. I'd agree to at least fix the length bug in the kernel (newer chips should not have this flaw anymore). In any case, I don't want to change the format multiple times again, so let's discuss and fix it once and for all[tm] now. There are some parties already experimenting with and integrating this feature now. And if I'm not mistaken we have ~3 weeks left to add changes for the next kernel version. My suggestions for the new format: * fix length bugs (as you said) * use a TLV format to hand out samples (as done now) - we can use a new type * each "sample packet" holds one sample, i.e. aggregated sample packets from the chip (we are not processing them in the current version, but in later versions maybe) are split into multiple "sample packets" * the bins are not shifted (8 bit per bin) * auxiliary information which is larger than 8 bit (tsf, frequency, etc) is converted into network byte order Anything else? Cheers, Simon
On 01/16/2013 02:18 PM, Simon Wunderlich wrote: > [...] >> Since for the scaling an user space component is mandatory (FP calculations), I >> miss to see the benefit of splitting post-processing in the kernel (shifting bins) >> and user space. This step doubles fft data size and introduces endianess issues >> for no (at least to me) evident reason. > > Well, that's right. The idea was to not send so "obscure" data, but I guess providing > sample code which shifts the bins etc should be easy enough for people integrating this > in userspace. > Understood, but providing 'semi-obscure' data is not really worth the overhead ;) >> >> Maybe you have a host application in mind where it does not really matter where >> the processing is done. Whereas, if you operate an AP as continuous spectral >> scanner, shifting the post-processing to the point where it is evaluated / >> displayed (usually your fat PC) significantly impacts your AP's load and its >> capability to provide higher sampling rates. > > I didn't see any performance limits yet. Shifting the data is probably not so expensive > (what we do right now in the kernel), but doubled sample size and other aspects > will certainly have an impact for this kind of "streaming" application for (low-end) APs. > Right now, with a 500MHz ARM SoC I am limited to ~12k-samples/s with resources fully utilized. That's far from the 250kS/s the chip could theoretically provide. But I'm still with my netlink based interface and need to check if relay-fs is more efficient. >> >> I'd assume Adrian's proposal to just push out the raw spectral data to user-space >> is also targeting at that, which I fully support. The only thing I would keep in >> the driver is fixing the fft data corruption caused by known chip bugs. > > As far as I understood, Adrian returns the whole packet with all flags and bugs, but I might > be wrong. I'd agree to at least fix the length bug in the kernel (newer chips should not > have this flaw anymore). > Right, that's why the 'bug-fixing' should remain part of the driver. > In any case, I don't want to change the format multiple times again, so let's discuss and > fix it once and for all[tm] now. There are some parties already experimenting with and integrating > this feature now. And if I'm not mistaken we have ~3 weeks left to add changes for > the next kernel version. > Beside the required modifications being trivial, since the user space post-processing is mandatory - could a library for data conversion help to decouple driver from test development? > My suggestions for the new format: > * fix length bugs (as you said) > * use a TLV format to hand out samples (as done now) - we can use a new type > * each "sample packet" holds one sample, i.e. aggregated sample packets from the chip (we are not processing > them in the current version, but in later versions maybe) are split into multiple "sample packets" What are your plans for aggregation here, given that you need to convert the data won't (easily) work in the kernel? > * the bins are not shifted (8 bit per bin) > * auxiliary information which is larger than 8 bit (tsf, frequency, etc) is converted into network byte order > > Anything else? > With the rssi and noise-floor values already included: no, looks complete. > Cheers, > Simon > -- To unsubscribe from this list: send the line "unsubscribe linux-wireless" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
Hey Zefir, On Wed, Jan 16, 2013 at 05:00:09PM +0100, Zefir Kurtisi wrote: > [...] > >> > >> Maybe you have a host application in mind where it does not really matter where > >> the processing is done. Whereas, if you operate an AP as continuous spectral > >> scanner, shifting the post-processing to the point where it is evaluated / > >> displayed (usually your fat PC) significantly impacts your AP's load and its > >> capability to provide higher sampling rates. > > > > I didn't see any performance limits yet. Shifting the data is probably not so expensive > > (what we do right now in the kernel), but doubled sample size and other aspects > > will certainly have an impact for this kind of "streaming" application for (low-end) APs. > > > Right now, with a 500MHz ARM SoC I am limited to ~12k-samples/s with resources > fully utilized. That's far from the 250kS/s the chip could theoretically provide. > But I'm still with my netlink based interface and need to check if relay-fs is > more efficient. Well, give it a try. My application is not requiring maximum output, but I've changed to relayfs because it was suggested for performance reasons. Let's hope it can keep this promise. :) > [...] > > In any case, I don't want to change the format multiple times again, so let's discuss and > > fix it once and for all[tm] now. There are some parties already experimenting with and integrating > > this feature now. And if I'm not mistaken we have ~3 weeks left to add changes for > > the next kernel version. > > > Beside the required modifications being trivial, since the user space > post-processing is mandatory - could a library for data conversion help to > decouple driver from test development? > Might be a good thing, although I don't plan to write this now - The fft_eval program is good enough for a reference right now. A library could handle both FreeBSD and Linux in the future. Feel free to contribute. ;) > > My suggestions for the new format: > > * fix length bugs (as you said) > > * use a TLV format to hand out samples (as done now) - we can use a new type > > * each "sample packet" holds one sample, i.e. aggregated sample packets from the chip (we are not processing > > them in the current version, but in later versions maybe) are split into multiple "sample packets" > What are your plans for aggregation here, given that you need to convert the data > won't (easily) work in the kernel? I haven't seen the aggregated spectral reports myself yet, only seen comments from Adrian. http://article.gmane.org/gmane.linux.kernel.wireless.general/102030 The plan would be to split them so userspace always gets one sample per TLV, and we don't have to create yet another format. So we would have two types: HT20 report and HT40 report that userspace can support. But we are not handling them now anyway, just to have the format somewhat future proof. :) > > * the bins are not shifted (8 bit per bin) > > * auxiliary information which is larger than 8 bit (tsf, frequency, etc) is converted into network byte order > > > > Anything else? > > > With the rssi and noise-floor values already included: no, looks complete. OK, if no one else objects or has further comments I can provide patches for this one. Cheers, Simon
Hi, Yes, I advocate doing it all in userspace. I'm just exporting the raw PHY errors via bpf, so I can just expose all PHY errors this way and let the userland application do what it wants with this. I do the same with radar data too, fwiw. Adrian
diff --git a/drivers/net/wireless/ath/ath9k/recv.c b/drivers/net/wireless/ath/ath9k/recv.c index d7c129b..2fac787 100644 --- a/drivers/net/wireless/ath/ath9k/recv.c +++ b/drivers/net/wireless/ath/ath9k/recv.c @@ -1064,8 +1064,9 @@ static void ath_process_fft(struct ath_softc *sc, struct ieee80211_hdr *hdr, fft_sample.tlv.type = ATH_FFT_SAMPLE_HT20; fft_sample.tlv.length = sizeof(fft_sample) - sizeof(fft_sample.tlv); + fft_sample.tlv.length = __cpu_to_be16(fft_sample.tlv.length); - fft_sample.freq = ah->curchan->chan->center_freq; + fft_sample.freq = __cpu_to_be16(ah->curchan->chan->center_freq); fft_sample.rssi = fix_rssi_inv_only(rs->rs_rssi_ctl0); fft_sample.noise = ah->noise; @@ -1106,13 +1107,16 @@ static void ath_process_fft(struct ath_softc *sc, struct ieee80211_hdr *hdr, mag_info = ((struct ath_ht20_mag_info *)radar_info) - 1; /* Apply exponent and grab further auxiliary information. */ - for (i = 0; i < SPECTRAL_HT20_NUM_BINS; i++) + for (i = 0; i < SPECTRAL_HT20_NUM_BINS; i++) { fft_sample.data[i] = bins[i] << mag_info->max_exp; + fft_sample.data[i] = __cpu_to_be16(fft_sample.data[i]); + } fft_sample.max_magnitude = spectral_max_magnitude(mag_info->all_bins); + fft_sample.max_magnitude = __cpu_to_be16(fft_sample.max_magnitude); fft_sample.max_index = spectral_max_index(mag_info->all_bins); fft_sample.bitmap_weight = spectral_bitmap_weight(mag_info->all_bins); - fft_sample.tsf = tsf; + fft_sample.tsf = __cpu_to_be64(tsf); ath_debug_send_fft_sample(sc, &fft_sample.tlv); #endif
The sample data received through the spectral scan can be either in big or little endian byteorder. This information isn't stored in the output file. Therefore it is not possible for the analyzer software to find the correct byte order. It is relative common to get the data from a low end AP in big endian mode and transfer it to another computer in little endian mode to analyze it. Therefore, it would be better to store it in network (big endian) byte order. Signed-off-by: Sven Eckelmann <sven@open-mesh.com> Cc: Simon Wunderlich <siwu@hrz.tu-chemnitz.de> --- A patch for FFT_eval was also sent to the author. drivers/net/wireless/ath/ath9k/recv.c | 10 +++++++--- 1 file changed, 7 insertions(+), 3 deletions(-)