I wrote about trying to use TwitteR to download AGU13 tweets. I'm getting fewer and fewer with my calls. I was very excited to try Webometric Analyst from Wolverhampton and described by Kim Holmberg in his ASIST webinar (BIG pptx, BIG wmv).
One of the things Webometric Analyst will do is do repeated searches until you tell it to stop. This was very exciting. But I tried it and alas, I think Twitter thinks I'm abusive or something because it was way throttled. Like I could see the tweets flying up on the screen at twitter.com but the search was retrieving like 6. I ran the R search mid-day today and got 99 tweets back which covered 5 minutes O_o. I asked for up to 2000, from the whole day, and had it set to retry if stopped.
I made an assumption and was called on it on Liblicense. I was under the impression that to be published you ( or someone from your place of work if it's work-for-hire like at mpow) had to have signed a copyright agreement. Turns out that over time, some publishers have been a little slipshod with this. They didn't follow up or they made some statement that if the author didn't sign, the transfer was assumed or claimed or whatever. Even if they got the agreement, they might not still have it.
Huh. According to Laura Quilter on LibLicense, in the US there has to be a signed agreement. So... makes you wonder... could these people who never signed agreements claim their own copyright back? Post their articles wherever they want? Or maybe it would still have to be the content stripped of all the publisher specific formatting and logo?
So I said I wouldn't analyze the data from '13 because I'm already underwater, I have plenty, and I need to get done. However, I figured that I had already figured out oauth and using twitteR so no harm in running a couple commands, stashing the data somewhere, and maybe pulling it out if there's a specific question or maybe later when turning my dissertation into an article (should I live so long!).
I thought well, it will give me about 2 weeks worth, but maybe I should give it a try while the conference is still going to make sure everything works ok. Well crap. I'm getting anywhere from 99 to 1000 tweets per query... and that's covering like at most 3 hours... and I can't seem to fill in the rest. Bummer.
The search has a sinceid but no untilid... and it has since and until for dates - but full days not down to the hour or minute or anything. So I'm really able to get 9pm-midnight GMT. Huh.
I watched Kim Holmberg's fabulous webinar today, so I'm going to try something he suggested to see if that helps. Otherwise, I kinda need to run the search throughout the day, which I can do if I work from home, but I will have missed the most important days of the conference. It's tapering off now. Sigh.
I should be more positive: picking up steam. Or picking up steam! But it's really pretty tenuous with the busyness at home and whatnot.
Here's where I am now... still (months late) analyzing the actual tweets as we head into the next conference. I'm going to collect and save this year's even if I don't plan to analyze it.
Started interviews and interviewee tweeted about the interview which I think is awesome, but Bo (*) would frown upon.
Research talk Tuesday (eek!) for CASCI. I had hoped it would be livestreamed but I haven't heard about that to share the details. I will post the slides to SlideShare.
So anyway, I'm still alive. Lots of new papers on blogging, incidentally. Here are 3 (one via Jason Priem the others... I don't remember where I found them?)
Hank, C. (2013). Communications in blogademia: An assessment of scholar blogs’ attributes and functions New Review of Information Networking, 18(2), 51-69. doi:10.1080/13614576.2013.802179
Luzón, M. J. (2013). Public communication of science in blogs: Recontextualizing scientific discourse for a diversified audience. Written Communication, 30(4), 428-457. doi:10.1177/0741088313493610 <- reading this now, and I like it, but the research methods are sort of inadequately described? I mean, I guess I'm not used to reading rhetoric papers, maybe this is typical? worth a blog post for sure
Mewburn, I., & Thomson, P. (2013). Why do academics blog? an analysis of audiences, purposes and challenges. Studies in Higher Education, 38(8), 1105-1119.
oh and this one that discusses blogging while you research
Olive, R. (2013). ‘Making friends with the neighbours’: Blogging as a research method. International Journal of Cultural Studies, 16(1), 71-84. doi:10.1177/1367877912441438
* Kazmer, M. M., & Xie, B. (2008). Qualitative interviewing in internet studies: Playing with the media, playing with the method. Information, Communication & Society, 11(2), 257-278. doi:10.1080/13691180801946333
The "sting" article that details a Sokal-Affair-type test of crap open access publishers to see if they really were crap open access publishers is getting all the attention. (do note that Hindawi and PlosONE quickly rejected the manuscript and Plos even questioned the ethical issues - hence they are not crap publishers but decent publishers).
When they shut down servers instead of keeping them up with a skeleton crew? LOC data services are down and, as far as I can tell, almost all of NASA's web stuff has been taken down. PubMed is available with a skeleton crew.
Joe Hourclé pointed out on some listservs that they were even given guidance to return an HTTP status of 302 which will essentially tell search engines that they're gone forever.
Luckily, a lot of government-funded science data services are run by universities and research labs that already have their funding and will keep working.
I first noticed this in chemistry but now I'm seeing it in engineering, too. Major publishers (?), content vendors, indexes (?) are starting to offer services whereby your company can use their search tool to index your local content and display it side by side with their content OR a way to use their api to pull their content in to your local tool.
That's a common complaint in research labs and in companies. Knowledge management of internal information gets funded and defunded and is cool and then not cool... External information is familiar to people coming out of school... how can you search across both.
We have the artist formerly known as Vivissimo (now IBM something spherical I think) as an intranet search, and I would love to see it use our discovery layer type search as a tab. I don't see why it couldn't.
This deserves analysis and thought - no time. sorry!
I don't believe I ever met F.W. Lancaster but I so value his work. His books and articles are so practical and useful. Clearly written. They're pretty much timeless, too. For example, his book: Lancaster, F. W. (1993). If you want to evaluate your library (2nd ed.). Champaign, IL: University of Illinois Graduate School of Library and Information Science, is on my bookshelf at work, and I do pull it down from time to time to reference various parts.
I showed a graph of the agu10 archive here, and more recently the agu11/2011 archive here, and now for the agu12/2012 archive. See the 2011 post for the exact methods used to get the data and to clean it.
#agu12 and #agu2012 largest component, nodes sized by degree
#agu12 and #agu2012 other components, no isolates, nodes sized by degree
I will have to review methods to show this, but from appearances, the networks are becoming more like hairballs. In the first year, half the people were connected to theAGU and the other half were connected to NASA, but very few were connected to both. The other prominent nodes were pretty much all institutional accounts. In 2011, that started to decrease and now in 2012 you can't really see that division at all. There are the top three nodes - two the same plus a NASA robotic mission - but then there's a large second group with degrees (connections to others) around 40-80 (combined indegree and outdegree) of individual scientists.