Tradesmen Entrance web

 

Photo by Chris Radcliff  (CC BY-SA 2.0)     “Sign in Washington D.C. near the U.S. Capitol building. The arrow pointed to a blocked narrow staircase leading down to an unused basement door.”

Another comment, following my earlier one,  posted in response to  the Guardian article .

“Where are university websites hiding all their research?”

http://www.guardian.co.uk/higher-education-network/blog/2013/jan/10/research-communications-uk-university-websites

 The descriptive text for the photo above seems very appropriate to how Universities treat research pages on their websites.

Got a bit steamed up at the end – aux barricades citoyens and all that- but why not?  Am beginning to wonder if researchers give a damn about their research pages on university websites. If they “owned”  them it might be different.

(Comment follows)

“I submitted a clunky comment earlier.I’d like to repackage and try again.

First, I’ll declare an interest. I want to make money from the information about the research going on in UK universities. Let’s be clear, what I’m interested in is not the published papers. They are probably the most important output of university research but they are not the only product of that research. There is also the “translate and engage” information that you will find on the website pages about departmental research and in the research councils’ grant databases. This where a research unit can sketch out who’s doing what, where, and who’s paying. Where researchers can blog and post videos. Where they can engage possible collaborators from other disciplines and other countries. I want to take this information mix and mash it and sell it on to the world.  Why? Because the UK public sector research base is one of the few strengths the UK still has since Mrs. T’s long bet on the banks went bust. Why commercial? Because a “push” model of getting that information out as far as possible would serve UK universities better at engaging with the world outside the research units than the present “pull” model of researcher looking up researcher. And to push that information out requires an incentive to me, and others like me. And no, given the volume of UK research, I don’t think a trickle feed of research news items from university press departments counts as an effective push model

So what do I find when I get to a university departmental research page? A big sign saying “Gentlemen  and scholars only. No tradesmen”.That is what the present  “all rights reserved except for fair dealing exceptions” copyright T&Cs that are applied to university websites amount to.

Things are changing, Gateway to Research the beta phase RCUK portal to grant database information is under Open Government Licence- that is I can do more or less what I want with it, including commercial re-use, so long as I attribute source and don’t misrepresent.

Cicero21 quite rightly points out that universities don’t do research, it’s the researchers in departments who do. And who writes the departmental research webpages?  The researchers not the university administrators. So here’s my modest proposal to you researchers – take back control of your intellectual property, of your copyright over what you have written on those webpages, and decide for yourselves, unit by unit, what licensing terms you want to apply. You want to keep the No Tradesmen sign up, fair enough- but you decide.  And don’t be put off by being told that the university website can only support one level of licensing for everyone and everything. A modern CMS based university website that does not support explicit granularity of content licensing is not fit for purpose- especially with the emergence of MOOCs and OERs. And don’t be put off by administrators telling you that in your contract of employment you assigned your website page copyright to the “university”. Remind them that when you write for your departmental webpages it is no different from when you write a research paper. And if research papers are moving towards Open Access and Open Licences, and the research councils are behind that move, then why should your departmental web pages be different. Place them in the position of being perceived as Elsevier and see how they like it.”

(cross posted from www.samwellerdog.wordpress.com 19 Jan 2013)

With my  Scibella  hat on, one aim in 2013 is to  campaign for CC-BY “open”  licencing terms for research pages on university websites. It’s time to drag whoever writes their website T&Cs  away from a vision of university research as a “club for gentlemen & scholars “.    So have just commented on Guardian article

“Where are university websites hiding all their research?”

http://www.guardian.co.uk/higher-education-network/blog/2013/jan/10/research-communications-uk-university-websites

Mainly a cut and paste job from section 2 of my BIS select committee submission

http://www.publications.parliament.uk/pa/cm201213/cmselect/cmbis/367/367vw20.htm

with added Gateway to Research OGL licence (hooray,hooray) angle.

I would agree with Ian Carter of the University of Sussex that “presenting research and knowledge exchange information online is important as the university’s website is an entry point for potential funders, customers, partners and researchers themselves.”

A 2010 NESTA-RIN report characterised these research outputs as the “translating and engaging” part of the research cycle :

“involving the envisaged users of the research in actual or potential applications of it, in other research fields, commercialisation or policy” ..by means of “General articles, web pages, briefings, public exhibits, presentations”

Source: Open to All? Case studies of openness in research,
http://www.rin.ac.uk/system/files/attachments/NESTA-RIN_Open_Science_V01_0.pdf

It would seem obvious that the widest dissemination, within and beyond the UK, of this “translate & engage” information would be of great economic value. However, at present, nearly all of this information is only available under university website copyright terms & conditions which limit its use to non-commercial private research & study. Re-use for any other purpose is forbidden without “express written consent”—even if you are a non-commercial academic researcher.

Recently RCUK set up Gateway to Research, a portal for research grants, “aimed at those that wish to access UK research information, with a particular focus on innovation intensive SMEs, who wish to understand the UK research base”- ( http://gtr.rcuk.ac.uk/ ) using the Open Government Licence (more or less equivalent to a CC-BY licence). Under “Intellectual Property” ( http://gtr.rcuk.ac.uk/resources/ip.html ) they say:

“If an individual or a company wishes to learn more about a research project or gain access to IP, they should contact the host Research Organisation. Research Councils expect both the host Research Organisation and interested parties to take effective decisions about intellectual asset management that deliver the most benefit to society and the economy. This will include recognising circumstances where the publication of research outcomes or free dissemination to users might be the most effective approach.”

At present a innovative SME going from Gateway to Research to a university website to learn more about a research project,or a researcher,infringes the terms of use for that website if they make any copy of that information without “prior written permission”! Not quite the best way to deliver the most benefit to society and the economy.

Time for university website managers and others responsible for website T&Cs to get in line with the RCUK and change to a CC-BY or OGL licence for university website research pages?   “

(cross posted from www.samwellerdog.wordpress.com 19 Jan 2013)

 

Twitter blackbird pie

http://en.support.wordpress.com/twitter-blackbird-pie/

Hmm seems to work but if inserting in visual mode must unlink so that it’s just the pasted url and no wrap around <a href… etc.>

This was with picking post status via Tweetdeck… should work with regular twitter.. I’ll just post another

Ok must remember when typing enter for new line after insert that it will auto relink and then need to be unlinked again- so paste, new line, back unlink

ok back up and unlink then vedremo!

 

Archive last update: 28/05/2011

28/05 Added new:

CILIP briefing, manchester 19/05/2011 (copyright-education/research)

http://www.scibella.com/twitter/hashtags/ecopy11_manchester.xml 

Copyright- national & international strategies to support education & research, 27/05/2011

http://www.scibella.com/twitter/hashtags/ences.xml 

Digital preservation for Health Sciences, Newcastle, 26/05/2011 (managing research data)

http://www.scibella.com/twitter/hashtags/data4life.xml 

 

See notes after #list for more info about what this is about. I’ll get round to annotating more archived hashtags with  conference/workshop info over next days. List in no particular order:

Research Data

JISC Managing Research Data  international programme workshop 28-29 th March 2011 Aston UK  (28/03 2:23 pm onwards) + ongoing

http://www.scibella.com/twitter/hashtags/jiscmrd.xml  (weekly)

http://www.scibella.com/twitter/hashtags/rdap11.xml (closed)

http://www.scibella.com/twitter/hashtags/romeocris.xml (closed)

http://www.scibella.com/twitter/hashtags/dataprato.xml (closed)

http://www.scibella.com/twitter/hashtags/kblog.xml (weekly)

added 08 may 2011

Open software meeting at EBI 5May  http://bit.ly/iGeq2a

http://www.scibella.com/twitter/hashtags/mioss.xml  (closed)

DCC Research Data Management Forum Leicester 5-6May http://bit.ly/mbGivC

http://www.scibella.com/twitter/hashtags/rdmf.xml  (closed)

added 14 may 2011

UMF Shared Services and the Cloud Programme     symposium 12May London

http://www.scibella.com/twitter/hashtags/esym11.xml  (closed)

OER

http://www.scibella.com/twitter/hashtags/oerhack.xml  (closed)

http://www.scibella.com/twitter/hashtags/ukoer.xml  (weekly)

http://www.scibella.com/twitter/hashtags/openn11.xml (closed)

Social Media & Research

http://www.scibella.com/twitter/hashtags/bathcr.xml (closed)

Astronomy

http://www.scibella.com/twitter/hashtags/dotastro.xml (closed)

http://www.scibella.com/twitter/hashtags/nam2011.xml  (closed)

Synthetic Biology

6 academies #synbio3nations symposium tomorrow on #syntheticbiology. have posted the programme here: http://ow.ly/1stLv4

http://www.scibella.com/twitter/hashtags/synbio3nations.xml (closed)

Pachube/”Internet of things”

http://www.scibella.com/twitter/hashtags/pachube.xml (weekly)

http://www.scibella.com/twitter/hashtags/pachubehack.xml (closed)

http://www.scibella.com/twitter/hashtags/openorchard.xml (closed)

Science Policy

http://www.scibella.com/twitter/hashtags/scipolicy.xml  (weekly)

Science Communication

UKSG 2011  Harrogate, Yorkshire, UK 4-6 April 2011 + ongoing

http://www.scibella.com/twitter/hashtags/uksg.xml  (weekly)

Copyright 

# copyright best to follow via twapperkeeper archive

http://www.scibella.com/twitter/hashtags/ecopy11.xml 

For tweeps and others coming straight to this page – Scibella is a project to set up a CRIS (see about scibella page for more background) held up at the moment whilst copyright issues are sorted. However, this year, I’ve got into Twitter- at first as a source of UK science policy news- then I realised that Twitter/social media are very much part of what a CRIS needs to cover these days. One problem with Twitter is that with the volume of tweets their indexes now only go back 2 weeks or so. Anything before that is lost to search.  So something Scibella can do -which doesn’t take up too much time – is to archive the tweetstreams for conferences/workshops. Of course I now have the problem of  finding out what’s going on that’s worth archiving! 

To archive I’m  using desktop archivist   open source software running on windows xp pro sp3  with .NET 3.5 SP1 framework installed.  Uses twitter regular “garden hose” search feed so can only go back as far as normal search function from within twitter. Max search number going back is 1500. Another problem is that desktop archivist will only save tweetstreams from simple searches #xyz etc.  Any search using Twitter advanced search functions e.g. since: until: to: from: etc. will bring the results into archivist desktop but the program crashes when you try to save them.  Short of buying MS Visual Studio, learning WFP, and going through the source code I don’t think I can do anything about this problem.

The format the files are in is the default xml output from archivist desktop- to turn them back into full tweet streams with icons,active links etc, will take some xsl/xpath work which I’ll try to do asap. (Or download your own copy of archivist desktop and load them in via “open” option of file menu!)

 Have checked in IE8,Firefox 4, and Chrome (with xml extension) browsers from my windows XP desktop and files can be accessed OK.  However, there could be an XML parsing error with some browsers (thanks to  simonhodson99 for the heads up on this), haven’t yet been able to check with Mac, Iphone, Android etc. So fall back would be save with file name etc. Is that possible from a smartphone?

Hope this is useful. Any feedback on problems/ideas would be welcome.

PS: TwapperKeeper  is an alternative to archivist desktop and is well worth searching to see if your area of interest is covered. For example you’ll find #JISCMRD and #Copyright going back 2 or more years.  Trouble is that from April 2011 Twitter won’t let the web version of TwapperKeeper export data so you can view but can’t save.  If I ever get time,  and tweet archiving looks as if it could be a useful feature of Scibella, I’ll look into installing TwapperKeeper on Scibella’s server so that export is possible.  What would be useful is using either archivist desktop or twapperkeeper to merge xml files to produce sector compilations of workshops etc..

Default xml file out of archivist desktop is:

<TwitterDataModel RateLimitCount=”150” RateLimitStartTime=”2011-04-01T23:29+02:00” ErrorMessage=” ” SearchTerm=”%23jiscmrd” LastId=”53908716243783680” LastUpdate=”2011-04-01T22:29:14.21875+02:00” IconSource=”{x:Null}” State=”Active” xmlns=”clr-namespace:archivist;assembly=archivist” xmlns:x=”http://schemas.microsoft.com/winfx/2006/xaml“>

- <TwitterDataModel.Tweets>
- <TweetCollection SearchTerm=”{x:Null}” LastId=”{x:Null}” LastUpdate=”0001-01-01“>
  <Tweet BadWord=”{x:Null}” TweetStatus=”Unapproved” Username=”simonhodson99” TweetDate=”2011-04-01T21:56:54+02:00” Status=”RT @cygri: @PaulMiller attacks #linkeddataobsession with triple counts: time to focus on licensing & links: http://bit.ly/eOE8M5 #jiscmrd” TweetID=”53908716243783680” Image=”http://a1.twimg.com/profile_images/1190942574/Profile_normal.jpg” />
  <Tweet BadWord=”{x:Null}” TweetStatus=”Unapproved” Username=”kevingashley” TweetDate=”2011-04-01T17:45:45+02:00” Status=”@researchremix #rdap11 looks as if it might be doing a good job – but only have the tweetstream to go on #idcc10 #jiscmrd” TweetID=”53845515334467584” Image=”http://a1.twimg.com/profile_images/248761894/avamug_june_normal.jpg” />

….

<Tweet BadWord=”{x:Null}” TweetStatus=”Unapproved” Username=”gklyne” TweetDate=”2011-03-28T14:23:36+02:00” Status=”RT @CameronNeylon: I remain unconvinced that these big data sources are the real challenge. Is the mass of heterogeneous small data that is the issue #jiscmrd” TweetID=”52345089439182849” Image=”http://a3.twimg.com/profile_images/700133743/010710-1108-079_normal.jpg” />

 </TweetCollection>

  </TwitterDataModel.Tweets>
 <TwitterDataModel.Users>
  <UserCollection />
  </TwitterDataModel.Users>
 <TwitterDataModel.ChartList>
  <DataItem Name=”{x:Null}” Date=”2011-04-01” Count=”24” />
  <DataItem Name=”{x:Null}” Date=”2011-03-31” Count=”9” />
  <DataItem Name=”{x:Null}” Date=”2011-03-30” Count=”27” />
  <DataItem Name=”{x:Null}” Date=”2011-03-29” Count=”654” />
  <DataItem Name=”{x:Null}” Date=”2011-03-28” Count=”840” />
  </TwitterDataModel.ChartList>
 <TwitterDataModel.PieList>
  <DataItem Name=”researchremix” Date=”0001-01-01” Count=”178” />
  <DataItem Name=”datalibsam” Date=”0001-01-01” Count=”169” />
  <DataItem Name=”cardcc” Date=”0001-01-01” Count=”161” />
  <DataItem Name=”simonhodson99” Date=”0001-01-01” Count=”146” />

…..

<DataItem Name=”mbonett” Date=”0001-01-01” Count=”1” /> 

 <DataItem Name=”DafneJansen” Date=”0001-01-01” Count=”1” />

  <DataItem Name=”mymarkup” Date=”0001-01-01” Count=”1” />
  </TwitterDataModel.PieList>
  </TwitterDataModel>

 

All published Hargreaves IP Review submissions are .pdf files with names of type http://www.ipo.gov.uk/ipreview-c4e-sub-  followed by filename e.g http://www.ipo.gov.uk/ipreview-c4e-sub-scibella.pdf

So click on following link to open up the google search result for site:ipo.gov.uk/ipreview-c4e-sub- in a new window then type in your search term e.g. “fair use”  at the beginning before “site:…..”

http://www.google.it/search?source=ig&hl=en&rlz=&q=site%3Aipo.gov.uk%2Fipreview-c4e-sub-&btnG=Google+Search&aq=f&aqi=&aql=&oq

Happy searching

Updates: March 25 – finally got submission published- see after March 24th entry –  for IPO reason for delay and (excuse me) a sarcastic rant at their excuse.

updates: July 24th  have discovered from FOI disclosure log http://www.ipo.gov.uk/foi-log160.pdf  that Scibella submission was redacted,without informing me. (Hat tip to Tweeps @copyrightgirl and @filemot).  I have now appended the full e-mails between Scibella & IPO about publication of my submission. Bad enough to have to struggle to get the submission published, to discover that appendix 1 (extract from BBSRC website terms and conditions) has gone  missing – Grrr.. I Am going to contact IPO  to confirm what they redacted and the reason.  Suspect that they considered the 700 word extract was not fair dealing for purposes of review and criticism but infringed BBSRC copyright. I’ll hold my fire till I know more.

“Who steals my purse steals trash; ’tis something, nothing; ‘Twas mine, ’tis his, and has been slave to thousands; But he that filches from me my good name robs me of that which not enriches him, and makes me poor indeed”  Othello Act 3, scene 3, 155–161

March 24th:

IP REVIEW response Peter Carroll (Scibella) (.pdf 203 Kb)  (including Appendix 1 not published on IPO website)

Summary:

Scibella wants to make transformative re-use of UK university & research council website and other material to set up a Current Research Information Service . In particular job ads for research posts  past their application date which give details of  People, Units, Projects. These appear to have copyright protection for 70 years!

Problem is that copyright on nearly all sites allows only fair dealing for private research any other other use requires “express  written permission”.  This locks information in.  If you know where to look its there – a “pull” model  but a “push” model for wider dissemination is not allowed. Over 100 organisations to try and get permission from. 

Two solutions for Scibella:

Incorporate Scibella in USA where a fair use defence for a transformative re-use of job ads could be argued .  Don’t want to do this, USA very litigatious and couldn’t afford to defend.

 Could any aspects of fair use be introduced into UK? Yes, probably Gowers review recommendation 11 on transformative works.  This seems to have fallen between two stools.

Not taken forward in UK as Gowers refers to changing EU directive. However, EU have IViR (Amsterdam) study that says Gowers seeking change at EU level is wrong as EU directive does not cover harmonisation of “adaptation”.

Therefore UK could introduce Gowers recommendation 11 without prior change in EU law.   Gowers 11 including explicit mention of Berne 3 Step test criteria could establish UK version of fair-use, at least for transformative works, contained within a more defined framework than USA.  This could avoid excessive litigation.

The problems of defining “transformative” could be outweighed by advantages to UK economy, particularly relocation of creative industries into UK to take advantage of the exception.

Second Solution:

Extend  Open Government License for  re-use of public sector information coverage to include universities and research councils.  EU has consulted on this twice in 2008 and 2010. 2010 consultation shows a majority of respondents in favour of extension. However, APPSI has moved from  having no position in 2008 to disagreeing with extension in 2010.  Reason given in 2010 is that they “have seen no evidence from a UK perspective that would suggest the Directive needs to apply to these bodies.”  Not clear from this response  what exactly that means – statement of fact-no evidence received – or evidence seen, evaluated and conclusion made.

BBSRC (research council)  terms for use of new grants database very similiar to Open Government License so change is happening anyway.

 Update: 25th March. Submission finally published by IPO. So there it is under S for Scibella tucked up with its companions . 

Oh, and the reason for the delay::

“The team wanted to ensure that you are happy for this to be made public before placing it on our website. I will take it from your email that you are expecting this to be made public and will now send to our web team for publishing.”

How kind of them to think of my happiness whilst so busy themselves. After I had finished laughing I started to think a bit..

Now the default mode for submission is publication in toto unless 

“If you do not want part or whole of your response or name to be made public please state this clearly in the response, explaining why you regard the information you have provided as confidential.”

(see page 9 of the call for evidence for the rest of the small print)

Re-read my submission,  no nothing I could see that I had clearly stated should be regarded as confidential.  Ah,  but maybe the IPO is staffed with graduates skilled in deconstruction of the text.  Readers of Baudrillard and Lacan. Had they spotted some unconscious tension within the text between my desire to reveal and my desire to hide.  Maybe a méconnaissance of my true intentions? Re-read again, no to my eyes at least,  a plain piece of anglo saxon “sweat of the brow” composition with no  literary originality or merit.

Next, had I encoded Dan Brown style a message in the submission? Did the Ascii values of sentence word length spell .. I.. DO.. NOT.. WANT..  Still waiting for GCHQ to come back on this one..

So, more thinking. Maybe they wanted to save me from myself.

a)  The submission is so risible that people would snigger behind my back at the gym.. no one would speak to me at the squash club.. Twitter would untweet  me.. Facebook would efface me.  Thank you team but I’ll take my chances. Being an internet hill-billy from the back of beyond there are no cyber-metropolitan charmed circles I am part of.

b) I had given away my business plan, sold my birthright for a mess of pottage. True, for many years I kept quiet about what I was doing until I realised no-one was going to steal my idea. Probably dozens of people have had the same idea and realised that under present copyright law it’s dead in the water. Just me then , plugging on because I’ve got 10 years of  content which has turned into a tiger from which I cannot dismount.  Now, I would be glad to seek  collaboration because this project has become a long and lonely road. 

“Nel mezzo del cammin di nostra vita mi ritrovai per una selva oscura ché la diritta via era smarrita”  Dante Inferno Canto 1.

So copy on but just remember who’s got the data mine.

c) I might upset the powers that be- e.g. the universities & research councils from which, at the end of the day, I will have to get permission, one by one, committee by committee.  Sorry team but my estimation of the probability that any of those powers  will ever read my submission is close to zero.  

 OK let’s leave it there..sarcasm done..  at least I got it published. Still  wonder what the real reason was?

Update: July 24th:

emails between scibella & IPO

email 1  – Scibella to IPO pretty please where’s my submission?

Da: pete carroll [pete.carroll@scibella.com]

Inviato: martedì 22 marzo 2011 14.28

A: ‘ipreview@ipo.gov.uk’

Oggetto: Missing submission, Carroll Peter, (Scibella.com)

Priorità: Alta

Dear Sirs

I am rather concerned that my submission in response to the Call for Evidence has not appeared in the list of submissions received. It was emailed to you on 2nd march, before the 4th March deadline, and I received a response from yourselves that it had been received. See below. I can only assume, at this stage, that some mistake has been made, I quite understand that with a large number of submissions this can happen. I would be grateful if it could be added to the list as soon as possible so that it forms part of the body of evidence.

However, if there is any other reason as to why it has not been included I would appreciate as soon as possible some answer from yourselves as to why it is not appropriate for it to be included in the list of submissions on your website.

 Thank you for your prompt consideration of this matter.

 Peter Carroll

 From callforevidence [callforevidence@ipo.gov.uk] sent 02/03/2011 18.04

 “Thank you for your submission to the Call for Evidence for the Review of Intellectual Property and Growth. The information you have provided will be recorded and considered carefully.

 As indicated in the Call for Evidence, submissions made to the Review will be published on the Review’s website http://www.ipo.gov.uk/ipreview.htm in the near future unless you have indicated in your response you do not wish this to happen (as outlined on page 9 of the Call for Evidence.) We may refer to submissions received in the Review blog.

 You can follow the progress of the Review on the blog here http://www.uk-ipo.net/blog/. We welcome your participation.

Best wishes

Professor Ian Hargreaves and the Review Team.

—–Original Message—–

From: pete carroll [mailto:pete.carroll@scibella.com]

Sent: 02 March 2011 15:58

To: callforevidence

Subject: Peter Carroll (Scibella) Response to call for evidence

Importance: High

 Re: Hargreaves Review

Please find attached cover sheet and my response as word (.doc) files. If you have any problems in opening them please do not hesitate to contact me.

Peter Carroll”

e-mail 2  Scibella to IPO, not at all pretty please – get this sorted or else.

Da: pete carroll [pete.carroll@scibella.com]

Inviato: venerdì 25 marzo 2011 12.53

A: ‘information@ipo.gov.uk’

Cc: ‘HargreavesI@cardiff.ac.uk’; ‘ianhargreaves@msn.com’

Oggetto: FAO: Robin Webb, Director of Innovation

Re: Hargreaves IP Review submission by Carroll, Peter (Scibella) missing from published list. No response from IPO

 I know you must all be very busy but can you get someone in the IP review support team to sort this out asap. Background. Submitted 2nd march via pete.carroll@scibella.com – received acceptance response 02/03/2011 18.04. When full list published 17 march not there. Waited over weekend. Monday 21st @scibella asked via twitter @ip_review_uk all published? Answer, yes. Follow up tweet same day from @scibella to @ip_review_uk, you sure etc? so where’s mine? No further response to @scibella from @ip_review_uk via twitter channels. Follow up email Tuesday Mar 22 from pete.carroll@scibella.com to ipreview@ipo.gov.uk. Subject Missing submission, Carroll Peter, (Scibella.com). Gist Sure it’s a mistake etc- but if not please give reason. E-mail opened at your end Tuesday, March 22, 2011 1:56:49 PM. Now Friday, no reply- zilch, not even 30 second- thanks we’re looking into it will get back first response. Bloody pissed off with this, smacks of indifference towards the small guy, the SME. If Google or BPI submission had gone missing would it have been the same silence? Don’t think so.

Ok, your IPO contact page says response within 1 working day. Let’s say 2 working days (you’re busy). So no satisfactory response by end of play on Tuesday 29th and the story becomes about IPO silence, conspiracy or cock-up, etc. Will do my best to get it out to all interested parties, press, IP blogosphere. Think about it.

Ps: submission now in pdf format on http://bit.ly/i5Biq3 you can copy from there if it helps IPO.

Regards

Pete Carroll

Strada Castagna Bassa, 3

Calosso, Piemonte, Italy

Tel: 0039 0141 853446

Email: pete.carroll@scibella.com

e-mail 3   IPO to Scibella – holding reply

Da: Information [Information@ipo.gov.uk]

Inviato: venerdì 25 marzo 2011 16.03

A: ‘pete carroll’

Cc: HargreavesI@cardiff.ac.uk; ianhargreaves@msn.com

Oggetto: RE: Robin Webb, Director of Innovation

Dear Pete,

I have forwarded your email onto Robin Webb, who will look into your enquiry and respond in due course.

Yours sincerely

(name redacted by Pete Carroll)

Information Centre Manager

IPO

e-mail 4  IPO to Scibella – success at last, submission will be published

Da: ipreview [ipreview@ipo.gov.uk]

Inviato: venerdì 25 marzo 2011 15.23

A: ‘pete carroll’

Oggetto: RE: Missing submission, Carroll Peter, (Scibella.com)

Dear Pete,

Thank you for your email and your input to the review. Your submission has been recorded and will be considered along with the other submissions received.

Please accept my apology for the time it has taken in replying to you and for the delay in publishing your submission. The team wanted to ensure that you are happy for this to be made public before placing it on our website.

I will take it from your email that you are expecting this to be made public and will now send to our web team for publishing. This will appear on our website by 4:30pm today.

Kind regards

(name redacted by Peter Carroll)

IP Review team

—–Original Message—–

From: pete carroll [mailto:pete.carroll@scibella.com]

Sent: 22 March 2011 13:28

To: ipreview

Subject: Missing submission, Carroll Peter, (Scibella.com)

Importance: High

Dear Sirs>>>>

I tweeted via http://twitter.com/scibella

copyright status of old university vacancy ads – publish and be damned? http://bit.ly/htKNm3 @copyrightgirl @IP_Review_UK #IP_Review 5:26 PM Jan 30th via web

Thank you to Emily Goodhand http://copyright4education.blogspot.com/

for taking the time to reply and raising some good points.  My response to her comments are below in blue

@copyrightgirl Emily Goodhand  @scibella that’s a very interesting one! It’s true, the lifespan of a job advert is not long, and the employer would own the copyright…

See final points below

@copyrightgirl Emily Goodhand  @scibella do you absolutely have to use the full-text? Or can you just mine the appropriate data? Copyright does not protect facts – could..

Two sides to this.

1. Practical – I can program to data extract from html structure of JAC ad between two points beginning advert text and end of advert text. At present can’t go down into text to extract paragraphs relevant to a CRIS and exclude general job bumf.  Can probably do but it’s hassle. Anyway, more granularity wouldn’t help – point 2 below.

2. Copyright Law.  Protection extends to a ” substantial part” of the work. This can be very small as in 11 word danish case  (keyword + 5 words on either side to provide context) see:

http://www.farrer.co.uk/index.php?option=com_flexicontent&view=items&cid=191%3Aour_briefings&id=1114%3Acopying-just-11-words-could-be-too-much&Itemid=10

provided that the ““intellectual creation of the author” was copied in that extract.  Even newspaper headlines can be copyright protected:http://www.freehills.com/6830.aspx

I am working on entity extraction programs to go through adverts but what they extract beyond the “facts” of  name,unit, and this is the tricky programming part -a title- would probably be copyright covered. I.e. all context to provide meaning to the title. UK case law seems to be that only “insignificant” = unsubstantial. Everything else- i.e. context that provides significance is substantial. Hey,ho – not much encouragement there to write entity extraction programs.

@copyrightgirl Emily Goodhand @scibella it be argued that this piece of work is purely factual and does not attract sufficient creative originality to be copyrighted?22 hours ago  

I agree a job ad is hardly  Shakespeare or Milton but that doesn’t seem to be the test. UK case law appears to be that creativity = “sweat of the brow” – if it takes some intellectual effort to compose then it is a protected literary work.  see: http://en.wikipedia.org/wiki/Sweat_of_the_brow
 
 
Final points:
 
All this makes me wonder if copyright law in the 21st century just “doesn’t get” what the impact of moving from print to web for has been.  I’ll try to explain.
 
In the days of the steam press access to dissemination of information/knowledge was restricted through a few points – publishers- and relatively few people – “writers” who were mostly the “bourgeoisie” and above.  It was not easy to write (pre word processing) & publish so writing and publishing represented substantial effort- intellectual property- which was intended to have a lasting relevance. Rather like crafting by hand a fine piece of furniture- a family heirloom.  Copyright law reflected the values of this class – writing was property  and the law existed as walls and bars to protect this property.
 
With the web has come about a immense “democratisation” of this process and more importantly a change in the purpose of writing/publishing.   Most publication on the web is “ephemera” not intended to last but do a job- passing on information and opinion etc.  via blog, tweet, website etc.  To return to the furniture analogy – it’s Ikea not Chippendale.  A primary purpose of this ephemera is self publicity for individuals and organisations- this is is what I/we think- this is what I/we do. It’s aim is to spread as widely as possible, not to be a creative monument.
 
Yet copyright law still treats this ephemeral side of web publication as valuable IP. The twitter tweets of a 30 something with, we hope, a life expectancy of 90+  are (unless things change)  treated as valuable enough to retain copyright protection well into the 22nd century.
 
Just doesn’t make sense to me. Perhaps the default for web publication should be something like the creative commons CC-BY license, exertion of the moral right of attribution, but no attached economic IP  rights.  I.e. turn the system upside down.. rather than opting-out of protection by CC etc. – opt in  for more substantial work to licensing that preserves IP rights.

update: just come across this great Nina Paley cartoon- says it all..

clearly it's valuable!

Scibella is a project to set up a CRIS (Current Research Information Service) for UK public sector research in science & technology. At the core of a CRIS is People, Projects, Units (PPU).  Who is doing what, where.  The raw data for Scibella would come from vacancy adverts like the one below from the University  of Bath in 2008 – 2  years ago.

Research Assistant

Department of Pharmacy & Pharmacology

You can apply for this position by filling out an on-line application form.

 Further Details

From this ad can be extracted the following minimum PPU info

People: Dr Ian Eggleston (i.eggleston@bath.ac.uk)

Project title: photoactivatable caged iron chelators.

Unit: Bath. Department of Pharmacy & Pharmacology.

I think this PPU info is factual and therefore not covered by copyright. So far, so good.

But there is obviously more project info in the complete advert text. Text which presumably has enough creative content to be covered by copyright.

So, here is the copyright statement for the University of Bath website:

Copyright

All material on any of the University’s Web sites is, unless specified otherwise, copyright of the University of Bath and may not be reproduced without prior permission.

You may copy, print or download onto disc material for non-commercial use provided that any existing copyright or other intellectual property notices are retained.

You may not use or distribute material from this site or change the content of these pages in any way for any public or commercial purpose other than with the express written consent of the University of Bath and on such terms as the University of Bath may specify.

The University logo is a registered trade mark of University of Bath and should not be used without permission; requests to use the logo should be sent to web-support@bath.ac.uk.

Fairly standard terms for UK universities, fair dealing for private research and study. Anything else – get permission first and be prepared to negotiate terms.

 Here is my problem – I want to be able to use the full advert text in the CRIS.  This is not for private research or study, but for public and maybe commercial purpose. I am sure that if I contact all the UK universities and can find the right person to deal with and wait  for someone ,or some committee, to decide that it is OK for me to do so that I may get all the permissions required.  But this lengthy process, and its uncertainty of outcome,  is certainly a deterrent to setting up a CRIS whose purpose is to disseminate information to other researchers, facilitate collaboration between Universities and the private sector, etc.  All good and worthy aims which no doubt every university would support.

 Never mind that, the job ad has done its job, has expired, is no more (apologies to Monty Python fans!), has no further economic value to the University,  is no longer on the university website but perhaps, and only perhaps, archived somewhere on some disc or other that only Fred in the  IT department knows about – this advert text is covered by copyright until 2078.. Use it without “written permission”  and I infringe copyright law.

This offends my sense of natural justice. It is as if someone has discarded a worn-out tyre by the road and a poor peasant comes along thinking, ah I can re-use that. Shoes for myself and family. Then, one day, the  tyre owner turns up and says to the peasant – that’s still my tyre, I didn’t give you permission to re-use it, off to court you go.

So what should the right copyright status be for an expired job advert? Any comments would be very gratefully received.

 to be continued (Part 2. creative commons licence CC BY for university websites?)

 

So what’s been happening with Scibella over the last 18 months- just 5-6 hours a week gathering raw data, JAC ads and further particulars off university sites. 3.9 Gbytes in my public-ftp folder on scibella.com and counting at 40 Mbytes a week – mind you much of that is the puff and bureaucratese that is a HR dept.  job description. Probably, I have the best archive of  university terms & conditions in the UK.  I call it riding the tiger of content, stop and Scibella falls into the limbo of  discontinued blogs and projects that must fill the ISP servers of the world – a bit like all the dark matter that probably constitutes most of the universe. 

So, 2011 and I start again.  This time the business model is that there is no business model .  I haven’t abandoned the hope that there is something I could produce via Scibella that people would pay for. If it can be useful to people then there is value in its survival.  The best chance of it surviving is if it can make some money for someone . I am not a 40 something, a 50 something, but a 60 something who is not going to be around forever.  Scibella as a community service, a past-time (grandad in his cyber-shed), would stop when I get really old, get ill etc.  I have thought could I hand it over to a university department for free-  but these are hard times and who has the resources to spare?

So as the old song says “ride that tiger, ride that tiger”.

Latest thinking:

CRIS traditional as in http://www.eurocris.org/Index.php?page=homepage&t=1   = People Units Projects  (PUPs) as core and a bloody complicated relational database  (sorry CERIF people)

CRIS+  = PUPs +  background on the  science policy and funding environment the PUPs live in.  With all the emphasis on “impact” and “prioritisation” of research PUPs by themselves give only half the picture of what is going on in UK public sector research. 

tbc

Continuing business plan. Need to define  final output to subscribers and then work backwards from that to code up data manipulation and database(s) structure.

Want to give subscribers something where info is aggregated and tagged/categorised so that it can be scanned quickly and interesting new items picked out for follow up.  Call this a “full data view”

Possibilities

Pull Model?

Everything in database, generate a full data view from database via user generated  action.

Advantages – Intrinsic  flexibility of ways of viewing data, minimises user management overheads (alerting lists etc.)

Disadvantages.  Subscriber will forget to check, won’t use much, won’t re-subscribe.  Not re-assuring to subscriber that there is no “permanent” format to keep should Scibella (an unknown new start up)  fail.   Technically, query structure for hundreds of item could take annoying time to build up on screen.  Repeated queries for same month could occur (user would have to re-run over web every time) . Bandwidth quota issues for Scibella with host ISP?

Push Model?

Generate a stand-alone full data view file that gets sent out to subscribers. Users archive and read over organisation intranet or on local device.

Advantages – permanent output held by user on local PC or intranet. Access /Query times would be faster than web only access.

?? would item by item database(s)  (15000 project records a year) be needed at all.   CMS database(s) could be a lot simpler.

Disadvantages –  Flexibility of viewing, this approach “freezes” data format.   Copyright management – files could be transferred to non-subscribers etc.

Technically, what would be size of files.  Peak bandwidth/ISP reliability issues as files are pushed out.  Non-receipt issues (anti-spam filters in subscriber organisations etc.)

Hybrid push pull variants

Set up email/rss alert that new full data view is available for download from Scibella  (embed link to data download)

As above but also TOC that links back  to full view of item in database -this assumes database(s) containing project info.

Project Development Blog from June 2009 onwards.

Archives

Follow

Get every new post delivered to your Inbox.