<?xml version='1.0' encoding='utf-8' ?>

<rss version='2.0' xmlns:lj='http://www.livejournal.org/rss/lj/1.0/' xmlns:atom10='http://www.w3.org/2005/Atom'>
<channel>
  <title>epershand</title>
  <link>https://epershand.dreamwidth.org/</link>
  <description>epershand - Dreamwidth Studios</description>
  <lastBuildDate>Wed, 13 Jun 2012 19:00:38 GMT</lastBuildDate>
  <generator>LiveJournal / Dreamwidth Studios</generator>
  <lj:journal>epershand</lj:journal>
  <lj:journaltype>personal</lj:journaltype>
  <image>
    <url>https://v2.dreamwidth.org/154130/171716</url>
    <title>epershand</title>
    <link>https://epershand.dreamwidth.org/</link>
    <width>100</width>
    <height>100</height>
  </image>

<item>
  <guid isPermaLink='true'>https://epershand.dreamwidth.org/73621.html</guid>
  <pubDate>Wed, 13 Jun 2012 19:00:38 GMT</pubDate>
  <title>A brief love note to the AO3</title>
  <link>https://epershand.dreamwidth.org/73621.html</link>
  <description>You know, I kvetch about the AO3 a lot, and their coding team has been doing a lot of hustling lately without getting a lot of love, but damn. Sometimes I am just hit by how fucking RIGHT they&apos;ve done something.&lt;br /&gt;&lt;br /&gt;For example: right now I&apos;m helping out &lt;span style=&apos;white-space: nowrap;&apos;&gt;&lt;a href=&apos;https://starlady.dreamwidth.org/profile&apos;&gt;&lt;img src=&apos;https://www.dreamwidth.org/img/silk/identity/user.png&apos; alt=&apos;[personal profile] &apos; width=&apos;17&apos; height=&apos;17&apos; style=&apos;vertical-align: text-bottom; border: 0; padding-right: 1px;&apos; /&gt;&lt;/a&gt;&lt;a href=&apos;https://starlady.dreamwidth.org/&apos;&gt;&lt;b&gt;starlady&lt;/b&gt;&lt;/a&gt;&lt;/span&gt; with a fandom studies project by writing her a script that looks at fanfiction HTML and extracts fandom, ship, publication date, etc. I&apos;m writing dedicated parsers for a few major fic archives.&lt;br /&gt;&lt;br /&gt;This is (roughly speaking) what my code looks like for the AO3:&lt;br /&gt;&lt;blockquote&gt;&lt;br /&gt;def GetAo3Metadata(self):&lt;br /&gt;    &quot;&quot;&quot;Extract metadata from Archive of Our Own Beautiful Soup object.&quot;&quot;&quot;&lt;br /&gt;    self.metadata.author = # Find the &quot;a&quot; tag with the class &quot;login author&quot;&lt;br /&gt;    self.metadata.title = # Find the &quot;h2&quot; tag with the class &quot;title heading&quot;&lt;br /&gt;    self.metadata.rating = # Get all items from the list with the class &quot;rating tags&quot;&lt;br /&gt;    etc.&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;This is, roughly speaking, what the code looks like for everything else:&lt;br /&gt;&lt;blockquote&gt;&lt;br /&gt;def ParseFanfictionNetMetadata(self):&lt;br /&gt;    &quot;&quot;&quot;Extract metadata from Fanfiction.net Beautiful Soup object.&quot;&quot;&quot;&lt;br /&gt;    # Find the block called &quot;gui_table1&quot; because, you know, that&apos;s meaningful.&lt;br /&gt;    # Fuck it, just extract all the text from that block.&lt;br /&gt;    # And then do a regular expression search.&lt;br /&gt;    # And then take a shot.&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;Or like this:&lt;br /&gt;&lt;blockquote&gt;&lt;br /&gt;def ParseYuletideTreasureMetadata(self):&lt;br /&gt;    &quot;&quot;&quot;Extract metadata from Yuletidetreasure.org Beautiful Soup object.&quot;&quot;&quot;&lt;br /&gt;    # Fuck, is this the nineties? Are there really NO DIVS in this code?&lt;br /&gt;    # Or class attributes?&lt;br /&gt;    # Or even fucking paragraph blocks?&lt;br /&gt;    # Fuck it, I&apos;m drinking.&lt;br /&gt;&lt;/blockquote&gt;</description>
  <comments>https://epershand.dreamwidth.org/73621.html</comments>
  <category>coding</category>
  <category>html pedantry</category>
  <category>fake code</category>
  <category>archive of our own</category>
  <category>fandom: archives</category>
  <category>it works bitches</category>
  <lj:security>public</lj:security>
  <lj:reply-count>2</lj:reply-count>
</item>
<item>
  <guid isPermaLink='true'>https://epershand.dreamwidth.org/57886.html</guid>
  <pubDate>Thu, 11 Aug 2011 14:10:28 GMT</pubDate>
  <title>Damn You, Mark Pilgrim</title>
  <link>https://epershand.dreamwidth.org/57886.html</link>
  <description>All this week, I have been making the same mistake. I look at the clock and think &quot;huh, I should go to bed soon. Maybe I&apos;ll just read a chapter of &lt;a href=&quot;http://diveintohtml5.org/&quot;&gt;Dive Into HTML5&lt;/a&gt; before I go to bed.&quot;&lt;br /&gt;&lt;br /&gt;It is generally about two hours after this that I pull myself away from whatever fascinating and specific Wikipedia or Quora article or Joel on Software blog post or whatever I am currently reading, because &lt;em&gt;Dive Into HTML5&lt;/em&gt; is the TV Tropes of computer manuals.&lt;br /&gt;&lt;br /&gt;Seriously, read the chapter &lt;a href=&quot;http://diveintohtml5.org/past.html&quot;&gt;A Quite Biased History of HTML5&lt;/a&gt; and tell me if YOU can drag yourself away from it and its links. Browser wars! Extended quotations of Marc Andreessen&apos;s emails! Snarky commentary on the methods of standards bodies!&lt;br /&gt;&lt;br /&gt;This thing is BETTER THAN THE &lt;a href=&quot;http://www.amazon.com/Operating-System-Concepts-Abraham-Silberschatz/dp/0470128720&quot;&gt;DINOSAUR OPERATING SYSTEMS TEXTBOOK&lt;/a&gt;. (This is, for the record, the highest praise I can bestow on any book about computers.) But now I&apos;ve got this fear that it&apos;s going to be like it was after that month where I read all the Sarah Vowell books. I went around wanting to tell people Exciting Facts! And the response was always &quot;oh yeah, I think I read something like that in a Sarah Vowell book once.&quot; I am totally going to be all &quot;BROWSER WARS!&quot; and people will be like &quot;oh yeah, that was an awesome chapter in &lt;em&gt;Dive Into HTML5&lt;/em&gt;.&quot;&lt;br /&gt;&lt;br /&gt;So far, the people on Twitter I&apos;ve enthused at have linked me to:&lt;br /&gt;&lt;a href=&quot;http://diveintomark.org/archives/2004/01/14/thought_experiment&quot;&gt;This snarky Pilgrim essay on XML&lt;/a&gt;&lt;br /&gt;&lt;a href=&quot;http://www.quora.com/Why-has-Microsoft-failed-to-make-Internet-Explorer-web-standards-compliant-in-spite-of-years-of-browser-market-share-loss&quot;&gt;This commentary on the positive things IE did in the world of browser development&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Oh, also the &lt;a href=&quot;http://en.wikipedia.org/wiki/Browser_wars&quot;&gt;Wikipedia page on BROWSER WARS!&lt;/a&gt; is amazing. But you already know that because you have read the chapter above, which links to it.</description>
  <comments>https://epershand.dreamwidth.org/57886.html</comments>
  <category>semantic web</category>
  <category>enthusiasm</category>
  <category>coding</category>
  <category>internets</category>
  <category>big damn geek sir</category>
  <category>books</category>
  <lj:security>public</lj:security>
  <lj:reply-count>1</lj:reply-count>
</item>
</channel>
</rss>
