1
00:00:06,880 --> 00:00:12,960
This is an OpenRefine project that is loaded
from a Wikimedia Commons category.
2
00:00:12,960 --> 00:00:15,200
I am looking at a selection of images here,
3
00:00:15,200 --> 00:00:18,920
and I am interested in adding
structured data to them.
4
00:00:18,920 --> 00:00:24,640
For instance: what is depicted in the files,
the photographer, et cetera.
5
00:00:24,640 --> 00:00:28,280
One way to add structured data is to actually
6
00:00:28,280 --> 00:00:32,360
take the wikitext (the unstructured
description) from these files...
7
00:00:32,360 --> 00:00:37,240
... and create a column with that wikitext.
8
00:00:37,240 --> 00:00:40,640
Later on, you can then extract
data from that wikitext
9
00:00:40,640 --> 00:00:42,520
and convert it to structured data.
10
00:00:42,520 --> 00:00:45,000
This is a very handy thing to do.
11
00:00:45,000 --> 00:00:47,080
How do you go about that?
12
00:00:47,080 --> 00:00:49,920
You select the column with your file names.
13
00:00:49,920 --> 00:00:52,600
As you can see, the file
names have been reconciled
14
00:00:52,600 --> 00:00:54,120
with Wikimedia Commons.
15
00:00:54,120 --> 00:00:57,960
So they are blue and they show a thumbnail.
16
00:00:57,960 --> 00:01:05,760
I select the column, and then I go to the function
"Add columns from reconciled values...".
17
00:01:05,760 --> 00:01:09,680
I get several options of things
I can retrieve about this file.
18
00:01:09,680 --> 00:01:12,000
I choose wikitext.
19
00:01:12,000 --> 00:01:16,476
Then it will load for a while,
and show me a preview...
20
00:01:16,476 --> 00:01:22,080
... and then I click "OK", and
then OpenRefine will generate
21
00:01:22,080 --> 00:01:28,960
a column for me with wikitext.