Utilizador Discussão:Alex brollo

Wikisource, a biblioteca livre
Saltar para a navegação Saltar para a pesquisa
Emblem-question.svg Possui alguma dúvida? Já sabe como editar em wikis, mas está perdido quanto a editar no Wikisource? Ou nem sabe como editar? Quer saber o que o Wikisource aceita? Que textos, onde e como pode adicioná-los? O que mais se pode fazer depois de colocar algo? Alguém tem acesso a uma certa edição impressa, para rever se o texto está certo?
Esclareça suas dúvidas no Café dos novatos.

Re: Help needed[editar]

in my humble opinion you have made the correct choice. The original author is Dom Duarte and the transcription made is something like the scenery described at commons:Commons:When to use the PD-scan tag: the transcription have no independent copyright as it is simply a mechanical reproduction of an old, public domain image. So according to it and to commons:Commons:Derivative works, the author is Dom Duarte, the 1843 is their original work without big additions. Lugusto 18h15min de 8 de Novembro de 2008 (UTC)


Thanks Lugusto! I worry about those difficult questions, but the core of the work is another: to introduce here that beautiful and important text. So... I'll simply go on helping Cecilia in her work (I luckily met her at Fieracavalli, Verona) trying to have a complete an accurate transcription as soon is possible!--Alex brollo 05h49min de 9 de Novembro de 2008 (UTC)

Quality Level[editar]

Hi Alex! In pt.wikisource we normally describe the quality level in the "Informações sobre esta edição" (Information about this edition) box, whose content is edited in pages from a different namespace (Anexo), and we use the {{Predefinição:Progresso}}. However, this solution can be used only to mark the quality level of the whole book, not each of its subpages. -Diego UFCG 23h51min de 5 de Fevereiro de 2010 (UTC). An example: Anexo:Ficha/Livro da ensinança de bem cavalgar toda sella. -Diego UFCG 23h53min de 5 de Fevereiro de 2010 (UTC)

Yes, I agree with you. Your trick seems useful. -Diego UFCG 23h59min de 5 de Fevereiro de 2010 (UTC)

Better OCR[editar]

Hi Alex!

Some time ago I've started to add the text of Galeria:Elementos de Arithmetica.djvu into Wikisource, but the text I can get by OCR is not very useful, so I would probably need to write everything... Do you know if it is possible to get OCR again, with better quality using this file? Thanks in advance... Helder 23h08min de 17 de Março de 2010 (UTC)

Thank you! Any help is welcome!!! Smile.png Helder 23h40min de 17 de Março de 2010 (UTC)

Thanks!!!![editar]

I'm really grateful! These scans will help a lot! Smile.png

I'm sorry for not coming before... Anyway, I saw that you already got the bot flag.

Best regards, Helder 11h29min de 23 de Março de 2010 (UTC)

Dictionary[editar]

Hello!

I would like to know more about the dictionary you mention at Galeria Discussão:Bem cavalgar.djvu. How does it works? What is it for? Where is it? Helder 22h49min de 23 de Março de 2010 (UTC)

I'm a python user, and I enjoy towrite my own scripts with a layman programming style. One of those scripts "parses" a text into words; really it converts a text into a list of "words" and "separators", considering "separator" a customable list of characters. This conversion is really interesting, because it turns out as a list like this: [word,separator,word,separator...] and it's really easy to:
  1. collect and do anything with "words";
  2. build dictionaries;
  3. replace any of the members of the list;
  4. join back the list into a text.
If you can use Python, I can copy here the simple code that does the trick. :-) --Alex brollo 23h42min de 23 de Março de 2010 (UTC)
Humm... Could you copy it here then, please? o.O (I'm not a Python programmer, but I can understand it :-) )
By the way, do you know if it could be used for auto-correction (like plugins for firefox)?Helder 14h24min de 7 de Abril de 2010 (UTC)
Here the code, "as it I used it for Bem cavalgar":
    def splitter(testo,caratteri=u" ,.;<>\n/'=\"0123456789"): # testo is the input text string, caratteri is the list of 
                                                               # "not good" characters, all the other being "good"
    #inizializzazioni
    parole=[]   # container for words
    elem=""     # element (substring of characters all of them of the same type good or not good
    ft=len(testo)
    
    #caricamento primo carattere
    tc=(testo[0:1] in caratteri)   # tc (type of character) is a boolean classifying a character as "good" or not

    #scorrimento e analisi
    for i in range(ft): 
        elem=elem+testo[i:i+1]     browses the text character by character
        if not tc == (testo[i+1:i+2] in caratteri): #if the following charatcer belong to a different type...
            parole.append(elem) # ... then the word stops, add element to list
            elem=""             # ....initialize element again
            tc=not tc           # ....and change the character type

    # caricamento dell'ultima parola
    parole.append(elem)   # load the last element of the text
    termini=[]            # initialize the list of unique words
    for i in parole:                    
        if not i[0:1] in caratteri:   # when first character of element is "good", all the other are "good"
            if not i in termini:      # if the "good element" in not into termini nor it into a list of wiki markup used (following row)
                if not i in ['Alex', 'brollo', 'class', 'div', 'level', 'noinclude', 'pagequality', 'pagetext', 'references']:
                    termini.append(i) # append the new word
    termini.sort() # sorte the list termini
    termini=" ".join(termini) # join it into a string 
    
    return parole, termini # return both the original sequence of elements and the text from termini
About your question: no, better: I couldn't do it; but you can update Firebox dictionary with those lists...the simplest way, to post it into a form, and to select "add" for any "new" word. --Alex brollo 15h31min de 7 de Abril de 2010 (UTC)
Thank you! =) Helder 21h54min de 7 de Abril de 2010 (UTC)

I was thinking about this again, and maybe this kind of dictionary could serve as a first step to make "customizable conversion tables" like those used by Chinese Wikipedia (see meta:Automatic conversion between simplified and traditional Chinese). In the Wikisources' case, maybe it would be possible to have new "language variants" for "old languages". This way, when a Wikisource text is written using ancient words, the users could choose between reading the text in the "original version" (default) or in a automatically generated "modern version" (resulting from conversion based on the customizable tables).

The text from dictionary should be filled in to build the table with rules like "old word => new word;", one for each word which has a corresponding in the current (modern) language.

What do you think of that? Does it seems to be useful/possible? Helder 21h44min de 1 de Junho de 2010 (UTC)

Hi Alex!

Where could I find the dictionary of words you have made for Bem cavalgar? Helder 15h36min de 27 de Agosto de 2010 (UTC)

Bem cavalgar[editar]

Hello Alex!

I was looking at the book and I only found the following:

In general, I think the organization is good. =) Helder 00h10min de 1 de Junho de 2010 (UTC)

Another one without diacritics: Página:Bem cavalgar.djvu/31 :-) Helder 21h52min de 1 de Junho de 2010 (UTC)
Now I understand why you have used that syntax... =)
I think the it:Template:pt could be useful here too. Fell free to add it! Helder 19h21min de 2 de Junho de 2010 (UTC)
I've moved Predefinição:pt to Predefinição:PT. The links to the first one seems to be expecting the template from Wikimedia Commons, so I copied it over the redirect and updated Página:Bem cavalgar.djvu/31. Should be ok now... Helder 19h58min de 9 de Junho de 2010 (UTC)
Thanks Helder. I'm not so deep into Commons templates, so I could not imagine about a possible confusion. I'm so busy into it.source.... but, I hope, I'll come beck here towork a little about the new version of Duarte's book. --Alex brollo 17h40min de 27 de Agosto de 2010 (UTC)