Drugie zadanie
adres: http://www.pythonchallenge.com/pc/def/ocr.html
tytuł strony: ocr
podpowiedź:
recognize the characters. maybe they are in the book,
but MAYBE they are in the page source.
rozwiązanie:
zaglądam do kodu strony i znajduję tam mnóstwo dziwnych znaków i polecenie:
<!-- find rare characters in the mess below: --> <!-- %%$@_$^__#)^)&!_+]!*@&^}@[@%]()%+$&[(_@%+%$*^@$^!+]!&_#)_*} }}!}_]$[%}@[{_@#_^{*@##&{#&{&)*%(]{{([*}@[@&]+!!*{)!}{%+{))])[!^} )+)$]#{*+^((@^@}$[**$&^{$!@#$%)!@(&+^!{%_$&@^!}$_${)$_#)!({@!)(^} !*^&!$%_&&}&_#&@{)]{+)%*{&*%*&@%$+]!*__(#!*){%&@++!_) (...) #@}&$[[%]_&$+)$!%{(}$^$}* -->
co to znaczy rzadkie? do sprawdzenia jakie mam znaki w tym bałaganie stworzyłam histogram:
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
f = open('2.txt', 'r') | |
histogram = {} | |
file = f.read() | |
for char in file: | |
if char != '\n': | |
if char in histogram.keys(): | |
histogram[char] += 1 | |
else: | |
histogram[char] = 1 | |
print histogram |
odpalam:
w wyniku widać, że rzadkie znaki to zwykłe litery alfabetu ukryte pomiędzy różnymi innymi znakami. Skoro już wiadomo czego szukać rozwiązanie zajmie jedną linijkę 😉
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
import re | |
print "".join(re.findall(r"[a-z]", file)) |
adres kolejnej zagadki: http://www.pythonchallenge.com/pc/def/equality.html