'파이썬' 태그의 글 목록

파이썬

[CodeUp] 코드업 기초 100제 6004번 풀이 - 파이썬(python) 2021.07.17
[CodeUp] 코드업 기초 100제 6002번 풀이 - 파이썬(python) 2021.07.17
[CodeUp] 코드업 기초 100제 6001번 풀이 - 파이썬(python) 2021.07.17
[CodeUp] 코드업 기초 100제 6003번 풀이 - 파이썬(python) 2021.07.17
[Crawling] Slack 파이썬으로 연동 / 채팅하기 2021.06.09
[Crawling] 가상 돔을 활용한 크롤러 / selenium 2021.06.09
[Crawling] Generator 만들기 / 자바스크립트 yield와 비교 2021.06.08
[Crawling] iterator 만들기 2021.06.08
[Crawling] get,[] 차이 / 반복문 이용 2021.06.08
[Crawling] 크롤링 get요청, post요청하기 2021.06.07

[CodeUp] 코드업 기초 100제 6004번 풀이 - 파이썬(python)

2021. 7. 17. 22:26

728x90

6004 : [기초-출력] 출력하기04(설명)(py)

시간 제한: 1 Sec 메모리 제한: 128 MB

문제 설명

이번에는 작은 따옴표(')(single quotation mark)가 들어있는
출력문 연습을 해보자.

다음 문장을 출력하시오.

'Hello'

입력

입력 없음

출력

'Hello'

내 소스

print("'Hello'")

모범 답안

print("'Hello'")

728x90

'알고리즘 (Python) > 코드업 기초 100제' 카테고리의 다른 글

[CodeUp] 코드업 기초 100제 6006번 풀이 - 파이썬(python) (0)	2021.07.17
[CodeUp] 코드업 기초 100제 6005번 풀이 - 파이썬(python) (0)	2021.07.17
[CodeUp] 코드업 기초 100제 6002번 풀이 - 파이썬(python) (0)	2021.07.17
[CodeUp] 코드업 기초 100제 6001번 풀이 - 파이썬(python) (0)	2021.07.17
[CodeUp] 코드업 기초 100제 6003번 풀이 - 파이썬(python) (0)	2021.07.17

[CodeUp] 코드업 기초 100제 6002번 풀이 - 파이썬(python)

2021. 7. 17. 22:24

728x90

6002 : [기초-출력] 출력하기02(설명)(py)

시간 제한: 1 Sec 메모리 제한: 128 MB

문제 설명

이번에는 공백( )을 포함한 문장을 출력한다.
다음 문장을 출력해보자.

Hello World
(대소문자에 주의한다.)

입력

입력 없음

출력

Hello World

내 소스

print("Hello World")

모범 답안

print("Hello World")

728x90

'알고리즘 (Python) > 코드업 기초 100제' 카테고리의 다른 글

[CodeUp] 코드업 기초 100제 6006번 풀이 - 파이썬(python) (0)	2021.07.17
[CodeUp] 코드업 기초 100제 6005번 풀이 - 파이썬(python) (0)	2021.07.17
[CodeUp] 코드업 기초 100제 6004번 풀이 - 파이썬(python) (0)	2021.07.17
[CodeUp] 코드업 기초 100제 6001번 풀이 - 파이썬(python) (0)	2021.07.17
[CodeUp] 코드업 기초 100제 6003번 풀이 - 파이썬(python) (0)	2021.07.17

[CodeUp] 코드업 기초 100제 6001번 풀이 - 파이썬(python)

2021. 7. 17. 22:20

728x90

6001 : [기초-출력] 출력하기01(설명)(py)

시간 제한: 1 Sec 메모리 제한: 128 MB

문제 설명

python 언어에서 가장 기본적인 명령이 출력문이다.
print( )를 이용해 다음 단어를 출력하시오.

Hello

입력

출력

Hello

내 소스

print("Hello")

모범 답안

print("Hello")

728x90

'알고리즘 (Python) > 코드업 기초 100제' 카테고리의 다른 글

[CodeUp] 코드업 기초 100제 6006번 풀이 - 파이썬(python) (0)	2021.07.17
[CodeUp] 코드업 기초 100제 6005번 풀이 - 파이썬(python) (0)	2021.07.17
[CodeUp] 코드업 기초 100제 6004번 풀이 - 파이썬(python) (0)	2021.07.17
[CodeUp] 코드업 기초 100제 6002번 풀이 - 파이썬(python) (0)	2021.07.17
[CodeUp] 코드업 기초 100제 6003번 풀이 - 파이썬(python) (0)	2021.07.17

[CodeUp] 코드업 기초 100제 6003번 풀이 - 파이썬(python)

2021. 7. 17. 22:16

728x90

6003 : [기초-출력] 출력하기03(설명)(py)

시간 제한: 1 Sec 메모리 제한: 128 MB

문제 설명

이번에는 줄을 바꿔 출력하는 출력문을 연습해보자.
다음과 같이 줄을 바꿔 출력해야 한다.

Hello
World
(두 줄에 걸쳐 줄을 바꿔 출력)

입력

입력 없음

출력

Hello
World

내 소스

print("Hello")
print("World")

모범 답안

print("Hello")
print("World")

'''
또는

print("Hello\nWorld")
'''

728x90

'알고리즘 (Python) > 코드업 기초 100제' 카테고리의 다른 글

[CodeUp] 코드업 기초 100제 6006번 풀이 - 파이썬(python) (0)	2021.07.17
[CodeUp] 코드업 기초 100제 6005번 풀이 - 파이썬(python) (0)	2021.07.17
[CodeUp] 코드업 기초 100제 6004번 풀이 - 파이썬(python) (0)	2021.07.17
[CodeUp] 코드업 기초 100제 6002번 풀이 - 파이썬(python) (0)	2021.07.17
[CodeUp] 코드업 기초 100제 6001번 풀이 - 파이썬(python) (0)	2021.07.17

[Crawling] Slack 파이썬으로 연동 / 채팅하기

2021. 6. 9. 16:05

728x90

2021.06.09 - [Data Analysis/web crawling] - [Crawling] Slack Bot 만들기

전 포스팅과 연결되어 있다. 이번에는 슬랙에서 봇채팅 치는 것을 하겠다!

슬랙에 들어가 왼쪽 편을 봐보자

아 그리고 나는 test라는 채널을 추가했다.

세로로 나열된 점 세개를 누른다.

앱을 누른다.

hello를 눌러주면

여기에 내가 만든 봇이 뜨는데 만약 안 뜨더라도 검색하면 뜰 것이다.

이렇게 뜨고 동그라미로 표시한 부분을 눌러준다.

이 앱을 채널에 추가 를 눌러 원하는 채널에 초대해준다.

나는 test 채널에 초대할 것이다.

추가를 누르고 test 채널에 가보면 이게 뜰 것이다.

이제 파이썬으로 채팅을 해볼 것이다.

나는 주피터 노트북을 이용했는데 상관없다.!

from slackclient import SlackClient
import requests as rq
from bs4 import BeautifulSoup
import time

slack_token = 'xoxb-460297928240-2145122313062-cafL1R4QG7nFNRn4QMIrJeiR' #발급 받은 토큰
sc = SlackClient(slack_token)
    
    #메세지 전달
def notification(message):
    sc.api_call(
        "chat.postMessage",
        channel="#test", #{#채널}의 형태로 채널 지정
        text=message
    )

발급받은 토큰과 채널 이름은 사람마다 다를 것이다. 바꿔준 후

notification('안녕')

728x90

'Data Analysis > web crawling' 카테고리의 다른 글

[Crawling] txt , csv 파일 저장하기 / with 사용 / encoding (0)	2021.06.09
[Crawling] Slack Bot 만들기 (0)	2021.06.09
[Crawling] logging 사용 / 파일에 문서 생성하기 (0)	2021.06.09
[crawling] 웹 제어하기 / 키보드 제어 (selenium) (0)	2021.06.09
[Crawling] 가상 돔을 활용한 크롤러 / selenium (0)	2021.06.09

[Crawling] 가상 돔을 활용한 크롤러 / selenium

2021. 6. 9. 11:11

728x90

* 본 포스팅은 주피터 노트북에서 진행하였다.

[ 용어 정리 ]

크롤링(crawling) 혹은 스크레이핑(scraping)은 웹 페이지를 그대로 가져와서 거기서 데이터를 추출해 내는 행위다. 크롤링하는 소프트웨어는 크롤러(crawler)라고 부른다.

requests와 bs4를 이용하여 크롤러를 만들 수 있지만 자바스크립트의 동작이 많은 웹 사이트일 경우 한계가 있기 때문에 selenium을 사용한다.

chrome driver가 필요한데 만약 설치가 안되어 있다면 아래 포스팅을 참고하면 좋을 것이다.

2021.06.09 - [Data Analysis/web crawling] - [Crawling] 크롬 드라이버(ChromeDriver) 설치하기

[Crawling] 크롬 드라이버(ChromeDriver) 설치하기

chrome driver를 설치한다. https://chromedriver.chromium.org/downloads ChromeDriver - WebDriver for Chrome - Downloads Current Releases If you are using Chrome version 91, please download ChromeDrive..

hello-ming.tistory.com

from selenium import webdriver
url = "https://pjt3591oo.github.io"
driver = webdriver.Chrome('chromedriver')
driver.get(url)

브라우저가 자동으로 열린다.

만약 selenium이 없다면 아래의 코드를 이용하면 된다.

!pip install selenium

요소를 가져오자

selected_id = driver.find_element_by_id('nav-trigger') #id로 요소 선택
print(selected_id)
print(selected_id.tag_name) #태그명
print(selected_id.text) #태그에 속한 문자

selected_p = driver.find_element_by_tag_name('p') #id로 요소 선택
print(selected_p)
print(selected_p.tag_name) #태그명
print(selected_p.text) #태그에 속한 문자

selected_tags_p = driver.find_elements_by_tag_name('p') #id로 요소 선택
print(selected_tags_p)

selected_link = driver.find_element_by_class_name('p')
print(selected_link)
print(selected_link.tag_name)
print(selected_link.text)

728x90

'Data Analysis > web crawling' 카테고리의 다른 글

[Crawling] logging 사용 / 파일에 문서 생성하기 (0)	2021.06.09
[crawling] 웹 제어하기 / 키보드 제어 (selenium) (0)	2021.06.09
[Crawling] 크롬 드라이버(ChromeDriver) 설치하기 (0)	2021.06.09
[Crawling] 사이트 분석/ 크롤러 만들기 (0)	2021.06.08
[Crawling] 정규식을 이용한 bs4 고급 스킬 / 정규식 정리 / match와 search 비교 (0)	2021.06.08

[Crawling] Generator 만들기 / 자바스크립트 yield와 비교

2021. 6. 8. 11:24

728x90

* 본 포스팅은 주피터 노트북에서 진행하였다.

def test_generator():
    yield 1
    yield 2
    yield 3

gen = test_generator()
type(gen)

next(gen) #1
next(gen) #2

for i in test_generator():
    print(i)

def test_generator():
    print('yield 1 전')
    yield 1
    print('yield 1 과 2 사이')
    yield 2
    print('yield 2 과 3 사이')
    yield 3
    print('yield 3 후')

for i in test_generator():
    print(i)

yiled를 만나면 반환되지만 내용은 유지가 된다. 양보느낌

yiled를 보면 generator라고 생각하면 된다.

무한으로 generator 생성하기

def infinite_generator():
    count=0
    while True:
        count+=1
        yield count
        
gen = infinite_generator()

next(gen)

계속 누를 수록 숫자가 증가한다.

우리가 알고있는 리스트, Set, Dictionary의 표현식의 내부도 사실 generator 이다.

[x *x for x in [2,4,6]]
#[4, 16, 36]

print(type(x*x for x in [2,4,6]))
#<class 'generator'>

자바스크립트의 yield와 비교해보기

visual Studio Code 에서 진행하였다.

<!DOCTYPE html>
<html lang="en">
<head>
    <meta charset="UTF-8">
    <meta http-equiv="X-UA-Compatible" content="IE=edge">
    <meta name="viewport" content="width=device-width, initial-scale=1.0">
    <script>
        function* idMaker(){ //자바스크립트 generator
            var index=0;
            shile(index<3)
                yield index++
        }
        var gen = idMaker();
        console.log(gen.next().value)
        console.log(gen.next().value)
        console.log(gen.next().value)
    </script>
    <title>Document</title>
</head>
<body>
    
</body>
</html>

728x90

'Data Analysis > web crawling' 카테고리의 다른 글

[Crawling] 요소에 접근하기 (0)	2021.06.08
[Crawling] 부모 태그 접근하기 / sibling / generator (0)	2021.06.08
[Crawling] iterator 만들기 (0)	2021.06.08
[Crawling] get,[] 차이 / 반복문 이용 (0)	2021.06.08
[crawling] requests vs urllib / 파싱모듈 (0)	2021.06.08

[Crawling] iterator 만들기

2021. 6. 8. 10:55

728x90

class IterClass(object):
    def __init__(self,start,last):
        self.currnet = start
        self.max = last
    
    def __iter__(self): #없으면 'object is not iterable' 예외 발생
        return self
    
    def __next__(self):
        if self.currnet > self.max: #current의 값이 __next__ 호출시 마다 1씩 증가되고 10이 되면 여기 if문에 도달하여 예외 발생됨
            raise StopIteration
        else:
            self.currnet += 1
            return self.currnet -1 #보여주기

n_list1 = IterClass(1,10)

type(n_list1)

n_list1.__next__() #1
n_list1.__next__() #2
n_list1.__next__() #3

for 문으로 바꾸면

for i in range(0,10):
    print(n_list1.__next__())

배열이나 tuple list는 iterable 가능한 객체이다.

내부적으로자동으로 advancedfor 같은 구문에서 next가 자동으로 호출된다.

마지막에 도달하면 종료되는 것이다.

728x90

'Data Analysis > web crawling' 카테고리의 다른 글

[Crawling] 부모 태그 접근하기 / sibling / generator (0)	2021.06.08
[Crawling] Generator 만들기 / 자바스크립트 yield와 비교 (0)	2021.06.08
[Crawling] get,[] 차이 / 반복문 이용 (0)	2021.06.08
[crawling] requests vs urllib / 파싱모듈 (0)	2021.06.08
[Crawling] 크롤링 get요청, post요청하기 (0)	2021.06.07

[Crawling] get,[] 차이 / 반복문 이용

2021. 6. 8. 10:51

728x90

* 본 포스팅은 주피터 노트북에서 진행하였다.

from bs4 import BeautifulSoup
html = """<html> <head><title class="t" id="ti">test site</title></head> <body> <p>test</p> <p>test1</p> <p>test2</p> </body></html>"""
soup = BeautifulSoup(html,'lxml')
tag_title = soup.title
print(tag_title['class'])

tag_title.get('class') #get으로 class의 속성을 가져와라

#둘다 같은 결과

만약 오류가 뜬다면 이대로 입력해주자! 주피터 노트북에서는 코드 앞에 !을 붙이면 된다. cmd로 할 경우 !을 빼면 된다.

!pip install beautifulsoup4

!pip install lxml

tag_title.attrs #attribute

이 둘의 차이는

tag_title.get('class1') #속성이 없는 클래스 호출할 때 get은 오류안뜸

tag_title['class1'] #오류뜸

tag_title.get('class1',default="hi") #값이 없을 때

data_text = tag_title.text
data_text

data_text = tag_title.string
data_text

같은 값을 가져오지만 타입이 다르다

data_text = tag_title.text
data_string = tag_title.sring
print("text : ",data_text, type(data_text))
print('string : ',data_string, type(data_string))

tag_p = soup.p
tag_p

data_text = tag_p.text
data_string = tag_p.string
print('text : ',data_text, type(data_text))
print('string : ',data_string, type(data_string))

html = """<html> <head><title>test site</title></head> <body> <p><span>test1</span><span>test2</span></p> </body></html>"""
soup = BeautifulSoup(html,'lxml')
tag_p = soup.p
tag_p

data_text = tag_p.text
data_string = tag_p.string
print('text : ',data_text, type(data_text))
print('string : ',data_string, type(data_string))

조건문을 활용하여 데이터를 확인할 수 있다. 아래의 코드는 span태그가 있는지의 여부를 묻는다.

if tag_p.span.string !=None:
    print('있다')

contents 속성과 children 속성을 이용하여 자식태그를 가져올 수 있다.

contents 속성을 이용하여 list 형태로 자식 태그를 가져온다.

tag_p_children = soup.p.contents
print(tag_p_children)

tag_p_children = soup.p.children
tag_p_children #iterate

문제 ) 반복문을 이용하여 둘다 출력하기

예시

a_tuple = (1,2,3)
b_iterator = iter(a_tuple)
print(b_iterator.__next__())
print(b_iterator.__next__())
print(b_iterator.__next__())

이것을 응용하자

tag_p_contents = soup.p.contents
tag_p_contents #[<span>test1</span>, <span>test2</span>]

tag_p_children = soup.p.children
tag_p_children # <list_iterator at 0x243f02ea220>

for i in tag_p_contents:
    print(i)

for i in tag_p_children:
    print(i)

728x90

'Data Analysis > web crawling' 카테고리의 다른 글

[Crawling] Generator 만들기 / 자바스크립트 yield와 비교 (0)	2021.06.08
[Crawling] iterator 만들기 (0)	2021.06.08
[crawling] requests vs urllib / 파싱모듈 (0)	2021.06.08
[Crawling] 크롤링 get요청, post요청하기 (0)	2021.06.07
[Crawling] 데이터 보내는 방법 (0)	2021.06.07

[Crawling] 크롤링 get요청, post요청하기

2021. 6. 7. 17:21

728x90

* 본 포스팅은 주피터 노트북에서 진행하였다.

포스트 요청시 보낼 데이터 만들기

data = dict1 = {"key1":"hong","key2":"icebear"}
data = urllib.parse.urlencode(data)
data=data.encode('utf-8')
data

Post 요청하기

req_post = Request(url, data=data, headers={}) #2번째 인자 : data, 3번째 인자 : header

page=urlopen(req_post)
page

Get 요청하기

req_get = Request(url+"?key1=values1&key2=values2",None, headers={}) #2번째 인자 : data, 3번째 인자 : header

page=urlopen(req_get)
print(page)

data를 만들때는 encode 함수를 이용하여 바이너리 형태로 인코딩하여 전송하여야 한다.

728x90

'Data Analysis > web crawling' 카테고리의 다른 글

[Crawling] get,[] 차이 / 반복문 이용 (0)	2021.06.08
[crawling] requests vs urllib / 파싱모듈 (0)	2021.06.08
[Crawling] 데이터 보내는 방법 (0)	2021.06.07
[Crawling] html코드를 가져오기 (0)	2021.06.07
[Crawling] 파이썬 크롤링 시작하기 (0)	2021.06.07

PREV 1 2 3 4 ···6 NEXT

내 블로그 - 관리자 홈 전환	`Q` `Q`
새 글 쓰기	`W` `W`

글 수정 (권한 있는 경우)	`E` `E`
댓글 영역으로 이동	`C` `C`

이 페이지의 URL 복사	`S` `S`
맨 위로 이동	`T` `T`
티스토리 홈 이동	`H` `H`
단축키 안내	`Shift` + `/` `⇧` + `/`

아이스베어의 개발 일기

파이썬

[CodeUp] 코드업 기초 100제 6004번 풀이 - 파이썬(python)

6004 : [기초-출력] 출력하기04(설명)(py)

'알고리즘 (Python) > 코드업 기초 100제' 카테고리의 다른 글

[CodeUp] 코드업 기초 100제 6002번 풀이 - 파이썬(python)

6002 : [기초-출력] 출력하기02(설명)(py)

'알고리즘 (Python) > 코드업 기초 100제' 카테고리의 다른 글

[CodeUp] 코드업 기초 100제 6001번 풀이 - 파이썬(python)

6001 : [기초-출력] 출력하기01(설명)(py)

'알고리즘 (Python) > 코드업 기초 100제' 카테고리의 다른 글

[CodeUp] 코드업 기초 100제 6003번 풀이 - 파이썬(python)

6003 : [기초-출력] 출력하기03(설명)(py)

'알고리즘 (Python) > 코드업 기초 100제' 카테고리의 다른 글

[Crawling] Slack 파이썬으로 연동 / 채팅하기

'Data Analysis > web crawling' 카테고리의 다른 글

[Crawling] 가상 돔을 활용한 크롤러 / selenium

'Data Analysis > web crawling' 카테고리의 다른 글

[Crawling] Generator 만들기 / 자바스크립트 yield와 비교

자바스크립트의 yield와 비교해보기

'Data Analysis > web crawling' 카테고리의 다른 글

[Crawling] iterator 만들기

'Data Analysis > web crawling' 카테고리의 다른 글

[Crawling] get,[] 차이 / 반복문 이용

'Data Analysis > web crawling' 카테고리의 다른 글

[Crawling] 크롤링 get요청, post요청하기

'Data Analysis > web crawling' 카테고리의 다른 글

+ Recent posts

티스토리툴바

단축키

내 블로그

블로그 게시글

모든 영역