A client came to me with a problem. He had a database in an Excel spreadsheet with more than a million records, some of which were duplicated. That is where my attempts to solve the duplicate rows problem began.
Since I don't like wasting readers' time, I'll tell you up front that the problem ended up being solved the easy way, using the built-in Excel 365 feature that works perfectly well for removing duplicate rows. However, as a good Linux user, I tried the complicated ways first. That is what this article is about.
The duplicate rows problem
It should be said that, although LibreOffice Calc had no trouble opening the 136 MB file, and it could be viewed in full and worked on, it was impossible to save it in either the native or the Microsoft formats. I have a computer with 8 GB of memory and an AMD A6-9500 processor, but whenever I tried to save, either the application closed or the computer crashed.
I generally prefer to use the application in which a document was originally created, which saves a lot of headaches (most of them the fault of Microsoft applications, which rarely tolerate interference), but the Windows 10 partition I normally use for these cases refused to boot, so before reinstalling I decided to try other things.
Bears and snakes
In another article I already told you the steps for installing programs with the Pip package manager on the newest versions of Ubuntu. Since my knowledge of writing macros in LibreOffice Calc is limited and out of date, I decided to use a Python script.
Actually, we cheat. The Python script takes the data from the Excel spreadsheet and imports it using a data analysis library called Pandas. Once the library has processed the data, a new Excel spreadsheet is created.
I decided to try something simple first, asking it to tell me which rows contained duplicate data. The procedure is as follows (remember to look first at the article I pointed you to):
pip3 install pandas openpyxl
This command installs the libraries that let us work with the data and read and write Excel files.
I then opened a text editor and wrote the following code.
# Import the Pandas library
import pandas as pd

def detect_duplicate_rows(path):
    # Load the Excel file into a pandas DataFrame
    df = pd.read_excel(path)
    # Flag every row that repeats an earlier row
    duplicates = df.duplicated()
    # Collect the index labels of the repeated rows
    duplicate_lines = duplicates[duplicates].index.tolist()
    if duplicate_lines:
        print("The following duplicate rows were found:")
        for line in duplicate_lines:
            print(line)
    else:
        print("No duplicate rows were found in the spreadsheet.")

# Path to the Excel file
excel_file = "file.xlsx"
# Call the function to detect the duplicate rows
detect_duplicate_rows(excel_file)
I saved it as excel1.py and ran it with python3 excel1.py
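To make it clearer what df.duplicated() reports, here is a minimal sketch on a tiny in-memory table. The column names and values are invented for illustration and are not the client's data. It shows that, by default, only the later repeats of a row are flagged, while keep=False flags every member of a duplicated group.

```python
import pandas as pd

# Hypothetical toy data: rows 0 and 2 are identical; row 3 differs in one column
df = pd.DataFrame({
    "name": ["Ana", "Luis", "Ana", "Ana"],
    "city": ["Lima", "Quito", "Lima", "Cusco"],
})

# Default keep="first": the first occurrence is NOT flagged, later repeats are
later_repeats = df.duplicated()

# keep=False: every row that belongs to a duplicated group is flagged
all_members = df.duplicated(keep=False)

print(later_repeats.tolist())    # [False, False, True, False]
print(all_members.tolist())      # [True, False, True, False]
print(int(later_repeats.sum()))  # 1 row would be dropped
```

Note that the numbers printed by excel1.py are zero-based DataFrame index labels. With the default header handling of read_excel, index 0 is the first data row, which is row 2 of the spreadsheet.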
Since the result showed that there were many duplicate rows, I decided to write a script that would delete them and keep a single copy of each row. This is the code:
import pandas as pd

# Load the Excel spreadsheet
excel_file = 'file.xlsx'
df = pd.read_excel(excel_file, engine='openpyxl')
# Remove the duplicated data, keeping only the first row that contains it
df_no_duplicates = df.drop_duplicates(keep='first')
# Write the deduplicated result to a new Excel file
df_no_duplicates.to_excel('file2.xlsx', index=False, engine='openpyxl')
I saved it under the name excel2.py and ran it with the command python3 excel2.py
I don't know why it doesn't work: the terminal ends up closing after a few minutes and the new file is never created. In the end I restored Windows and Office and ran the simple command built into the Microsoft office suite, which shows that Occam's razor applies to the software world too. Still, the first program I wrote helped me check quickly that all the duplicated data had been removed. I plan to keep investigating and will report what I find.
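The same check can be tried at small scale. The sketch below uses an invented five-row table (not the real file) to show what drop_duplicates(keep='first') keeps, and then reuses duplicated() to confirm that nothing repeated is left. It says nothing about why the full-size run failed.

```python
import pandas as pd

# Hypothetical toy data: rows 2 and 4 repeat rows 0 and 1
df = pd.DataFrame({
    "id": [1, 2, 1, 3, 2],
    "value": ["a", "b", "a", "c", "b"],
})

# Keep the first occurrence of each row and drop the later repeats
deduped = df.drop_duplicates(keep="first")

# Verification step: no row in the result repeats an earlier one
has_duplicates = bool(deduped.duplicated().any())

print(len(df), "->", len(deduped))  # 5 -> 3
print(deduped.index.tolist())       # [0, 1, 3]
print(has_duplicates)               # False
```

The surviving rows keep their original index labels, which is why the scripts pass index=False when writing: otherwise the gaps in the index would be written out as an extra column.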
By the way, perhaps it is down to a lack of documentation, but people's habit of using spreadsheets when database managers exist is a curious one.
Did you try debugging the script to see where it was failing?
Even though with clients it is best to deliver results as quickly as possible, out of curiosity you could debug the script and try to find the error. That would save you time later and leave you with a script that may be useful in the future.
No, it didn't occur to me.
I've put it on my to-do list.
It occurs to me that it is a memory problem; the file is very large.
Try using XlsxWriter instead of openpyxl
Noted. Thank you.
I recommend that you measure the script's RAM usage. Loading dataframes into memory makes it easy for heavy jobs to run out of resources. You might be interested in keeping an eye on Polars.
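One way to act on that suggestion with nothing but the standard library is tracemalloc. The sketch below is only an illustration: peak_memory_mb and build_rows are hypothetical names, and build_rows is a stand-in workload rather than the real Excel load. tracemalloc only sees allocations that go through Python's memory allocator, so memory used by native code may be undercounted.

```python
import tracemalloc


def peak_memory_mb(func, *args, **kwargs):
    """Run func and return (result, peak traced memory in MiB)."""
    tracemalloc.start()
    try:
        result = func(*args, **kwargs)
        # get_traced_memory() returns (current, peak) in bytes
        _, peak = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()
    return result, peak / (1024 * 1024)


def build_rows(n):
    # Stand-in workload: a list of n small tuples
    return [(i, str(i)) for i in range(n)]


if __name__ == "__main__":
    rows, peak = peak_memory_mb(build_rows, 100_000)
    print(f"{len(rows)} rows, peak about {peak:.1f} MiB")
```

In the scripts from the article, the function passed in could be a small wrapper around the pd.read_excel call, which would show how much of the 8 GB is already gone before any deduplication starts.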
Thank you. Noted.
If the Excel file contains only data, with no formulas, macros, charts and so on, a good option is to export it as CSV and process it with sort, like this:
1- Open the console (cmd.exe)
2- Type: sort /unique file.csv > output.csv
3- Open it again in Excel and save it in XLSX format
If you use Linux:
1- Open a terminal
2- Type: sort -u file.csv > output.csv
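The same idea can be sketched in Python for anyone who wants to keep the header line in place and the rows in their original order, which sorting does not do. The function name dedupe_lines and the has_header flag are assumptions made for this example. Like sort, it compares physical lines, so a quoted field that spans several lines would not be handled correctly, and the set of distinct lines still has to fit in memory.

```python
def dedupe_lines(src_path, dst_path, has_header=True):
    """Copy src_path to dst_path, dropping repeated lines.

    Returns the number of data lines written (header excluded).
    """
    seen = set()
    kept = 0
    # newline="" keeps the original line endings untouched
    with open(src_path, newline="", encoding="utf-8") as src, \
         open(dst_path, "w", newline="", encoding="utf-8") as dst:
        if has_header:
            # Copy the header through without deduplicating it
            dst.write(src.readline())
        for line in src:
            # Compare without the line ending so a final line that
            # lacks a newline still matches its earlier copies
            key = line.rstrip("\r\n")
            if key not in seen:
                seen.add(key)
                dst.write(line)
                kept += 1
    return kept


if __name__ == "__main__":
    import sys
    # Usage: python3 dedupe.py file.csv output.csv
    print(dedupe_lines(sys.argv[1], sys.argv[2]), "unique rows written")
```

Because the file is read one line at a time, only the distinct lines are held in memory, never the whole table as a DataFrame.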
Good idea. I'll try it.
I second the article's closing remark. I don't understand why databases were invented if people are not going to use them afterwards, and I doubt a spreadsheet is really easier to use than a database.