Nyaya yema duplicate mitsara

Kuedza kudzima marekodhi muExcel uchishandisa yemahara software

Mumwe mutengi akauya kwandiri aine dambudziko. Ndakanga ndine dhatabhesi muExcel spreadsheet ine marekodhi anopfuura miriyoni, mashoma acho akadzokororwa. Apa ndipo pakatanga kuyedza kwangu kugadzirisa nyaya yemitsara yakapetwa.

Sezvo ini ndisingade kutambisa nguva yevaverengi, Ndinokuudza kuti nyaya yacho yakapera kugadziriswa nenzira iri nyore, uchishandisa iyo Excel 365 basa iyo inoshanda chaizvo kubvisa duplicate mitsara. Nekudaro, seyakanaka linuxer ndakaedza nzira dzakaomarara kutanga. Ndizvo zviri kutaurwa nechinyorwa ichi

Nyaya yema duplicate mitsara

Izvo zvinofanirwa kutaurwa kuti, kunyangwe LibreOffice Calc isina dambudziko kuvhura 136 MB memory file uye inogona kutariswa zvizere uye kushanda nayo, zvaisaita kuchengetedza, mune ese mafomati emuno uye eMicrosoft. Ndine komputa ine 8 GB yekuyeuka uye AMD A6-9500 processor, asi iwe unofanirwa kuedza kuichengeta kana iyo application yaizovhara kana komputa ichaparara.

Kazhinji ini ndiri kufarira kushandisa iyo yekutanga application iyo gwaro rakagadzirwa, iyo inoponesa yakawanda misoro yemusoro (Mazhinji acho imhosva yeMicrosoft maapplication ayo asingawanzo kubvuma kukanganiswa) asi iyo Windows 10 chikamu chandinowanzo kushandisa. zvemacases aya zvakaramba kushanda, so before reinstalling i decided to try other things.

mapere nenyoka

En imwe nyaya Ini ndatokuudza matanho ekuisa zvirongwa kubva kuPip package maneja mune itsva shanduro dzeUbuntu Sezvo ruzivo rwekugadzira macros muLibreOffice Calc iri shoma uye yapera, ndakafunga kushandisa Python script.

Chaizvoizvo, tinobiridzira. Iyo python script inotora iyo data kubva kuExcel spreadsheet, uye inoipinza kunze uchishandisa raibhurari yekuongorora data. kudana Pandas. Kamwe raibhurari iyi ichinge yagadzirisa data, nyowani Excel spreadsheet inogadzirwa.

Ndakafunga kuyedza chimwe chinhu chiri nyore, ndichikumbira kuti chindiudze huwandu hwemitsara ine duplicate data. Maitiro acho ndeaya anotevera (Yeuka kutarisa kutanga pachinyorwa chandakuratidza)

pip3 install pandas openpyxl Uyu murairo unoisa maraibhurari anotibvumira kushanda nedata uye kupindirana neExcel templates.

Ndakabva ndavhura text editor ndokunyora kodhi inotevera.

# Importa la biblioteca Pandas
import pandas as pd

def detect_duplicate_rows(nzira):
# Rodha iyo Excel faira mu pandas DataFrame
df = pd.read_excel(nzira)

# Tsvaga yakapetwa mitsara
duplicates = df.duplicated()

# Ratidza nhamba dzemutsara dzemitsetse inodzokorora
duplicate_lines = duplicates[duplicates].index.tolist()
kana duplicate_lines:
print("Mitsetse inotevera yemitsara yakadzokororwa yakawanikwa:")
yemutsetse mune duplicate_lines:
dhinda(mutsara)
zvimwe:
print("Hapana duplicate mitsara inowanikwa muspredishiti.")

# Excel faira nzira
excel_file = "file.xlsx"

# Fonera basa racho kuti uone zvakapetwa mitsara
detect_duplicate_rows(excel_file)

Ndakaichengeta seexcel1.py ndokumhanya nayo python3 excel1.py
Nekuti mhedzisiro yacho yairatidza kuti pakanga paine mitsetse yakawanda yakapetwa ndakafunga kugadzira script yaizodzima uye kubvisa iyo isina mutsara. Kodhi ndeiyi:
import pandas as pd

# Rodha iyo Excel spreadsheet
excel_file = 'file.xlsx'
df = pd.read_excel(excel_file, injini='openpyxl')

# Bvisa yakadhindwa data uchisiya chete mutsara wekutanga uine iyo data
df_no_duplicates = df.drop_duplicates(chengeta='kutanga')

# Nyora mhedzisiro yekubvisa dhizaini kune nyowani Excel faira
df_without_duplicates.to_excel('file2.xlsx', index=Nhema, injini='openpyxl')

Ndakaichengeta pasi pezita rekuti excel2.py ndokumhanya nayo nemurairo python3 excel2.py

Ini handizive kuti sei isingashande nekuti inopedzisira yavhara terminal mushure memaminitsi mashoma uye iyo faira nyowani haina kumbogadzirwa.kana. Zvakadaro, ndakapedzisira ndadzoreredza Windows neHofisi uye nekumhanyisa murairo uri nyore unounza Microsoft hofisi suite yakavakirwa mukati, zvichiratidza kuti reza yaOckham inoshanda kune nyika yesoftware zvakare. Zvakadaro, chirongwa chekutanga chandakanyora chakandibatsira kuti nditarise nekukurumidza kuti data rese rakapetwa rakabviswa. Zvakadaro, ndinoronga kuenderera mberi nekuferefeta nekuzivisa zvabuda.

Nenzira, zvichida imhaka yekushaikwa kwezvinyorwa, asi tsika yevanhu vanoshandisa maspredishiti kana paine vatungamiriri ve database inoda kuziva.


Siya yako yekutaura

Your kero e havazobvumirwi ichibudiswa. Raida minda anozivikanwa ne *

*

*

  1. Inotarisira data: AB Internet Networks 2008 SL
  2. Chinangwa cheiyo data: Kudzora SPAM, manejimendi manejimendi.
  3. Legitimation: Kubvuma kwako
  4. Kutaurirana kwedata
  5. Dhata yekuchengetedza: Dhatabhesi inobatwa neOccentus Networks (EU)
  6. Kodzero: Panguva ipi neipi iwe unogona kudzora, kupora uye kudzima ruzivo rwako

  1.   JF404 akadaro

    Wakamboedza kugadzirisa script kuti uone payanga ichitadza?
    Kunyangwe kana zvasvika kune vatengi, chakanakira kuunza mhedzisiro nekukurumidza sezvinobvira, nekuda kwekuda kuziva unogona kugadzirisa script uye edza kuona chikanganiso, nekudaro uchizvichengetera nguva uye kugadzira script inogona kukubatsira iwe ramangwana.

    1.    Diego ChiGerman Gonzalez akadaro

      Aiwa, hazvina kuitika kwandiri.
      Ndakaiisa pane zvekuita.
      Zvinoitika kwandiri kuti idambudziko rekurangarira, iyo faira yakakura kwazvo.

      1.    Juan akadaro

        Edza kushandisa XlsxWriter pane openpyxl

        1.    Diego ChiGerman Gonzalez akadaro

          Cherekedza. Ndatenda.

  2.   ngirozi akadaro

    Ndinokukurudzira kuti uyere kushandiswa kwegondohwe kwechinyorwa. Kurodha dataframes mundangariro kunoita kuti zvive nyore kuti mabasa akasimba ashaye zviwanikwa. Iwe unogona kunge uchifarira kuchengeta ziso pane polar

    1.    Diego ChiGerman Gonzalez akadaro

      Ndatenda. Cherechedza.

  3.   louis akadaro

    Kana iyo excel faira iine data chete, pasina mafomula, macros, magiraidhi, nezvimwe, yakanaka sarudzo ndeyekutumira kunze se CSV uye kuigadzirisa nemhando, nekudaro:

    1- Vhura koni (cmd.exe)
    2- Nyora: rongedza / rakasiyana file.csv > output.csv
    3- Vhura zvakare muexcel uye chengetedza seXLSX fomati

    Kana uchishandisa linux:

    1- Vhura Terminal,
    2- Nyora: ronga -u file.csv > output.csv

    1.    Diego ChiGerman Gonzalez akadaro

      Pfungwa yakanaka. Ndichazviedza

  4.   Gregorio ros akadaro

    Mashoko ekupedzisira ndinoita ndeangu. Handizivi kuti sei madhatabhesi akagadzirwa kana akasaashandisa gare gare uye ona kana spreadsheet iri nyore kushandisa, kunyanya dhatabhesi.