Naohiro Masukawa

The PDF Import Wizard makes data analysis easier than you might imagine

Extracting data from PDF files

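I sometimes analyze publicly available data, and now and then I want to turn a table in a PDF file into a data table. Data published by government agencies in particular is, surprisingly often, embedded as tables inside PDF files rather than provided in Excel or CSV format.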

 

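Until four or five years ago, I handled this by selecting the table in the PDF file, copying it, and pasting it into Excel or JMP. The pasted content was often not recognized properly as data, and in the worst cases I gave up and entered the data by hand.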

 

That is why the PDF Import Wizard, added in JMP version 15, was a godsend for me. Creating data from a PDF, which used to take a long time, now takes only a few seconds! The time saved can go toward analyzing the data in depth.

 

My previous blog post (on the Gender Gap Index) uses a table taken from a PDF report. Without the PDF Import Wizard, preparing that data would have been so tedious that I would surely have given up.

 

So in this post I will introduce what makes the PDF Import Wizard so good. Even if you already use this feature, there may be things about it you do not know, so please read on.

 

What you can do with the PDF Import Wizard

The PDF Import Wizard lets you adjust which data will be imported while checking a preview, before the PDF file is actually loaded.

 

From the JMP menu bar, select [File] > [Open] and choose the PDF file you want to import. The PDF Import Wizard starts automatically (*1).

 

nao_masukawa_0-1689308836580.png

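As shown in the figure above, the left side of the wizard displays a preview of the PDF. Tables in the PDF are detected automatically, and the detected tables appear on the right under "Table preview".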

 

Check here that the table you want to import has been detected. If everything looks fine, click the [OK] button and the table opens as a JMP data table.

nao_masukawa_2-1689309464655.png

In this example, the experimental data reported in a paper was imported with ease.

 

Next, I will pick out two great features and introduce them with real examples.

 

Great feature 1: Tables with the same column names are automatically concatenated into one

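Below is a PDF of the statistical tables on the employment status of regular-service national public employees (一般職国家公務員在職状況統計表), published by the Cabinet Secretariat (*2). Suppose we want a JMP data table containing, for each ministry and agency, the number of part-time staff, the difference from the previous year (persons), and the year-over-year ratio (%).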

nao_masukawa_0-1689309656146.png

Automatic detection of the tables in a PDF can be useful, but in practice you rarely want to import every table in the file. More often you want only one specific table.

 

In that case, click the [Ignore all tables] button at the top right of the wizard to clear the automatic selection.

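Then, in the preview on the left, go to the page that contains the table you want, and choose [Auto-detect this page] from the red triangle button at the top left of that page. Only the tables on that page are then detected.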

 

nao_masukawa_3-1689310355773.png

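For some tables, however, automatic detection does not work well. In that case, drag a rectangle around the area you want to treat as a table, and the table inside the rectangle is recognized. In practice, this drag-to-select method is the most convenient.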

nao_masukawa_4-1689310510280.png

After selecting the table, select "Concatenate tables with matching column names" on the right side of the preview and click the [OK] button.

nao_masukawa_5-1689310672493.png

The table that was displayed in two parts is then imported as a single table.

 

Ordinarily, two tables would be imported and you would then have to combine them into one with [Concatenate]. The wizard does this for you.
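To make the idea of concatenating by matching column names concrete, here is a minimal sketch in plain Python. It is not JMP's implementation: the function `concat_matching`, the `(header, rows)` representation, and the sample values are all hypothetical and chosen only for illustration. Tables whose header rows are identical are stacked, and tables with a different header stay separate.

```python
def concat_matching(tables):
    """Stack the rows of tables that share exactly the same header.

    `tables` is a list of (header, rows) pairs. This layout is an
    assumption made for the sketch, not how JMP stores tables.
    """
    groups = {}
    for header, rows in tables:
        # Tables are grouped by their full header, in order of first appearance.
        groups.setdefault(tuple(header), []).extend(rows)
    return [(list(header), rows) for header, rows in groups.items()]


# Hypothetical fragments of one table that a PDF shows in two parts.
part1 = (["Ministry", "Staff", "Diff", "Pct"],
         [["A", 120, 5, 4.3], ["B", 80, -2, -2.4]])
part2 = (["Ministry", "Staff", "Diff", "Pct"],
         [["C", 45, 0, 0.0]])
# A table with a different header is left on its own.
other = (["Year", "Total"], [[2022, 245]])

merged = concat_matching([part1, part2, other])
for header, rows in merged:
    print(header, len(rows), "rows")
```

Grouping on the full header means that a single renamed or missing column keeps two fragments apart, so it is worth checking the preview before relying on this kind of merge.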

 

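As a result, with only a little editing of the resulting data table, I was able to create histograms of the staff count, the difference from the previous year, and the year-over-year ratio, and to see which ministries and agencies are outliers.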

nao_masukawa_6-1689310808098.png

 

Great feature 2: Tables that span multiple pages are also concatenated into one

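This PDF file shows the results of a ski jumping team competition between nations (*3). It lists the results for 1st through 8th place, but they do not fit on one page and run across two pages. Each country has its own table, and I want to combine these tables into a single table as well.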

 

nao_masukawa_0-1689311359586.png

Here the PDF is shown in the preview. Select the target tables, check "Concatenate all tables into one", and click the [OK] button.

 

nao_masukawa_2-1689311639748.png

The multiple tables were imported as a single table. Unlike the previous example, some of the tables in this PDF file have no column names, so I used "Concatenate all tables into one".
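When some tables have no column names, there is no header to match on, so the only option is to stack everything. The sketch below shows one way that could work in plain Python. It assumes that columns are aligned by position and that short rows are padded with `None`; the post does not describe how JMP aligns columns internally, and the function `concat_all` and the team data are hypothetical.

```python
def concat_all(tables):
    """Stack the rows of every table into one list, aligning by position.

    Rows shorter than the widest row are padded with None. Positional
    alignment is an assumption of this sketch.
    """
    width = max(len(row) for rows in tables for row in rows)
    combined = []
    for rows in tables:
        for row in rows:
            combined.append(list(row) + [None] * (width - len(row)))
    return combined


# Hypothetical header-less tables taken from two pages of a results PDF.
page1 = [["1", "Team A", 1050.5], ["2", "Team B", 1032.0]]
page2 = [["3", "Team C"]]  # a shorter row, padded on output

stacked = concat_all([page1, page2])
for row in stacked:
    print(row)
```

Because nothing checks that the columns mean the same thing across tables, the stacked result usually needs some cleanup afterwards, which matches the data processing mentioned in the post.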

 

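Some data processing was needed after this, but without spending much time I was able to create a score plot for each team (four jumpers each). Some teams have widely scattered scores among their four jumpers and others do not, which is an interesting result.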

nao_masukawa_3-1689311780588.png

I will keep making full use of this wonderful feature and working hard at data analysis!

 

by Naohiro Masukawa (JMP Japan)

 

 

PDF files cited

*1: Rational design of a scalable bioprocess platform for bacterial cellulose production
https://www.sciencedirect.com/science/article/abs/pii/S0144861718312839

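*2: Cabinet Secretariat, statistical tables on the employment status of regular-service national public employees (一般職国家公務員在職状況統計表)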
https://www.cas.go.jp/jp/gaiyou/jimu/jinjikyoku/files/20220701_toukeihyou_gaiyou.pdf

*3: FIS SKI JUMPING WORLD CUP Official Results
https://medias2.fis-ski.com/pdf/2023/JP/3093/2023JP3093RL.pdf

Last Modified: Dec 21, 2023 2:01 PM