Gene GSU2898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2898 
Symbol 
ID2688617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3182745 
End bp3191156 
Gene Length8412 bp 
Protein Length2803 aa 
Translation table11 
GC content65% 
IMG OID637127591 
Producthigh-molecular-weight cytochrome c 
Protein accessionNP_953940 
Protein GI39997989 
COG category 
COG ID 
TIGRFAM ID[TIGR01904] Geobacter sulfurreducens CxxxxCH...CXXCH domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAACAG CGCTCGTCAG GAAAATCAGG GGAATGGGGA TACGGACTAA AATCGGCATT 
ACCGTAATGC TGGTGATGGC CTGTTTCCTG TACCAGGCCA TGTTCCGGCC CCTGATCGGC
GATACGGCCA CCAATACCTA TTACTTCACC CTTGATTCTG CAGCCGTAAA CCTGGGCGCC
GACGGCTACA CCACGACCAG GACCAGCCGC GACGGCAAAA TTTCCATGAA ACCGGGTGTC
TATACCACCT CCCGGTACGT TACCGCCTCC GTCAGCACCG CCGAGCAGAA CATGATCAGG
GCCTACGGGC CGGTCTATGC CACCAAGCAG ACCCTCACCG CCCCGTCGGT CACCATCGGG
ATGCGGGATC GCAACGGCAC CACCAACAAC ATGTACTGGA AGGCGTACGT GTACGCCTAC
AATCCGAAGG GGACCGCCAA TAACGGCGTG CTGCTCTGGA CCTCGGACGA GAAAGAGGCC
CACCCGGCGG TCCAGACGCC GCTGGAGCTG ACGTTTACCA ACCCCCAGCC GAAGGATGTC
GAGGCCGGCT ACCGGCTCAA GGTGGTCGTC ACCTGCCGGA TGGCGAGCAC CTCGTCGTCG
GCGCGCTTCT ACTGGGGCAA CAGCACCAAC TACTCCTATT TCACCGTGAC CGAGGCCCCC
TATGTGGCCA ACTCGGTGAC GGTCAACAAC CTGTCCGACT ACTACAGCGG CCAGCTCGCG
TCGGTGACCC AGGGGGACGG CGCTATCCCC ATGCTCAAGA TGGACCTGTA CTCCAACGTG
AGCGGCGGAG CCACCTGGAG CGGCGGCAAG CTCGACAAGA TCGGCACCAA CACCAGCGTC
TACGTGAACG AGGACGAGCC GGGCGACGTC ACCTTCTCCA TCTTCAAGGA TGCCAACGGC
GACGGGTTGT TCCAGAAGAC CGACACCCAG GTCGGCGGGC CCTACACCTT TACCCAGCTC
ACCGGCCAGG CCTATGAGCT GGCAACCCCG CAGACCATTA CCACGACGCC GCAGCGCTAT
TTCATCGTCT ATAGTATCGC CCGGAACTCC ACCTACGGCA CCACCGTGGG AGCCCGGGTG
GCCAACAGCT CCTACTTCGC GGTCACCGGC GCGGCCGGCG GCGTGGTGAA CGTTACGAGC
ACCTCCTCCT CAACCCCCAC CATCCAGTAC GGCGGCACCG CGGTCACGAA GATCTATGCC
GCGGACTGGG ACGAGGGCAC GACCCTGGCG GGGATTTCCG AAACGGGCGG CCCCTCCGCC
ACCGACACCG CCTGCATCAC CAGCAACACC ACCGGGTCAG GCTTCCCCCT CGTGGGCCTG
CTCAATTACC CGAGCCACAC CTGCGCCAGC GTTGCCGGGC GGGGATACTC CGCCACCACC
GCCCAGCCCG ATTTCATCAG GCTCTACTTC GCCGGGGCCG GATATCATTC GGTCATGAAA
ACCATCAAGG GCGGATCCTT TGTCTACCGG GTCTATACCC CCTTCGGCGG CGGCACGGTG
ACGCTGCAGC TCTTCTACGT GACCAGCGAC GGGGTCAGGG TCAACGCCCC CATAACCTCC
CGCTACACCA CCGGCAGCTC GATCAGCCAG ACCATCACCA CCTCCCTGGC CGGACAGGAT
TTCAGCAATG TGCCGGCCGG CGCCCGGCTC GGCATCCAGA TCGGCGTCAC CGCCGGCATG
CGCATCGGTC TCGGCAGCTC GGTCAATGCC CAGCTTATGG TGCAGGAGAC TGCGGCCGAG
AACGAAAACG TGGACGTGGG CAACGGCTCG GCCATTCCCA ATGCCACTGT CTATGCCTCG
GACGTCAATA AGGTCATCGA CAGCTTCACC CTGACTGCGG CCAAGGCCAA GACGGTCACG
GCCGTCACCA TCAAGGGGAA CGCCACGTTC ACCGGCACCA ATATCAAAGA GGTGAAGCTG
TACGCCGATG CCGGCACCAT CGGCACCCTT GACGGCAGCG ACACCCTGCT CGGCTCCACC
GCCACGATTT CCGGCAACAC CGCGACCATC TCCGGGCTCA ACCTGGCGAT CAGCACCGCG
GTCAGGCGCT ACCTGGTGGT GGTGAACATC GGCGATGCCC CCAACACCAA CGTGATCCTC
ACCGCCGTTG TCGACGACCT GACGGTAGCC TCCACCGGGG GGATCGGGGT GGACAACGAC
AGTACGTCGG CCACGCTGAC CATTCTGCCC ACGACCACCC TGAGCGATTT CATCGCCGCC
GAGCCCCCCA ACGCGATCAT TCCCTGGAAC GCCGGCCCCA CCAAAGTGGA TGCCTTCGGA
CTCAGGACCA ACGGCGGCGT CAACGACACG ATCCGCAACG TCACGGTCAC CCTGTCGACC
ACCAGCGGGC TGCCGGCCGG CAAGGTCATC TCCGACTACG TGGGACGGGT GGATATCGTC
ACTGCCGCGG GCACGTCCCT GGGGCACCTG ACCGCCCCCA CCATGGCCGA CAACTGGCAG
GTCCCCACCA CCGGCCTGGC CGCCACCCAG ATCCCCACCG ACTACTACGT GGCCATCACC
CCCAAGGGGA ACCAGGGGAT CACCTTCACC GTCAAGGGCC GGGTAACGTC GGTGACCCAT
TCCAGGACGA CGAACGCCCT GCTGGTGAAC GATGCCGGCA GCGCCACCAT CCTGATGGAC
GAAGAGCCTC CCAACGAATC GTCCCTCACG GCCGTCACCG GGACCTACCA CAACGACACC
GACCGGGCCG AAGTCAACCT GAGCTGGCTC GGCACCACCG ATGCCGGCGG CCAGCCGGTC
ACCTACAAGC TGGTGCGCGG CCTGGGCAAC GCGCCGGCCC CGCGTACCTG CACGGTGGAT
AACGCCAAGA CCTTCCTGGC CTACCAGGGC CCCGCCACCT CCGTGGTCGA CAAGGGGCTG
GACGAAGGGG TCAACTACGG TTATCGCCTC TGTGTCATCG ACTCGGTCAA CAATATCAAC
GCCGGCGTCA CCGCCAGCGC AACGGCGGCC ATCAAGAACC GGTGCAACGA GCTGCCGGAG
CTGATCGTCA ACCCCACGGC GTCCTATGTC AAGGCGGGCA CCACCGTCAA GCTGACCATC
GGCATCAAGA ACAAGGATAC CGGGGTCTGC GGACCCACCA CCTTCAGCCT CGTCACCCAA
GGCACGAACA TCGACGACAG CAACTTCACC GTTGCCGCCT TCGAGGCCAA CGATTTTGTC
ATTTCGACCA ACAACGGCTC GAAATACACG CACCTGGACA TCACGGCCAA GCCGGGAGCC
ATCGAGGGGG CGGTCAAGAC CTTCCACGTG AAAGTGGTCA AGAGCAGCGG CGGGGAAACC
GTGTGCCCCG ATCCCATCGA GGTCGTGGTC AACAAGTATG GCACCATGAT GCACAGCAGC
CTGCAGCTCG GTACCCAGAA GTACGGCAAG TGGGGTGTCA ATTTTACCTG CAGTACGTGT
CATTCTCCCG ACGCGACCAA CATCAAGCAG GTCAGGAATG TGATAACCAC GCCCACCGGA
CCCCGTCCGG TCCTCTTCGA CACGATCTCC ACGGCCATCA ATGCGAATGT CGCGGGCGTG
TTCGGCAATG ACAGGAGGAG CGGCACCGCA TCCACCAACG TGTGCGAGGT CTGCCATCAC
CGGGCCCGGT TCCACCAGTA CAGCGCCGCC AAGGTTGCCT GGAAGGATCA TAACAACAAC
GGCGACTGCC TGAAGTGCCA TCCCCACAGC ATCGGCTTCA AGACGAAAGC CACCGGCCAG
TCCTGCGACG ACTGCCACGG CAATCCGCCC ACCAGCTACG AGATGCTGGT GGTGCCGCCG
ACCGAGGTCC TGTTCCCCTT TGCCAGCAAT GCCGGCTCCC ATGGGAAGCA CAATGCGCGG
CAGGTGACAT GTACGGCCTG TCACAGCAAC GCCAACCACC TGGTCACGGC GACCCCCGAC
ATGCAGTTGA ACCTGGGTTT CAGCGTCGCT AACGGAACCT TCCCCGGCTT CGTGGGGAGC
GTCACCACGG GCACGATACG CACCCTCGCG CCGGGCAACG ACTATTCCTG GTCCGGCGCC
GCGGGCACCA CCATCCAGCA GGCCCCCAAT ACCATCATGA CCTGCAGCGT CTATTGCCAC
GGCTGGGAGG GCAACGGCGG CTACAACACC GAACCGGCCT GGACCGGCAT CACCCAGGTG
GGGTGCGGCT CGTGCCATGC CGCCACGGCA GATGTCCCGC CGCCGTCGGG GAGCCATGCC
AAGCATGCCG GCAACGAGCC CGGCTACGGC AACGGCATCG CCTGCGCCAA GTGCCACGGC
TTTCGCAACT ATTCCACCAG CGCGTCCCAC ATCAACGGCA ACGTGGAATG GGACCTCGCC
GCCAACAGCA CTACTGCCAG GTACGCCGGA GTCGCGGCAG GCTCCACCGG CGCCAAGGCC
CCCACCGCCC CGGGCAGCTA CGGCACCTGC TCCAACCTGT ACTGCCACAG TGATGTCCAG
TCCAACAACG GCACCGGCGG TCCCACCAGC TTTGCAACGC CGGTCTGGGG CGGCTCCACC
AACTGCAACA GTTGCCACCA GGCCGATCCC AACACCACCG GCGGCCATCC CCAGCACGCC
GGAGAAGAGG TGACCGGCTT CGACTGCCGC ATCTGTCACG CCAACGGCGG GAGCACCAAC
TCCCTCAACC ATGGCAACAG CAAGATCAAC TTCATGTTCA CGGGCCTTGG GGAGAATACC
CACTATTCCT ACAGTTCCGC CAAGACGCCC GGTTCCGCCC CCTACGGCAC CTGCTACAAC
GGCAACTGCC ACGGCGCCCG CCGGACCCTC GCCTGGGAAC CGCCGAACCA CGCCGTTCCG
CTCTGCGAGA AGTGCCACAC CACCAGCCCG TCCGCCGCAG GGTTCTACAG CACCTCCGGA
CCCGGCAGCA CCACGTCCAA GACCGATGCC TACGTGGGAG CGCACTTCCA GCACATCACC
TCCATGCCCT TCAGGTATTC GGCCAGGATC GACTGCTCCG GCTGCCACCT GAAGCCCACG
GGCCCCTACA CCCCTGGCCA CATCGATTCG GCCCTGCCGG CCGAGGTCAT CTTCGGCGCC
ATTGCCGGCA GCGGCGTGCA GAACGGCTAT TCCAGCGCAG AGCACCAGCC GTCCTACAAC
TATGCGAGCA GGGAGTGCAG CAACGTCTGG TGCCACGGCG GCGGCATGGC CTCCAACGTG
GGGGCCGGCC CCTACGGCTC TGCCGTCACA GACGGCGCTT CCCTCGGCTC GCCGGCGCCC
GCCGTCTGGA ATTCTCCCTA TCTCACCGGC GTGGGCACCA ACGACTGCGT CAAGTGCCAT
GCTTTCCCGC CGGCAGCGCC GCTGCCCGGC TACACCCACT GGGACGACAA CAACAACAGG
CCGTTCGTCG CCAACCAGTG CATCCTCTGC CACAAGCATG TGGACAACAC GGGATACGCC
TTCAAGGACC CGAAACTGCA CGTCAACGGC GTTGTGGACA GTTGCAATAC CTGTCACGGC
CGGCCGCCCG TCGACGAGGC CGGCATGACC ATCCCGGCCG TGGGAGCCCT TACCCCCGGC
ATGGTGGGCG CGCACCAGGC CCACGCCCTC AATCCGAGCA TCGGCAAGGA TTGCAACGTC
TGCCACTACC AGTACTCGCA GGAGATGCCG AGCTACGACA TGGAGATGGG CTTCAACGCC
TATGGCGGCA GGGTCACCAG CGGCACGTTC TACGGCTATT CGACCCTGTC CGACAACTAC
TCGCCGAGAA TCGTCTACAA ATCCACCAAT GCGGGAACGG TGGTCCGCCG GACGACCAAC
GCGGATACCC TCAACACCTG TGCGAACCTC TACTGCCACG GCGGCGGAAC GTCCACCCGG
GCGGCGCTGC AGGGTGGGAG CAACACCAGG CCCAACTGGG AAGGCGGTTC GTCCCAGGCC
GCGTGCGGCA CCTGCCACGG CGTGACCGCC GACACCTACC ATGCCACCGG GTCCCATGAC
GCCCACGTGA GCACCGCCTT TGGCAAGCCG CGTCTCGGCT GCTCCAACTG CCACGGCGTC
AAGGAGAACA ACTACCACGT GGACGGCAAG GTGGAATGGG CGTTCTACAG CACGGCCCAA
CGCCTGAACC AGAAGGTCGC CAACCCGCAG TACACCCCGG CCGCCGGCAA CGGCACTGCC
GGTGCCAGCG GCGCGACCAA CGGCCTGGCG CCGAGCACCG CCTTCGGCAC TTGCGCCGTC
TACTGCCACA GCGACGGCAG GGGCAACTAC GCCAGCCCGC TGCCGGTCTG GGGCGGCGCG
CCCATGAACT GCGGCAGCTG CCACAAGAAC CAGACCTCGG CCTTCACCGA CAGCCACCAG
AAGCATTCCG CCAGCTCCGC CAACGGCGGC TACGGCATCG ACTGCTTCAT CTGCCACCTG
GGTTCCGGTT CCGGCAATCC CAAGCACGTG AACGGCGATA TCGACGTGGT CTTCAACTCT
ACGGTGGTGG GCGTGACGGC CACTTACGAC AGCGGTGCCA AGAAGTGCTT CAGCATCCTC
TGCCACGACA CCACGGCGGT GGCCGGACCC ACCTGGGGCG TCCCGAGCAC CGGCACCTAT
GACGGCGGCA CCCACAAGCC CACCTGCATC GGCTGCCACA GTGGCGAGGT GAACACCCGC
GCCGCGGTGA TCCCGCAGTT CGGCGGCGAG TCGCACCACG TGCAGGGCGT GCAGATCAGC
AACACGGTCT GCTACCAGTG CCACTGGGAG GCCAACGCAA ACGGCACCGC CAATACGACC
TATCACACCA GAACGGCCGG CCAGCCGGTC AACCTGGTCA TCCGGACCAC TACCTCAAGA
CCGGTAGCCT ACACCGAGGG GAGCACCGGC ACCGCCTATA CCTCCAACGG CACCCGGACG
GAGCTTGCCA AGCTCAACAG CAACTGCCTC GGCTGCCACA ATGCCACCAA CGCGGCCAGC
CAGCCCTTCG GCGACGGCAT GACGCCGACG CAGTACGCAT GGGACGGCAA GAGCATTGCC
GAGCGGTACA GCGTTGCCAC TACCACCACG TGGGCCAAGG TCACCGGCAA CAACACGGTA
GCCAAGAGCC TCACCAAGGC CTATTCGGCC CACGGCCGCG CAGACCTGAA CCAGCGGGGC
TGGACCGTCG GGAACAGCAC CACCGGCGAG GTCTATGCCA ATACCAGCGG CACGGTCAAC
GTGCTCTGCT ATGACTGCCA CAACTCCCAC GGCACCAGCG CCACCGGCAT CATGAGCAGC
TACTCCAGCG CCACCGGCCG CAACATGGGC GGTATCCTCA AGGCAACCAG CAACGGCATC
GGCGGCTACA CCGCCGACTA CACGCCGTAT GCGGGGGGCG ATGCCGTCGA GCCCAACAAG
AACGCCTACA ACCCGGGTGC GGCCCTCTGC TTCGACTGCC ACAATACCGC CAGCGCCGGC
GCAACCGCTC CCTGGGGGTA CGGTCCGACG GCCAGCGGCG GGACCTTCGG CTCCACCCAG
GCCATCTACG GCTACCACGA TACGCCGTAC TTCGGCAGCG GCACCTTCGC CAACACCCAG
ACCTATGCCT ACAAGGCATT GAACCCCGAC AACAAGGGCG GGCACTTCGG CGCATCCTCC
AGCCTGACCA CCACCGCCGC CAAGCCCATC AACGGCCTCT GCACGCCGTG CCACGACCCC
CACGGCGTGA GCCCGTCCCT GGGAGCCAAC CAGGCCTATG CCGTACCCCT CCTCAAGGGG
ACCTGGGTCA CCTCGCCGTA CAAGCAGGAT GCGGCGCCGG CCAGCAAAAC CGAAGCCCGC
GGCGGCGGCA AAAAGAGAAG TGCCATGAAC GTCGGCAGCA CGCCGGGCTA CCGGATCGAC
CAGAACAGCA TGGGGATTGC CGCAGCAGCG ACACGCAGCC AGTGGACCTT CCCCAACAAC
GCATCCAGCC AGACACCGAG TACCATGCAG GGTACCACCG ATGCCCAGTT CGCCGGACTC
TGCACCGGCT GTCACGCCCA GGCCGACCTC AACAACACGG CAGCGCCGGC CACCTCGAAC
TGGAAGACCA TGCGGCGGGT CCACAACACG GTCAAGGGAT GGGCGACCGC CAGCGGGGGC
AATGCCAACA ACAAGGTCCA CGCCTTTACC TGCTCCAAGT GCCACACGCC GCACAACGCC
AAGCTGCCGC GCCTGCTGGT GACCAACTGC CTCGACGTGA AGCACCGCGG CCGGGCGGCC
TCCGGCGGCA GCATGACCGG CCCAGCATCC CAGAGCGGAT CCAAGGGGGC GGGTGTGGGT
CGCTTCCCGC AGGGGGGCGG CGGTACGGGA GACCAGCCCT TGGGCACTGC GGGCAAGTGG
TTCTTCGGCA AGGCCACTCA ATCTACCAGC ATAACCACAA ACAGCCAGAC CCTCTGCCAC
CAGAGCGCCA CCGCCGGCGG CTCGACCTAC TCGCAGGACG GGCAGTTGTG GAATACGAAG
TCACCGTGGT AG
 
Protein sequence
METALVRKIR GMGIRTKIGI TVMLVMACFL YQAMFRPLIG DTATNTYYFT LDSAAVNLGA 
DGYTTTRTSR DGKISMKPGV YTTSRYVTAS VSTAEQNMIR AYGPVYATKQ TLTAPSVTIG
MRDRNGTTNN MYWKAYVYAY NPKGTANNGV LLWTSDEKEA HPAVQTPLEL TFTNPQPKDV
EAGYRLKVVV TCRMASTSSS ARFYWGNSTN YSYFTVTEAP YVANSVTVNN LSDYYSGQLA
SVTQGDGAIP MLKMDLYSNV SGGATWSGGK LDKIGTNTSV YVNEDEPGDV TFSIFKDANG
DGLFQKTDTQ VGGPYTFTQL TGQAYELATP QTITTTPQRY FIVYSIARNS TYGTTVGARV
ANSSYFAVTG AAGGVVNVTS TSSSTPTIQY GGTAVTKIYA ADWDEGTTLA GISETGGPSA
TDTACITSNT TGSGFPLVGL LNYPSHTCAS VAGRGYSATT AQPDFIRLYF AGAGYHSVMK
TIKGGSFVYR VYTPFGGGTV TLQLFYVTSD GVRVNAPITS RYTTGSSISQ TITTSLAGQD
FSNVPAGARL GIQIGVTAGM RIGLGSSVNA QLMVQETAAE NENVDVGNGS AIPNATVYAS
DVNKVIDSFT LTAAKAKTVT AVTIKGNATF TGTNIKEVKL YADAGTIGTL DGSDTLLGST
ATISGNTATI SGLNLAISTA VRRYLVVVNI GDAPNTNVIL TAVVDDLTVA STGGIGVDND
STSATLTILP TTTLSDFIAA EPPNAIIPWN AGPTKVDAFG LRTNGGVNDT IRNVTVTLST
TSGLPAGKVI SDYVGRVDIV TAAGTSLGHL TAPTMADNWQ VPTTGLAATQ IPTDYYVAIT
PKGNQGITFT VKGRVTSVTH SRTTNALLVN DAGSATILMD EEPPNESSLT AVTGTYHNDT
DRAEVNLSWL GTTDAGGQPV TYKLVRGLGN APAPRTCTVD NAKTFLAYQG PATSVVDKGL
DEGVNYGYRL CVIDSVNNIN AGVTASATAA IKNRCNELPE LIVNPTASYV KAGTTVKLTI
GIKNKDTGVC GPTTFSLVTQ GTNIDDSNFT VAAFEANDFV ISTNNGSKYT HLDITAKPGA
IEGAVKTFHV KVVKSSGGET VCPDPIEVVV NKYGTMMHSS LQLGTQKYGK WGVNFTCSTC
HSPDATNIKQ VRNVITTPTG PRPVLFDTIS TAINANVAGV FGNDRRSGTA STNVCEVCHH
RARFHQYSAA KVAWKDHNNN GDCLKCHPHS IGFKTKATGQ SCDDCHGNPP TSYEMLVVPP
TEVLFPFASN AGSHGKHNAR QVTCTACHSN ANHLVTATPD MQLNLGFSVA NGTFPGFVGS
VTTGTIRTLA PGNDYSWSGA AGTTIQQAPN TIMTCSVYCH GWEGNGGYNT EPAWTGITQV
GCGSCHAATA DVPPPSGSHA KHAGNEPGYG NGIACAKCHG FRNYSTSASH INGNVEWDLA
ANSTTARYAG VAAGSTGAKA PTAPGSYGTC SNLYCHSDVQ SNNGTGGPTS FATPVWGGST
NCNSCHQADP NTTGGHPQHA GEEVTGFDCR ICHANGGSTN SLNHGNSKIN FMFTGLGENT
HYSYSSAKTP GSAPYGTCYN GNCHGARRTL AWEPPNHAVP LCEKCHTTSP SAAGFYSTSG
PGSTTSKTDA YVGAHFQHIT SMPFRYSARI DCSGCHLKPT GPYTPGHIDS ALPAEVIFGA
IAGSGVQNGY SSAEHQPSYN YASRECSNVW CHGGGMASNV GAGPYGSAVT DGASLGSPAP
AVWNSPYLTG VGTNDCVKCH AFPPAAPLPG YTHWDDNNNR PFVANQCILC HKHVDNTGYA
FKDPKLHVNG VVDSCNTCHG RPPVDEAGMT IPAVGALTPG MVGAHQAHAL NPSIGKDCNV
CHYQYSQEMP SYDMEMGFNA YGGRVTSGTF YGYSTLSDNY SPRIVYKSTN AGTVVRRTTN
ADTLNTCANL YCHGGGTSTR AALQGGSNTR PNWEGGSSQA ACGTCHGVTA DTYHATGSHD
AHVSTAFGKP RLGCSNCHGV KENNYHVDGK VEWAFYSTAQ RLNQKVANPQ YTPAAGNGTA
GASGATNGLA PSTAFGTCAV YCHSDGRGNY ASPLPVWGGA PMNCGSCHKN QTSAFTDSHQ
KHSASSANGG YGIDCFICHL GSGSGNPKHV NGDIDVVFNS TVVGVTATYD SGAKKCFSIL
CHDTTAVAGP TWGVPSTGTY DGGTHKPTCI GCHSGEVNTR AAVIPQFGGE SHHVQGVQIS
NTVCYQCHWE ANANGTANTT YHTRTAGQPV NLVIRTTTSR PVAYTEGSTG TAYTSNGTRT
ELAKLNSNCL GCHNATNAAS QPFGDGMTPT QYAWDGKSIA ERYSVATTTT WAKVTGNNTV
AKSLTKAYSA HGRADLNQRG WTVGNSTTGE VYANTSGTVN VLCYDCHNSH GTSATGIMSS
YSSATGRNMG GILKATSNGI GGYTADYTPY AGGDAVEPNK NAYNPGAALC FDCHNTASAG
ATAPWGYGPT ASGGTFGSTQ AIYGYHDTPY FGSGTFANTQ TYAYKALNPD NKGGHFGASS
SLTTTAAKPI NGLCTPCHDP HGVSPSLGAN QAYAVPLLKG TWVTSPYKQD AAPASKTEAR
GGGKKRSAMN VGSTPGYRID QNSMGIAAAA TRSQWTFPNN ASSQTPSTMQ GTTDAQFAGL
CTGCHAQADL NNTAAPATSN WKTMRRVHNT VKGWATASGG NANNKVHAFT CSKCHTPHNA
KLPRLLVTNC LDVKHRGRAA SGGSMTGPAS QSGSKGAGVG RFPQGGGGTG DQPLGTAGKW
FFGKATQSTS ITTNSQTLCH QSATAGGSTY SQDGQLWNTK SPW