Gene Gmet_0581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGmet_0581 
Symbol 
ID3740590 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter metallireducens GS-15 
KingdomBacteria 
Replicon accessionNC_007517 
Strand
Start bp636065 
End bp644803 
Gene Length8739 bp 
Protein Length2912 aa 
Translation table11 
GC content62% 
IMG OID637777855 
Producthigh-molecular-weight cytochrome c 
Protein accessionYP_383549 
Protein GI78221802 
COG category 
COG ID 
TIGRFAM ID[TIGR01904] Geobacter sulfurreducens CxxxxCH...CXXCH domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTTCG TACAGGGGGG AGGTGTTTTT GCCCCTGTAC GTCGTAACAA AGTAACGGAC 
TTTCCAGCAC TGAACGAGGG AACTTATATG GGAACGACAA TCGCCAGGCA GATCAGGAGC
ATGGGGCTGC GGACTAAAAT CGGCATCATT GTGATGCTGG TCGTGGGCTG TTTCCTGTAT
CAGATCATGT TCAAGCCGAT GATTGGCGAT ACAGCCACCC AGACCTATTA TTTCACCACT
GATTCCACAT CGGTGAACCT CGGCGCCGAT GGCAGCACCA GCACCGCGGC GTCGCTCGGC
GGCAAAATAT CCATGAAGGT CGGTGTCTAT GCCTTCTCCC GCAGTGTCAG CGCCGCGTCC
AATACCTCCG AACAGAGGAT GATCAGCGCC TATGGGCCGG TCTATGCCGC CAAGCAGACC
ATTAACGCGC CTGCCGTCAC GATCGGCGTG AGGGACCGCA ACGGCACCGC AAATGCCATT
TACTGGAAGG CCTACGTCTA CGCCTATAAC CCGGCGGTGG CCCCCGGCAC CGGCAACCCC
ACGGCTACCG GTGCCGCCAA TAACGCGAGA TTGCTCTGGA CATCGGACGA GATGGAGGCC
CATCCCTCGG TGCAGACGCC GCTGGACATG ACCTTCACCA ATCCTGCACT GCAGACCATA
GATGCCGGAC AGCGGATCAA GGTGGTCATC ACCGCCCGCC TGGCAAGTAC CGCCTCGTCG
GCCCGCCTCT TCTGGGGGGG CGGAAGCAAC TATTCGTTCT TTACCGTCAC CGAAGCCCCC
TATGTGGCTG ATTCGGTTAC CGTCTCAAAC CTGGCCGATT ACTATGCCGG CGGTCTCACC
TCCGTCACCC AGGGGGACAC CAATGTTCCC ATGCTTAGAT TCGACCTGTA CACCAACGTG
GCCGGGGGGG TCAACTGGTC GGGCGGGCTT CTGGATAAGA TCGGCACCAA TACCAGCGTC
CTCAACCCCC TTCTGCCGGT GGACGAACCG GGAGACGTCT CCTTTTCGAT CTACCGGGAT
GCCGATAATG ACGGCGTCTT CGAACCCACC GACACCCTGA TCGGCGGACC ATACAATTTC
AGCCAGTTGA CCAAACAGGG GTATTCGTTG CCGACTCCGG AAGCACTTAC CGCCACGCCC
CGGCGCTATT TCATCACTTA CACCATTCGC AAGAACGCGA CCTATAACAC CACTGTCGGC
GCCCGCATTG TGGATGGCAG TTATTTCACC GTCGACGCCG CGGCGACGAA CGGCGTGCGA
AATGTGACCA GCACGTCGTC CTCAACGCCC CTCATCCAGT ATGGGGGCAC CCCGGTCGTC
AAGAACTACG CGGCGGACTG GGATGCCGGC ACGTCACTGA CGAACATCGC CGAAACCGGC
GGGCCGGGCT CCACAACCTC CACCTGTATT ACCAGAACCA CGGCAGGGTC GGCATTCCCG
CTGGTCGGCC TGCTCAACTA CCCCAACCAC TCTTGTGCCA GTGTGGCGGG GCAGAACTAC
AGCAACACCA GCGGCAGCAC CCAGCCGGAT TTCGTCAGGC TCTATTTCAG CGGGGACGGC
TACCACTCCG ATATGCTGTC CATCAAGGGG CGCAGTTTTA CCTACCGGCT CTATACCCCG
TCAGGCGGCG GCACGGTCAC CCTGCAGATG TTCTACGTCA CCAGCGGGGG CGTGCGGGTT
AATGCCCCGA TTGCATCGAC CTACAAATCC ACCGGTACTC TCAGCCAGAC CATTACCACC
TCCCTGTCCG GTCAGGATTT TTCCAATGTC CCCCTTGGCG CCCGCCTCGG CATCCAGATC
GGCGTCACCG CCAATGCGCA GATCGGCCTC GGCGGCGCGG TCGGCGCCCA ACTGCAGGTG
GAGGAAACGG CGGCCCTGAA TGAAAACGTC GATGTGGGCG ACGGTTTCCC CAGTGTCAAT
GCCAACGTCT ATGCGGGCGA CACCAGCAAG GTGATCGATT CGTTCACCTT GAGCGCTTCC
AAGGCAAAGA CCGTCTCTTC GATTTCGATC AAGGGCAACC CCCTGTTCAA CAGCACAAAT
ATCAAAAATG TCTGGATTTA TCAGGACAAC TCGACCGGCG GCATGCTTGG GGCCCTTGAC
GGCACCGACA CCCTGATCGG GTCCACCTCC GTAATCACCG GCAACGTCGC GACGGTTTCG
GTCGGCAGCC TGGCCATCGA CAACAAGACC AAGCGATTCC TGGTGGTCGT CGATATCGGC
GACACCCCCA ACACCAACGT AATCCTGAAC GCTCTGGTCA GTGACCTGGC GGTTGTCACT
TCCGGCACCA TCGGCGAGAA CAGCGACAGT TCGTCGGCGA ATCTGACGAT CCTGCCGACC
ACGACGGTGA GCGGCGGCAC TGCCGAGCCT CCCGGCGTGA TCGTTCCTTC CGGCGCCGGC
GCCACCAAGC TGGATGCGTT CAATCTTGCC ACCAACGGCG GGGTCAACGA CACGTTCTCC
AGCGTCACGG TCACGCTTTC GACGACCAGC ACCCTGCCCG CGGGCAAGGT CATCTCCGAC
TACGTTGCGC GGCTTGATAT CATCAAGGCT GACGGCACTT CCTTCGGGCA CCTGACTTCT
CCCACCCAGG TCAATAACTG GCAGGTCACG ACCACCGGCC TGGCCGCCAC CACCACGCCC
ACCGATTATT ACGTGGCCGT TACTCCCAAG GCGGGCCAGG GGATTACCTA CGACGTCAAG
GCCCAGGTTG TCTCGGTGGT CCACTCCAGG ACGACCAACC GGCTGCTGCT GAGCGACCCG
TCCAGCGCAA CGGTCATCAT GGACCAGCAG CCGCCGACCG ACCCGACCCT GACTGTCGCC
ACCGGCACCT ATCACAATGA TATTGACCGC GCCGAGATCA ACCTGAACTG GACCACCGCG
ACCGATACCA GCGGGAGCGC GGTCAGCTAC GTTGTCGTGC GTGGCCTGGG GAATGCGCCG
CCGCCACGCA ATTGCACGGT GGACAACGTA AAGACCTTCC CGGTTTACTC CGGCACGGGC
ACATCACTTA TCGACAAAAA CCTGGATGAA GGCATTTCGT ACGGCTACCG CGCCTGTGCG
GTGGACTCGG TCAACAACGT CAGCGTGGGA ACCGCCGGCA GCGCCATCGC CAGTATCAAG
AACAGATGTA CGGAGCTTCC CTCACTTGAG ATCAATCCCA ATGCCTCATA CATCAAGGCC
GGCAATACGC TGGGCCTGGA TGTCGCCATC ACCAGCAACG ACACGGGCGT CTGTGCGGCA
ACCACCTACA CGCTCTCGAT AGTGGGCACC GACATCGACG ACAGCAACTA CACCGTCTCC
ACATTCAGCG GCAACAACTT CATAATCCCG ACCATGGGGT CGCAGTATAC GAAGCTGAAC
ATTACCGCGA AGCCCGGCGC GGTTCAGGGG GCGGTCAAGA CCTTCCAGGT GAAGGTCGCC
AAGAGCTCCG GCGGAGAAAC CCTGCACGCG GACCCGGTCT ATGTCACGGT CAACAAATAT
GGCACCATGA TGCACAGCAG TCTGCAACTG GGCACCACCA AGTACGGTAT CTGGGGGAAG
AATTACGACT GCGCCACCTG CCATTCCCCC ACCGCCACCA ATATCAAGCA GGTCAAAAAC
AGCATCGCCA CCCCGACCGG TAACCGGCCG GTCGTGTTCA ATATTCTCTC CACCGCTTCC
AGTGCCAATG TGGCTGGTGT TTTCGGCAAT GACCACAGAA GCGGGACAGC CACCACCAAT
GTGTGCGAAG TCTGCCACCA CAATGCGCGT TTCCACCAGT ACAGCTCGTC GAAGGTGACC
TGGAAGGACC ATAACAACAA CGAAGACTGC ATGAAGTGCC ACTCGCACAA GCTCGGTTTC
AAGACGGTGG CGCAGGCGGG GTCGTGTACC GACTGCCACG GCTATCCCCC CACCATCAAG
GAGCAGCTGG TGGTGCCGAC GACCAACGTG CTCTCCTCGT ATGCAACCAA TGCCGGCTCC
CATGGCAAGC ACAACGACCG CGGCCTCAAG TGCCAGGCCT GCCACAGCAA CGGCAACCAC
CTGGTCACCG CCGTACCGGA CAAAAACATC AATATGGGGT TCAAGGTCAA CGGGACCAGC
TTCCCCGGTT GGTTCGGCCA ATACACCACC GGCATAATGC GTTCGCTGAC GCCCCGTAAC
GGCTATACCT TTGCGACCGC CCCGGGCACG ACGGTGCAGC AGGCACCCGG CACCATAATC
AACTGCAATG TCTACTGCCA CGGCTGGGAC GGCAATGGCG GCTACAATAC CGATCCGGCC
TGGACCGGCA TCAGCCAGGT CGGCTGCGGC TCATGCCATG CCGCCACCAA CGACCAGCCC
CCCACCTCCG GCAGCCACCA CAAGCATGCC AGCAATGAAC CCGGTTTCGG CAACGGCATT
GCCTGCAGCA AGTGCCACGG TTTCCGCAAT TACTCCACCA GCTCGGCCCA CATCAACGGC
AATGTGGAAT GGGATCTCTC CACGCTGCCT CCCGGAACCT TCGGGCTGGC CCTTTACAGG
GGCGTGGACA AGGGGAACAC CGGCGCTCCG GCCCCCACGC CGCCGGGCAG TTACGGTTCC
TGCTCCAACC TTTACTGCCA CAGCAATGTC CAGAGCAACA ACGGCACCGG TCCCCCGACG
TCCTATTCCA CCCCGACCTG GGGGGGGACG ACCACCTGCA ACAGTTGTCA CCAGGCGCAG
CCGAATGTGA CCGGCGGGCA TCCGCAGCAT GCCGGCGCCG GCGTGACGGG CTTTGATTGC
CGCATCTGCC ATGGAAACGG CGGTGACGCC AACCCGCTCA ACCATGCCAA CGACTACATA
AACTTCCAGT TCGGCGGCCT TGCCGAGAAC ACGCACTATT CCTACAGCTC GGCCAAGGTG
CCCGGCTCCG CCTCCTATGG CACCTGCTAC AACGGCAACT GCCACGGCTT GCGCCGTCCC
AAGACCGGCC CGACCGCGCT GACCTGGGGC CCGGCCAATG ACGCTATCCC GCTTTGCGAC
AAGTGCCACA CTACGGATCC ATCCCTCAAA AATGGCTTCT ACAGCACCAT GGGCCCCAAC
GGGACCACCT CGAACACGGA CCCGTACGTT GGCGCCCACT TCCAGCATAT CACCTCGATG
CCGTTCAAGC TGTCGTCCCA GTACGATTGC TCCGAATGCC ACAACAAGCC GACCGGCCCC
TACACTCCCG GCCATATCGA CTCGCAGCTG CCGGCCGAGC TGACTTTCGG CGCAACCGCA
TCGAGCGGCG CGGTCCTGAC CGGTTACACC AGCGCCCAGC ACCAGCCCGG CTACAACTAC
GGTGCCCACC AGTGCAGCAA TATCTGGTGC CACGGCTCCG GCATGGACTC CGTCGAGGGG
ACCGGCCTCT ACGGTTCCGC CGTGAGCGAC GGAGCCACCC CCAATGCCAG CCGGATCGCC
TCGCCGGTCT GGAATGCACC GTTCCTGAAC GGTACCACCG CCGACTGCAA CAAGTGTCAC
GCATCACCGC CGCCGGCACC GCTCCCCGGA TACAATCACT GGGACGATGA CAACAGCCGC
CCCTACCAGG CCAACCAGTG CATCAATTGC CACAAGCATG TCAATGCGGC CGGCAACGGC
TTCACCAAGC CGGAAATCCA CGCCAACGGC GTTGTCGACA GCTGTCTGAC CTGTCACGGC
CTCCCGCCGA CCGACAACAG CATGACCAAC CCGCCGATTA ATGCCCTGAG TGCCGGCATG
ACCGGCGCGC ACCAGGGGCA TTTCCTGAAC CCGAATATCG GCAAGCGCTG CACTTTCTGC
CACTACAACA ATTCCGGCGA CATGCCCAGC TACAAGCTGG AGATCGGTTT CAACGCCTTT
GGCGGCAAGG TCCGCCGCGG AACGTTCTAC GGCTACTCCA CCCTGACCAA CAGCTATTCC
CAGCCGATTG TCTACTTCGC GACCATCACC AGCACCACCG TGCGGCGGAC CACGAACACG
GCCAAGCTCA ATACCTGTGA GAACGTGTAC TGCCACGGTG GCGGCTCGGG CGCGGCCCTG
CCACCATTGG GGGGCGGCAG CAACACCAAG CCGGACTGGG AGCTGGGCTA TACCGAGGCC
ACCTGCGGCA GCTGCCACGG CGTGACCGGC GAAACCTACC GGACCAGGGG GTCCCATGGC
GCCCATGTGG GCACGCTCTT TGGCGAGCCG AAACTGGCCT GCTCCAACTG CCACGGCGTC
AAGGAGAACA ACTACCACGT GAACGGCCAA GTGGAGTGGG AGTTCTACAC CTCCGCCAAG
CGGATGAACC AGATTGCGGT CAACGACAGC TTTAAAGACA GCCTCGGCAA TGTTGTCGTT
CCCGGCTACA AAGCCGCCGG CGCCTCGTCC TTTGCGGCCA AAGGCGGCAC CGGCAACCTG
GCGCCCAGCG CCGCGTACGG CACCTGCCAG GTCTACTGCC ACAGCGACGT TTACGATCAC
CAATTCAAGG CGATCACCTG GGGTAGCGGC GCCACCACCT GCAACTCCTG CCACCGGGAC
CAGACCTCGG CCGGCAGGTA CACCGGCGCG CACCAGAAAC ATACCGCCAG TTCGGCCAAC
GGAGGCTACG GCATCGACTG CGTCATGTGC CACTACGGTT CCGGCGCCGG CAACCCGCTC
CATGTGAACG GCACGGTTGA CATCATCTTC AACTCGTCAG TGGTCGGTCC CAATGGCGTG
TATGCACCCG GCGCCACCGA AGGGACCGGT ACCTGCAAGA ATATCCTCTG CCACGTTTCC
GACGCAACCA CCGGTCCGGC CTGGAACGGC GGATCTGCCA GCGGCAGCTA TGCCACTGGC
ACCAACAAGC CGACCTGCAT CGGCTGCCAC AGCGGCGAGG TCGGCGGCCG CACCGCGGTT
ATCCCGCAGT TTGCCGGCGC GTCGCACCAC GTCCAGGGTG TAACCATGAG CGCCACCTAC
TGCTACCCGT GCCATATGGA GGCAAACGCC GACGGCACCG CCAATGCCAC CTACCATGAC
CGGACCACCG GCAAGTCGGT TGATCTGGTG CTCTATGGCA ACGGCACGCG CGGCACGGTC
TTCACCAGGT ACACGGCCGC GGGAAGCGCT ACCCGCAAGC GGACGGAATA CGCCAAGATC
AACAACGTCT GCATCGGCTG CCACAGCACC AAGAACAATG CCACAACGCC CTTCAGCGCC
AGCGGCGATA CCAGGACACC CAAGACCTAT GCCTGGGACA ACAGCAGCAT CTTCAACCGG
TACAGTTCCA CCGCCACCAC GCTGTGGGGC AAGGTCACCG GCAACGACAC GGTGAAGAAG
GGGCTTAACA AGGCCTTCTC CGCCCATGGC AATGCCAGCG GCAACCAGCG CGGCTGGGCG
TTCGGCTCCT ACACGGGCGG CCCGAAACAC AGCAACACCA GCGGCGCCGT CAACAATGTG
CTCTGCTTCG ACTGCCACAA CTCCCACGGC ACCGTGGCCA GCGGCATCAT GACCAGCTAC
TCCAGCGCCA CGGGCCGCTA CAGCGGCGGC ATGCTCAAGA CCACCGTCCA GGGGGTCGGC
GGCTACAACT CCACCTATGC GCCGGTGGCA GGCGGCGATT CGGCCGCGCC CAACAGGAAC
GCCTACAACA CGGGTGCGTC CCTCTGCTTC GACTGCCACA ACAACAAGAC GTCCAATACC
TCCATGCCGT GGGGCTACAA TGACACCTTC GGCTCCAACC AGCCCATCTA CGGCTTCCAT
GACAAGCCGT ACTTCGGCAA CTACTCCACC TTTGCCATGA CGGTCACCTA TCCGTACAAG
GCCAGCAACC CTGGCAACAA GGGGGGCCAC TACGGCCCGT CGTCACCGCT GACCACTGCC
GTCACCCAGC GGACGTTCGC GAACGGCATC AAGGACAATC CGTACTCGGC CGGGGTTTCC
TCGCCCATCA ACGGCCTCTG TACGCCGTGC CACGACCCCC ATGGCGTCAG CAAGAACACG
ACCTATGTGA GCGACCGCAA CTACGGCGTG CCGCTCCTCA AGGGGACCTG GGTCACCTCC
CCCTACCGGC AGGATTCGGC ACCGCGGAAC ACCAACGAAG CCCGGGGCGG CAGTACCCAT
TCATCGTCCT ATGTGCCCAT TACCGTCGGC AGCACGCCCC GGTACTTCAT CGACCAGAAC
AGCATGCAGG CGGGAACGGT GGGCGAACCG ACGACCGCCC GGTCCTGGAC GTTCACCACT
TCCGCCGCGA CACTGCAGAC CACGGCCGAT ACCCAGTTCG CCGGGCTCTG CACCGGCTGC
CACAACAAGT CGGTGCTCAA CAACACGGCG GCGGTGACCG GTGCCCCCGG TTCGACCGGA
AGCTGGAAAG CCATGACCCG GATCCACAAC ACCGTCGATG GCTGGGCACT CACAACGGGC
AGCGGCGGCA ACGTGAGCAA CAAAGTGCAT GCCTTCACCT GCTCCAAGTG CCACACGCCC
CACAACGCCC GGCTACCACG GCTCATGGTG ACCAACTGCC TCGATGCCAA ACATCGCGGC
GGGGTGGCTG CCGGCGGGAA TCCGGTGGAG TACTCGAAAT GGTATCAGAG CGGGGCCGGC
AAGGGACGGT TCCCTGTAGG CGGCGGCGGG TTCAAGTCCC GTGGCTCGGC GATCAACCCG
GGGACCTGGT TCTTCGGCAA GTCCCAGAGC ACGGTCCAGA ACGCGGCGCC GGCACTGACC
CTGACCACCC AGACCCAGTG CCACAACACG GCAACAGCCG GTGGCATAAC CTACACCAAC
TACACGACGC AGCACTGGAA CAACAAAACA CCGTGGTAG
 
Protein sequence
MRFVQGGGVF APVRRNKVTD FPALNEGTYM GTTIARQIRS MGLRTKIGII VMLVVGCFLY 
QIMFKPMIGD TATQTYYFTT DSTSVNLGAD GSTSTAASLG GKISMKVGVY AFSRSVSAAS
NTSEQRMISA YGPVYAAKQT INAPAVTIGV RDRNGTANAI YWKAYVYAYN PAVAPGTGNP
TATGAANNAR LLWTSDEMEA HPSVQTPLDM TFTNPALQTI DAGQRIKVVI TARLASTASS
ARLFWGGGSN YSFFTVTEAP YVADSVTVSN LADYYAGGLT SVTQGDTNVP MLRFDLYTNV
AGGVNWSGGL LDKIGTNTSV LNPLLPVDEP GDVSFSIYRD ADNDGVFEPT DTLIGGPYNF
SQLTKQGYSL PTPEALTATP RRYFITYTIR KNATYNTTVG ARIVDGSYFT VDAAATNGVR
NVTSTSSSTP LIQYGGTPVV KNYAADWDAG TSLTNIAETG GPGSTTSTCI TRTTAGSAFP
LVGLLNYPNH SCASVAGQNY SNTSGSTQPD FVRLYFSGDG YHSDMLSIKG RSFTYRLYTP
SGGGTVTLQM FYVTSGGVRV NAPIASTYKS TGTLSQTITT SLSGQDFSNV PLGARLGIQI
GVTANAQIGL GGAVGAQLQV EETAALNENV DVGDGFPSVN ANVYAGDTSK VIDSFTLSAS
KAKTVSSISI KGNPLFNSTN IKNVWIYQDN STGGMLGALD GTDTLIGSTS VITGNVATVS
VGSLAIDNKT KRFLVVVDIG DTPNTNVILN ALVSDLAVVT SGTIGENSDS SSANLTILPT
TTVSGGTAEP PGVIVPSGAG ATKLDAFNLA TNGGVNDTFS SVTVTLSTTS TLPAGKVISD
YVARLDIIKA DGTSFGHLTS PTQVNNWQVT TTGLAATTTP TDYYVAVTPK AGQGITYDVK
AQVVSVVHSR TTNRLLLSDP SSATVIMDQQ PPTDPTLTVA TGTYHNDIDR AEINLNWTTA
TDTSGSAVSY VVVRGLGNAP PPRNCTVDNV KTFPVYSGTG TSLIDKNLDE GISYGYRACA
VDSVNNVSVG TAGSAIASIK NRCTELPSLE INPNASYIKA GNTLGLDVAI TSNDTGVCAA
TTYTLSIVGT DIDDSNYTVS TFSGNNFIIP TMGSQYTKLN ITAKPGAVQG AVKTFQVKVA
KSSGGETLHA DPVYVTVNKY GTMMHSSLQL GTTKYGIWGK NYDCATCHSP TATNIKQVKN
SIATPTGNRP VVFNILSTAS SANVAGVFGN DHRSGTATTN VCEVCHHNAR FHQYSSSKVT
WKDHNNNEDC MKCHSHKLGF KTVAQAGSCT DCHGYPPTIK EQLVVPTTNV LSSYATNAGS
HGKHNDRGLK CQACHSNGNH LVTAVPDKNI NMGFKVNGTS FPGWFGQYTT GIMRSLTPRN
GYTFATAPGT TVQQAPGTII NCNVYCHGWD GNGGYNTDPA WTGISQVGCG SCHAATNDQP
PTSGSHHKHA SNEPGFGNGI ACSKCHGFRN YSTSSAHING NVEWDLSTLP PGTFGLALYR
GVDKGNTGAP APTPPGSYGS CSNLYCHSNV QSNNGTGPPT SYSTPTWGGT TTCNSCHQAQ
PNVTGGHPQH AGAGVTGFDC RICHGNGGDA NPLNHANDYI NFQFGGLAEN THYSYSSAKV
PGSASYGTCY NGNCHGLRRP KTGPTALTWG PANDAIPLCD KCHTTDPSLK NGFYSTMGPN
GTTSNTDPYV GAHFQHITSM PFKLSSQYDC SECHNKPTGP YTPGHIDSQL PAELTFGATA
SSGAVLTGYT SAQHQPGYNY GAHQCSNIWC HGSGMDSVEG TGLYGSAVSD GATPNASRIA
SPVWNAPFLN GTTADCNKCH ASPPPAPLPG YNHWDDDNSR PYQANQCINC HKHVNAAGNG
FTKPEIHANG VVDSCLTCHG LPPTDNSMTN PPINALSAGM TGAHQGHFLN PNIGKRCTFC
HYNNSGDMPS YKLEIGFNAF GGKVRRGTFY GYSTLTNSYS QPIVYFATIT STTVRRTTNT
AKLNTCENVY CHGGGSGAAL PPLGGGSNTK PDWELGYTEA TCGSCHGVTG ETYRTRGSHG
AHVGTLFGEP KLACSNCHGV KENNYHVNGQ VEWEFYTSAK RMNQIAVNDS FKDSLGNVVV
PGYKAAGASS FAAKGGTGNL APSAAYGTCQ VYCHSDVYDH QFKAITWGSG ATTCNSCHRD
QTSAGRYTGA HQKHTASSAN GGYGIDCVMC HYGSGAGNPL HVNGTVDIIF NSSVVGPNGV
YAPGATEGTG TCKNILCHVS DATTGPAWNG GSASGSYATG TNKPTCIGCH SGEVGGRTAV
IPQFAGASHH VQGVTMSATY CYPCHMEANA DGTANATYHD RTTGKSVDLV LYGNGTRGTV
FTRYTAAGSA TRKRTEYAKI NNVCIGCHST KNNATTPFSA SGDTRTPKTY AWDNSSIFNR
YSSTATTLWG KVTGNDTVKK GLNKAFSAHG NASGNQRGWA FGSYTGGPKH SNTSGAVNNV
LCFDCHNSHG TVASGIMTSY SSATGRYSGG MLKTTVQGVG GYNSTYAPVA GGDSAAPNRN
AYNTGASLCF DCHNNKTSNT SMPWGYNDTF GSNQPIYGFH DKPYFGNYST FAMTVTYPYK
ASNPGNKGGH YGPSSPLTTA VTQRTFANGI KDNPYSAGVS SPINGLCTPC HDPHGVSKNT
TYVSDRNYGV PLLKGTWVTS PYRQDSAPRN TNEARGGSTH SSSYVPITVG STPRYFIDQN
SMQAGTVGEP TTARSWTFTT SAATLQTTAD TQFAGLCTGC HNKSVLNNTA AVTGAPGSTG
SWKAMTRIHN TVDGWALTTG SGGNVSNKVH AFTCSKCHTP HNARLPRLMV TNCLDAKHRG
GVAAGGNPVE YSKWYQSGAG KGRFPVGGGG FKSRGSAINP GTWFFGKSQS TVQNAAPALT
LTTQTQCHNT ATAGGITYTN YTTQHWNNKT PW