Gene Phep_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1039 
Symbol 
ID8252133 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1215613 
End bp1224396 
Gene Length8784 bp 
Protein Length2927 aa 
Translation table11 
GC content47% 
IMG OID644934692 
ProductFibronectin type III domain protein 
Protein accessionYP_003091321 
Protein GI255530949 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.761605 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAGAA ATCTACTTAT TTTAATCGTT CTGATACTAT GCTCATTTTC TGCAAAAATA 
AGTCATGCAC AGTTAAGTCC TGGCGATATC GCCATTATTG GCGTTAACGG TGACACCAGG
CTAAGTCCCA CAACTCCTTA CACAGGAACT ATTGCATTGG TTACACTCGT GGATATTCCT
TCCGGGACGG TTGTTAAAAT TACAGACTAT ACTTATAACG GAACTGTTTT TGGAGCAAGC
TCCTCAGATG GACTCCTTAC TTGGACCATC AGTCAGGCCA TTCCGATAGG CACCGTTTTC
TCGATTTCCT TTACCAACAA CACATTAGCG GCTCCGGTAA TTCAACCTTC CACATATGGA
TCTGTTACGA AAGTCGGTTG GACAAACTCA GGCATTCGTC CGATTAGCAA TGTCAGCGGA
GACAGCTGGT TGGTTTATAC CACAAATCCA GACAATAGCT TTAATTTCCT TTATGGATTT
TTAAATACAA ACTACACCAC GCCTCCCATA GGTGGCACTG ATACTTCAAC CGGATGGGCA
ACAACGGGCA CCACCCCTGC TAACCAGACC TCTGTTCTGC CCGCACAATT GAGCGGAACA
AATGCTTATA ACACATTAGT CACTTATTCA GGCGGACCTG ATTTAAGCAG ACAGTTCAAT
TCTTATTCAT CTTTGTTTAG CGGAACAAAA GAACAACTGT TAAATAACAT CAAAACACCT
GTCAATTGGG CCAGCACAAA TACGGAGGCT GACGCAAAAG ACCTTTCTCC GGGAGCAAGT
GGTGGTGCTT TTCAAGGTAC TCAACCCATT TATACGATTA CTGAACTAAC CGTAACCTCC
AGTACGGCAG ATGGCACCTA TAAAATAGGT GACGTTATCC CTGTACAGGT TAATTTTTCT
GCGGCAGTAA CTGTAACGGG CACGCCCCAG CTTCTGCTGG AAACCGGATC CACAGACAGG
ACAATCAATT ATTCATCAGG AAGCGGTACA TCAACGCTTA CTTTTAATTA TACTGTACAG
TCAGGAGATG TAAGCGCCGA CCTGGATTAC CAGAGCACAA CTGCTTTAAC CCTTAATGGC
GGTACCATCA GTGCCAGCGG TACAAATGCC ACACTAACCT TACCTGCAAC AGGTGCCGCG
GGGTCTTTAG GTGCAAATAA AAATATTGTA ATTGATGGCC AGGTTCCAAC GGTATCTTCG
ATGGTCAGAG CCAGTACTAA CCCTACTAAT GTATCAACAC CTGTCAATTA CACGGTTACC
TTTAGTGAAC CGGTGACAGG ATTAAATGCA TCAGACTTTA CATTTACTGT CTTACCTATA
AGTGGTGGTC CTACACCATC CGTTACTACT GTAACCCCGG TAAGCAGCAC AGTTTATACT
GTCACAGTAA ACACAGGTTC AACAAGTGGA CAATTTCACA TGGATTTACC CGCTTCCGGA
TCAGGAATAG CTGATCTGGC AGGAAATGCG ATGATAGCCT CTTTTCAGGG CAATGTTTAC
TCGATAACCG CACCACCAAC TACCGTTACC AGCGTAACCT CCAGTACGGC AGATGGCACC
TATAAAACAG GTGACGTGAT CTCTGTACAG GTTAATTTTT CTGCGGCAGT AACTGTAACG
GGCACGCCAC AGCTTCTGCT GGAAACCGGA ACCACAGACA GGACAATCAA TTATTCATCA
GGAACCGGTA CAACTGCGCT TACTTTTAAC TATACTGTAC AGGCAGGAGA TAACACTGCC
GACCTGGATT ACCAGAGCAC AACTGCTTTA ACCCTTAATG GCGGTACCAT CAGTACCAGC
GGTACAAATG CTACACTAAC CTTACCTGCA CCAGGTGCCG CAGGATCTTT AAGTGCAAAT
AACGATATTA CAATTGATGC CATAGCCCCG GTTGCGCCAT CTACACCAGA TCTGGTAGCC
GCAAGTGATA CCGGTATATC CTCCACCGAT AATCTTACGA ATGTAACCAC CCCCCGTATT
ACCGGTAACG CAGAAGCAAA TACGACCATT ACCCTCTACG ATACTGATGG TATAACTGTA
ATCGGGAGTG CTTTTGTGAA CGGCGCCGGC AAATGGACGG TAAACATTTC AACACCACTT
AGCCAGGGAG ACCATACCAT TAAAGCTACG GCCACCGATG CAGCCGGTAA CATCAGCGTT
CTGTCTTCCG GTTTATTATT TACTATTGAT ACCACTGCCC CTACATTGGC TATCACAAGT
AATGTCAGTA CTTTAAAAGC AGGAGAAACG GCAACAATAA CCTTCACTTT CAGTGAAAAT
CCTGAAACAT CATTTACATG GGACGGCACA ACAGGATCTA TAGTAGTCTC GGGAGGTACA
CTGGATGCTA TCACAGGTAC AGGACTAACC AGAACAGCAA CCTTCACACC AACAGCAGCA
CAAAACAACG GAACGGCTAG TATTACCGTT TCAGCTGGTG CATATACAGA TGCAGCAGGC
AATAACGGTG GTGCAGGTAT TACACCAGCA TTGTCTTTTG ACACACAGCT TCCGGCTGCG
CCTTCAACTC CTGTCCTGGC ATCAGCAAGT GATACAGGTA TACCAGGCGA CAATATCACC
AAACTGGCCA CACCTGCTTT TACAGGTACA GCCGAAGCAA ATGCTACCGT TATCTTATAT
GATACTGATG GTACCACTTC ACTGGGAACC ATCGCTGCTG ATGGCAGCGG TAACTGGTCA
ATCACTTCTT CAGCACTGAC CCAAGGATCT CATACCATTA AAGCTACAGC CACCGATGCA
GTTGGTAACG TCAGCATTCT CTCTTCCGGT TTAGTGGTGA CCGTAGATAA TACTGCCCCT
ACACTGGCCA TTACAAGTAA TGTCAGCACA TTAAAAGCAG GAGAAACAGC AACAATCACT
TTCACATTCA GTGAAAATCC CGGATCATCA TTTTCTCTGG GAGATATAAC GGTCTCAGGA
GGTTCACTGG GGGCGGTCAC AGGTACAGGA CTGACCAGAA CAGTAACTTT TACACCAACA
GCAGCACAAA ACAACGGAAT AGCCAGCATT ACTGTTACAG CCGGTACTTA TACAGATGCA
GCTGGCAATA ACGGCGGTGC AGGTACTACA CCAGCATTAT CTTTTGATAC ACAGCTTCCG
GCTGCCCCGT CAACTCCTGT ACTTGCAGCA GGAAGTGATA CCGGTATATC AGGTGATAAT
ATCACCAGCG TAACCACGCC TGCTTTTACA GGTACGGCTG AAGCAGGTAG TACTGTAACA
CTTTATGATA CAAACGGTAC TACTGTACTG GGAACTATCG CTGCAGACGG CAGCGGTAAC
TGGTCAATCA CTTCTTCAAC TCTGACTGCA ACTTCACATA CCATCACTGC AAAATCTACT
GATGCAGCTG GTAACACCGG CCTCGCATCA GCAGGTCTGA CGATCATTAT TGATAACACT
GCGCCGAATG CACCATCAAC CCCTGTTCTT GCAGCAGGAA GTGATACAGG TATATCAGGT
GACAACATCA CCAGCGTAAC CACACCTGCT TTTACAGGTA CGGCAGAACC AGGTGCTACG
GTTACCTTAT ACGATACAAA TGGTACGACT ATACTGGGCA CTGTCGCTGC TGATGGCAGC
GGTAACTGGT CAATTACTTC TTCAACTCTG ACTGCAACTT CGCACACCAT CACTGCAAAA
GCAAAAGATG CATCAGGAAA TACCAGTACA GCATCGGCGG GTCTGACGAT CATTATTGAT
AACACTGCGC CGAATGCCCC GTCAACTCCT GTACTTGCAG CAGGAAGTGA TACAGGTGTA
TCAGGTGACA ACATCACCAG CGTAACCACA CCTGCTTTTA CAGGTACGGC AGAACCAGGT
GCTACAGTGA CACTTTATGA TACAAATGGT ACGACTATAC TGGGTACTGT CGCTGCTGAC
GGCAGCGGTA ACTGGTCAAT TACTTCTTCA ACTCTAACTG CAACTTCACA TACCCTTACT
GCAAAAGCAA AAGATGCGGC TGGTAATGTT AGTACGGTAT CAGCAGGTCT GACGATCATC
ATTGATACCA CTGCTCCAAA TGCACCATCA ACCCCTGTTC TTGCAGCAGG AAGTGATACA
GGTATATCAG GTGACAACAT CACCAGCGTA ACCACACCTG CTTTTACAGG TACGGCAGAA
CCAGGTAGTA CTGTAACACT TTATGATACA AACGGTACTA CTGTACTGGG CACTGTCGCT
GCAGACGGCA GCGGTAACTG GTCAATCACT TCTTCACTGC TGACCGCAAC CGCTCATACC
GTCACTGCAA AAGCAAAAGA TGCATCAGGA AATACCAGTA CGGTATCAGC AGGTCTGACG
ATCATCATTG ATAATACTGC GCCGAATGCA CCATCAACCC CTGTTCTTGC AGCAGGAAGT
GATACAGGTA TATCAGGTGA CAACATCACC AGCGTAACCA CACCTGCTTT TACAGGTACG
GCTGAAGCAG GTAGTACTGT AACACTTTAT GATACAAACG GTACTACTGT ACTGGGAACT
ATCGCTGCAG ACGGCAGCGG TAACTGGTCA ATCACTTCTT CAACTCTGAC TGCAACTTCA
CATACCATCA CTGCAAAATC TACTGATGCA GCTGGTAACA CCGGCCTCGC ATCAGCAGGT
CTGACGATCA TCATTGATGC CACTGCTCCA AATGCACCAT CAACCCCTGT TCTTGCAGCA
GGAAGTGATA CAGGTATATC AGGTGACAAC ATCACCAGCG TAACCACACC TGCTTTTACA
GGTACGGCAG AACCAGGTGC TACGGTTACC TTATACGATA CAAATGGTAC GACTATACTG
GGCACTGTCG CTGCTGACGG CAGCGGTAAC TGGTCAATTA CTTCTTCAAC TCTGACTGCA
ACTTCGCACA CCATCACTGC AAAAGCAAAA GATGCATCAG GAAATACCAG TACAGCATCG
GCGGGTCTGA CGATCATTAT TGATAACACT GCGCCGAATG CACCATCAAC CCCTGTTCTT
GCAGCAGGAA GTGATACAGG TATATCAGGT GACAACATCA CCAGCGTAAC CACACCTGCT
TTTACAGGTA CGGCAGAACC AGGTGCTACG GTTACCTTAT ACGATACTGA CGGTATTACG
ATATTGGGAA CTGTCGCTGC TGATGGCAGC GGTAACTGGT CAATCACTTC TTCAACTCTG
ACTGCAACTT CACATACCAT CACTGCAAAA GCAAAAGATG CATCAGGAAA TACCAGTACA
GCATCGGCAG GATTAGCAAT CACTATAGAT GGTGCTGCTC CTACAATAGC CATTACAAGT
AATGTCAATA CATTAACAGC AGGACAAACA GCAACGATCA CCTTTACTTT TAGTGAAGAT
CCGGGAACAA CATTTGCCTG GAACGGTTCA ATAGGAGATG TGGTAGTTTC AGGAGGTACA
CTGGCTACGA TATCAGGAAC AGGATTGCTT AGGACAGCAC CTTTTACACC TGCTCCTGGA
CAAAACAACG GGACGGCCAG CATTACTGTT ACAGCAGGTG CTTACACTGA TGCTGCTGGT
AATAATGGTG GGGCAGGAAC ATCACCAGCT TTAACTTTTG ATACTGCATT ACCTACCTTA
AGTGTTGTAA ATATTTCATC TGGCAATGCA GTTCCCACAA TTGCGAAGGT GGGGGATGTA
GCTACACTGA CCTTTACTTC CAGCGAAACA GTAACCCCGG TAGTGACCAT TGCAGGGCAT
ACAGTTATCC CAACAGCTTT GGGCAACAAC TGGACGGCTG CTTATACTTT TACAGGTGCT
GATGCGGAAG GACTGGTTGC TTACAGCATT GCTTTTAGTG ATGTGTCGGG AAATACAGGA
ACTGTGGTTA CCACAGGTAA CGGGTTGATC ACTTTTGATC AGTCTGCTCC GGCCACACCG
GCAGGTTTGG CTGCAACACC AGGTGATACG CAGATTGTAC TGAACTGGAC AGCTAGTCCG
GCAACAGATC TGGCCAAATA CAGGATCTTA TCGGGAACTA CTGCCACCCC GTCAACAACT
TTGGCAGATG TCCCGGCAGG GACAACAACC TATACAAATG CAGGTTTAAC CAACGGTACA
GGTTATTATT ACCGGATTCA GGCAATTGAC CAGGCTGGTA ACATCAGCGC AGCAAGTGCA
GATGTTACAG CAGTTCCGAA AGCCAATCAG ACGATCACTT TTACTACTAT CGCTACAAAA
ACTTACGGTG ATGTGTCATT TGCCCTTGGC AATGCGAACT CATCAGGTGG TTTAACGGTG
ACCTATACCG CTGCAGACCC TTCAGTGGTT TCCATTACGG GAAATACCGC TACGATATTA
AAGGCGGGGA GCACGGTGAT TACGGCGAGC CAGACAGGAA GTGCCAGCTA TAATGCAGCA
TTGAATGTCC TGCAAACACT AACGGTAAAT ACGAAAGCTT TGGTTGTAGT AAATACCGAC
CGCTCAAAAG CCTATGGTGA TGTACTGGCC AATGCTGACT TTACAGGCAG CATCACCGGT
ATCCAGAACG GAGACAACAT TACTTTGACT CGCAGCAGTA CGGGAGCAGC AGCTATGGCA
GTTGCAGGGA CTAACTATCC AATCGTAGCC ACACTGGCCG ATCCAGACAG CAAGCTGGGC
AACTATACAG TCACTAACCC CAACGGAACT TTGACAGTAA CTTCAAAAAC ATTGACCATT
ACCGCTTCGG CAAGGACCAA AACCTATGGC GATGCCGTAA CTTTTGCAGG TACCGAGTTT
ACCACAACAG GACTGATCAA TGGCAATACC GTCACCGGTG TGACCCTGAC CAGCACAGGA
GCAGCTGCTA CAGCGAGCAT TGCTGGTGGG CCTTACCCGA TTGTGGCAAC AGCAGCTACT
GGCACAGGGT TAAATAACTA CACCATCACC TATGTAAACG GTACTTTAAC GGTTAACCCG
AAAGCTTTGG TTGTGGTGAA TACCGACCGC TCGAAAGCCT ATGGCGATGT ACTGGCCAAT
GCTGACTTTA CAGGCAGCAT CACCGGTATC CAGAACGGAG ACAACATTAC TTTGACCCGC
AACAGTACAG GGGCAGCTGC TACGGCAGTT GCAGGGACTA ACTATCCAAT CGTAGCCACA
CTGGCCGATC CTGACAGCAA GTTGGGCAAC TATACCGTTA CCAACCCGAA TGGGGTATTG
ACTATAACGG CAAAAACACT GACCATTACC GCTTCAGCAA GGACCAAAGC CTATGGTGAT
GCCGTAACTT TTGCCGGCAC CGAGTTTACC ACCACAGGAC TGATCAACGG CAATACCGTC
ACCGGTGTTA CTTTAACCAG TACAGGCGCT TCGGGTACTG CAACTGTAGC AGGTTCAACC
TATCCAATTG TGCCCGCTGC AGCTGTTGGT ACGGGATTGA GCAACTACAG CATCGTTTAT
ATGAATGGAG CTTTGACGGT AGGCAGGAAA GTGCTGACCA TTACTGCCGA TAACAAAGAA
CGTTTTGCAG GAACAGCAAA CCCGGCATTA ACAGTAAATT ATTCTGGTTT TGTCAATGGT
GAAAGCAATT CGGTATTGAC TACTCTACCA ACCATAAGCA CAACTGCAAT TACCACAAGT
CCGGCTGGCA CTTATGACAT TAATGCCAGT GGTGCAGTAG CAGCCAACTA CAGTTTCAGT
TATGTAAAAG GAACACTGAC GGTTAAAGGA GGTGCGCCAA CCAATATCAA TCTGGCTGGG
GTAGCGCTTT ACGAGAACAG CGCAGCAGGT ACCAATGCCG GAACCTTAAG CAGTACTTCT
GATGATCCTT CGGCAACCTT TACCTATACT TTAGCTGCCG GAACAGGCGA TACAGACAAT
GCATCATTTG CCATCATCGG TAACAAGATC AATACTGCAT CAGTACTTAA CTTTGAAAGC
AAAGCCAGTT ACAGTGTGCG CGTAAAAAGT ACCACACAAT ATGGCCTGAG CCTGGAAAAA
ATACTGACCA TTACCCTTAT TGATGTAAAT GAGATTCCTA CACTTGCAAC GATAGCTGAT
CAAACGATCT GCTTTACCAC AGCAGCACAA GCCTTAGCTT TAACTGGCAT TAGTGCCGGC
CCTGAAACTG CACAGAGCAC TGTTTTAAGT GTGAGCAGTA ACAATGCTGG TTTGTTTGAG
GCGCTGACAG TAACCGGCAG TGGTGCTACC GCAACCTTAA GCTACCGCGT AAAAGCCGGT
GCAATAGCTG GAACAGCTAC TGTAACGGTA ACAGTGAAAG ACAATGGTGG CACTGCAAAC
GGGGGAATAG ATACTTACAG CAGGACTTTA ACCATTACAG TTAATGCCCT GCCCGTGGTG
GCTATCAACA GTGATAAAGG AACGGAGATC AGCAAGGGAG AGACCGTATT GCTTACCGCA
ACAGGTGGTA GCAGTTATAC CTGGGCAGCC AACAGCAGTA TCATAGGTAC AACGAACGCT
CCGGTAATAA CCGTAAGGCC AAGCCAGACT ACAACCTATA CTGTAACGGT GACCAATGCC
AGCGGATGTA CAGAAACAAA AACCATTACT TTAACTGTAC TGGAAGATTT TGTGAAGATC
AAAGCCACCA ACATCATGTC TCCAAACGGC GATGGGATAA ATGACAAATG GGTTATCGAT
AACATTGATT TTTATCCGAA TAATGAGGTG AAGATCTTTG ACAGGACTGG TCGTCCGATC
TATAGCAAAA AAGCTTACGA CAATAGCTGG GAAGGTACCT TAAATGGAGC GCCACTGGCT
GAAGGCACCT ATTACTATAT CATAGACTTT GGGACGAGCA GGCCACGTTT CAAGGGTTTC
ATTACCATTA CCAGACCAGA GTAA
 
Protein sequence
MQRNLLILIV LILCSFSAKI SHAQLSPGDI AIIGVNGDTR LSPTTPYTGT IALVTLVDIP 
SGTVVKITDY TYNGTVFGAS SSDGLLTWTI SQAIPIGTVF SISFTNNTLA APVIQPSTYG
SVTKVGWTNS GIRPISNVSG DSWLVYTTNP DNSFNFLYGF LNTNYTTPPI GGTDTSTGWA
TTGTTPANQT SVLPAQLSGT NAYNTLVTYS GGPDLSRQFN SYSSLFSGTK EQLLNNIKTP
VNWASTNTEA DAKDLSPGAS GGAFQGTQPI YTITELTVTS STADGTYKIG DVIPVQVNFS
AAVTVTGTPQ LLLETGSTDR TINYSSGSGT STLTFNYTVQ SGDVSADLDY QSTTALTLNG
GTISASGTNA TLTLPATGAA GSLGANKNIV IDGQVPTVSS MVRASTNPTN VSTPVNYTVT
FSEPVTGLNA SDFTFTVLPI SGGPTPSVTT VTPVSSTVYT VTVNTGSTSG QFHMDLPASG
SGIADLAGNA MIASFQGNVY SITAPPTTVT SVTSSTADGT YKTGDVISVQ VNFSAAVTVT
GTPQLLLETG TTDRTINYSS GTGTTALTFN YTVQAGDNTA DLDYQSTTAL TLNGGTISTS
GTNATLTLPA PGAAGSLSAN NDITIDAIAP VAPSTPDLVA ASDTGISSTD NLTNVTTPRI
TGNAEANTTI TLYDTDGITV IGSAFVNGAG KWTVNISTPL SQGDHTIKAT ATDAAGNISV
LSSGLLFTID TTAPTLAITS NVSTLKAGET ATITFTFSEN PETSFTWDGT TGSIVVSGGT
LDAITGTGLT RTATFTPTAA QNNGTASITV SAGAYTDAAG NNGGAGITPA LSFDTQLPAA
PSTPVLASAS DTGIPGDNIT KLATPAFTGT AEANATVILY DTDGTTSLGT IAADGSGNWS
ITSSALTQGS HTIKATATDA VGNVSILSSG LVVTVDNTAP TLAITSNVST LKAGETATIT
FTFSENPGSS FSLGDITVSG GSLGAVTGTG LTRTVTFTPT AAQNNGIASI TVTAGTYTDA
AGNNGGAGTT PALSFDTQLP AAPSTPVLAA GSDTGISGDN ITSVTTPAFT GTAEAGSTVT
LYDTNGTTVL GTIAADGSGN WSITSSTLTA TSHTITAKST DAAGNTGLAS AGLTIIIDNT
APNAPSTPVL AAGSDTGISG DNITSVTTPA FTGTAEPGAT VTLYDTNGTT ILGTVAADGS
GNWSITSSTL TATSHTITAK AKDASGNTST ASAGLTIIID NTAPNAPSTP VLAAGSDTGV
SGDNITSVTT PAFTGTAEPG ATVTLYDTNG TTILGTVAAD GSGNWSITSS TLTATSHTLT
AKAKDAAGNV STVSAGLTII IDTTAPNAPS TPVLAAGSDT GISGDNITSV TTPAFTGTAE
PGSTVTLYDT NGTTVLGTVA ADGSGNWSIT SSLLTATAHT VTAKAKDASG NTSTVSAGLT
IIIDNTAPNA PSTPVLAAGS DTGISGDNIT SVTTPAFTGT AEAGSTVTLY DTNGTTVLGT
IAADGSGNWS ITSSTLTATS HTITAKSTDA AGNTGLASAG LTIIIDATAP NAPSTPVLAA
GSDTGISGDN ITSVTTPAFT GTAEPGATVT LYDTNGTTIL GTVAADGSGN WSITSSTLTA
TSHTITAKAK DASGNTSTAS AGLTIIIDNT APNAPSTPVL AAGSDTGISG DNITSVTTPA
FTGTAEPGAT VTLYDTDGIT ILGTVAADGS GNWSITSSTL TATSHTITAK AKDASGNTST
ASAGLAITID GAAPTIAITS NVNTLTAGQT ATITFTFSED PGTTFAWNGS IGDVVVSGGT
LATISGTGLL RTAPFTPAPG QNNGTASITV TAGAYTDAAG NNGGAGTSPA LTFDTALPTL
SVVNISSGNA VPTIAKVGDV ATLTFTSSET VTPVVTIAGH TVIPTALGNN WTAAYTFTGA
DAEGLVAYSI AFSDVSGNTG TVVTTGNGLI TFDQSAPATP AGLAATPGDT QIVLNWTASP
ATDLAKYRIL SGTTATPSTT LADVPAGTTT YTNAGLTNGT GYYYRIQAID QAGNISAASA
DVTAVPKANQ TITFTTIATK TYGDVSFALG NANSSGGLTV TYTAADPSVV SITGNTATIL
KAGSTVITAS QTGSASYNAA LNVLQTLTVN TKALVVVNTD RSKAYGDVLA NADFTGSITG
IQNGDNITLT RSSTGAAAMA VAGTNYPIVA TLADPDSKLG NYTVTNPNGT LTVTSKTLTI
TASARTKTYG DAVTFAGTEF TTTGLINGNT VTGVTLTSTG AAATASIAGG PYPIVATAAT
GTGLNNYTIT YVNGTLTVNP KALVVVNTDR SKAYGDVLAN ADFTGSITGI QNGDNITLTR
NSTGAAATAV AGTNYPIVAT LADPDSKLGN YTVTNPNGVL TITAKTLTIT ASARTKAYGD
AVTFAGTEFT TTGLINGNTV TGVTLTSTGA SGTATVAGST YPIVPAAAVG TGLSNYSIVY
MNGALTVGRK VLTITADNKE RFAGTANPAL TVNYSGFVNG ESNSVLTTLP TISTTAITTS
PAGTYDINAS GAVAANYSFS YVKGTLTVKG GAPTNINLAG VALYENSAAG TNAGTLSSTS
DDPSATFTYT LAAGTGDTDN ASFAIIGNKI NTASVLNFES KASYSVRVKS TTQYGLSLEK
ILTITLIDVN EIPTLATIAD QTICFTTAAQ ALALTGISAG PETAQSTVLS VSSNNAGLFE
ALTVTGSGAT ATLSYRVKAG AIAGTATVTV TVKDNGGTAN GGIDTYSRTL TITVNALPVV
AINSDKGTEI SKGETVLLTA TGGSSYTWAA NSSIIGTTNA PVITVRPSQT TTYTVTVTNA
SGCTETKTIT LTVLEDFVKI KATNIMSPNG DGINDKWVID NIDFYPNNEV KIFDRTGRPI
YSKKAYDNSW EGTLNGAPLA EGTYYYIIDF GTSRPRFKGF ITITRPE