Gene Mnod_5047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_5047 
Symbol 
ID7303740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp5117126 
End bp5125825 
Gene Length8700 bp 
Protein Length2899 aa 
Translation table11 
GC content68% 
IMG OID643602677 
ProductFibronectin type III domain protein 
Protein accessionYP_002500196 
Protein GI220924894 
COG category[S] Function unknown 
COG ID[COG4733] Phage-related protein, tail component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCGCA TCGATTTCGA TCCGGCCCCC GCGCCGGAGG ACAGCGTGCG CCAGGACGCT 
CCGATCCGGG GCAGGAAGTC CTCCTCCGGC CGCGGCAAGA CCGGCGGCTC GGGCTCGACC
GCGCCCGACA CCCTATTCTC GAACGCCACG GTCCGCCTGG TGGACCTGCT CGGCGAGGGC
GAGATCACGG GCGTCGTCGG CGGCCTGAAG GGCGTCTACT TCAACGACGT GCCGGTCCAG
AACGCGGACG GCACCTTCAA CTTCAAGGGC CTCTCGGCCG ACTTCCGCAC CGGCACGCCG
GATCAGTCCT ACATGCCCGG CTATCCCGAG GTGGAGACGC CGCGGGAGGT CGGGGTCAAG
GTCACCAAGG CGACGCCGGT CACCGCCTCG ATCAGCGACG GCGAGGCCGA CCGGGCCCGC
GTCATCATCG AGCTGCCGGC GCTGTTCCTG GCCAAGAACG ACGGCTCGGT CCGCCAGAAC
AGCGTCAGCT TCCGCATCGA GGCCCGCTAC AGCGGCGGGC CGTGGGTGAA CCAGCTCGGC
GACCTCACCA TCACCGGCAA GAACACCTCG CCCTACTTCG TGTCCTACGA GGTTGCCCTG
CCGCGCAATC CGGCCGGGTC GAGCCCGCCC TGGCAGGTGC GGGTCACCCG CCTGACCGAC
GACACCGACG GCTTCAACAC CAGCCAGGAC AAGTGGACCA GCCAGAGCGA CCTCGTCTTC
TATTCGCTCA CCGCGATCCA GGACGCCAAG TTCAGCTATC CGCATTCGGC CCTGGTCGGC
CTGACCGCGG ACGCGAGCAA CTTCGGCTCC TCCGTCCCGG CCCGGACGTA TCTGGTCGAC
GGCCTGCTCA TCAAGGTGCC GTCGAACTAC GACCCGATCG CGCGCACTTA TTCCGGCATC
TGGGACGGCA CGTTCAAGGA GGAGTGGACC GACAACCCGG CCTGGGTGTT CTTCGACGTG
CTGTGGAACG ACCGTTACGG GCTCGGCGAG TTCATCAGCG TCGAGAGCAT CGACAAGTGG
ACGCTGTACG AGATCGGCCG CTACTGCGAC GTGCCGGTCG CGGACGGCCG CGGCGGGCAG
GAGCCGCGCT TCCGCTTCAA CGCCCAGATC AGCACCCAGC AGGATGCCTT CGATCTGCTG
CAGCAGATCA GCGCGATCTG GCGCGGCATG GCCTACTGGT CCTCGGGCGC GGTCACGGCC
ACCCAGGACC GGCCCGACGA CGTCCGCCAG CTGGTCACGC CGGCCAACGT CATCGAGGGG
CTGATCACCT ACAGCTCCTC GGGGCGCAAG GCCCGCCACA CCGTGGCGCT GGTCAGCTGG
ACCGACCCGG ACAATCTGTT CAAGCCGCAG ATCGAGGTGG TCGAGCACGG CGAGGGCATT
GCCCGCTACG GCTACAATCC GACCAAGATC GACCTGCTCG GCTGCACCAG CCGGGGCCAA
GCCCACCGCG AGGGCCTGTG GCGCCTTCTG GTCGAGAACT ACGCCACGCA GACCGCGACC
TACCGGGCGG GCCTCGACCA TGCCGTGCGC CGCCCCGGCG ACATCATCGC GATCGCCGAC
CCGCAGATCA GCACCATCGA CGCCGGCGGG CGCCTGAAGG CCGGCTCGAC GGCCTCGACC
CTCCTCCTCG ACCGCCCGGT CACCCTGAAG AGCGGCGTGC CCTACGAGAT CTCCGTCACC
CTGCCGGACG GCAGCGTGGC GGAGCGGCAG ATCGCGACCC TGGCCGGCGT CGACCTCACC
GAGGTCAACG TCTCGCCCGC GCTGCCGGCC GTGCCGGACG CCGCGGCGGT GTGGCAGATC
GCCGGCGAGG TGGTGCCGCA GCTGTTCCGG ATCGTCGGGA TCAAGGAGGT CGAGCCGCAC
ATCTACGAGA TCCAGGCGCT CCAGCACGAG CCCTCGATCT ATGCGGCCGT AGACGACGGG
GCGGCCTTCG AGCCGCTGAC CATCGGCGAG TTCCCGAACG TCGTTCTCGC CCCGACGAAC
CTGTCGGTGA GGGAGAGCAC CTATTTCGAG AACAACCTGC CGCGGCAGAG CCTTCTCCTC
AGCTGGACGG CCGGCCAGCC CTTCAACTCG GTTGCCTACT ACGTCACCGC GGTGAAGCCG
AACGGCTCGC TGGTGACCCT ACCGAAGCGC AGCACCACCT CGGCGGACTT TGACGACGCG
GCCACGGGCG AGTGGACCTT CATCGTCCAG GCGGAGGGGT TGAACGGCCG CCTCTCGGAA
GCTGCCCAGA TCATCTACAC GGTCCAGGGC TGGGAAGGCC TGCAGGGGCC GACCGTCACC
GGCCTGCAGG TCAAGGGCGG CGGCAGCGTC TTCACGGGGC GGAGCTGCAC CCTGGAGTGG
GGCCTGACCT GGCCGGCGGA CGTGAGGCCC TACGAGGTCG GCTACGCCTT CCGGGTCTTC
GACGCGGACA CCACCGCGCT CCTCCACACC GAGATCATCA CCGCCGCGCA GGCGACCTAC
GACTACGAGG AGAACCTCAA CGAGGGCGGG CCGCGGCGGC GCTTCCGCGT GTCGGTCGCC
GCGCGCGACG CGATCGGCCG CGAGAGCCAG CCCGCGGTGC TCGTCGTCTC CAACCCGCCG
CCCGGCGTGG TGGTCCCCAC CGCCACCTGG ACGACGGAGA GCATCGCGGT CCAGTACACC
CCGCCGGGCG ACCCGGATCT CCGGGGCGCC CTCATCTGGG TCAGCAGGAC GTCCGGCTTC
AACCCGCTGA CGACGGCGCC GGTCTACGAT GGGCCGAACA CGCTCCAGTT CTTCACGGCC
GATCCGGACA CGTGGTACTA CGTCCGGGTC GCGCTCTACG ATGACTTCGG CAAGAACCCG
GCCGAGCTGA ACATCTCGGG CGAGATTGCG GTCCGCACGA ACGACCTCAT CATCGACGTC
CAGGCGCCCG ACATCCCGAG CGGGCTGACG CTCGCGACCG CACTCGAGGT CTCGGCGACC
GGCGTTGCCA GCTCGCGCAT CGACGCCGTC TGGAACCCGG TTGGCTCCAC CAACATGGGG
TATTTCGAGT TTGAGATGAC GGAGGGCGAC GGCGTCACGA ACCCTTCGTG GACCCGCGAC
GCCGCCGATG CGGGCCAGCC CAAGTTCACG TGGCGCAACC TGAAGCCGGG GCAGCTCTAT
GCGGTTCGTG TTCGCTCGGT GAACAAGACC CGCGAGGCGG TCTCGGGCTG GTCCCCGATC
GCCACCATCA CCGCGGCCAA GAACACCAAC AAGCCCGGCG CCATCACCAA CTTCACCGTG
GACGCCGCCT ACCGGACGGC AAGCCTGTCG TGGACCAATC CGAGCGACCC GGACCTTGCC
GCGATCGAGG TGTGGATCGG GACGCGCGAC GACGGGGCGG ACGCCACCCT GTTCGCCAAG
GTGCCGGTCC CGCTCAACTT CTTCAGCGAC ACCACCCTGG AGATCGTACA GACCCGCAAA
TACTGGGTGC GGCCGGTCAA CAGCTCGGGC GCGGCCGGCG ACTTCGTCGG CCCGAAGACG
GCGACCACGG CGGCTCTCCC GGCGGCGGCG CTCCAGAACG AGACCATCGA CGCCACCAAG
ATCGCCTCAT CCATCGTGGC GCCGGCCGTG GTCACCAGCC TGCCGGATCC GGCGACCTGG
ACCGGCCCGA AGCTCGCCTA CAACGCGACG GACGGCAAGC TCTACCGGCT CGTGAACGGC
GCCTGGACGG CGGCGGTTGC GGCGGTCGAT GTCGGACCGG GCCTGACCGC GGCCCAGATC
GCCAGCGTCA ATGCGGCGGC GGTCGCCGGC CAGCTCACGA AGGAGCAGAT CGCGAGCATC
AACGCGGCCT CGGTCGTCGG CCAGATCGTC GCCGCGCAGA TCGAAAGCAT CACGGCGGCT
CAGATCAGCG GGCAGCTCAC GGCCTCGCAG ATCGTCTCGC TGGCAGCCAC CCAGATCTCC
GGCACTCTGT CGGACAGCCA GCTCGCCGCC ATCAGCGCCT CGAAGCTGGT CGGGCAGGTG
GTCGCTTCGC AGATCGCCTC GCTCGCCACC TCCCAGCTCA CCGGCACGAT TTCGGATGCG
CAGATCGCCC AGCTCTCGGC CGCCAAGGTC GCAGGGCAAC TCTCCGACGC CCAGCTTGCA
GGCATCAGCG CCTCCAAGCT CGTCGGACAG GTGGTCGCGA GCCAGATCGC CTCGCTGACC
GCCGCGCAGA TCACGGGGCA GCTCACAGCG GCCCAGATCA CCTCACTGAC GGCTGCCCAG
ATCACCGGGC AGATCGGCCG GACGCAGATC GCGGACAGCG CCATCGACAC GCCCCAGCTC
AACGCGGGTG CGGTCTCGAC CGCGAAGCTC GCCGCCTGGG CAGTCACGGC CTCGAAGCTC
GCCGTGGCGA GCCTGAACCT CGCGACCAAC GGCGGCCTCC AGCAGGGTTC CTCCGGCTGG
GTCGCTGGCC CGGGTAACAC GGGCGTCAAC CCGATCGTGG AGGGCATCCG CACCGACTAC
GCGCCGGCCG GCATGCGGGT GCTCTCGGCC CGCTACGATA CGGCCCCGAC CACGGGGACC
TACGCCGAAT TCATCTACTC CAACCCGGAT GCGAGCGGGG CCGGGCAGCG CATCCGAGTT
ACGGGCGGCG CGCTCTACGA GGTCTCGGCC TACATCTCGG CCCACCGCTG CTCGGCTTAC
GTGTCGATCA TCTGGTGGGA CGCCGCCGGT ACCTACATCA CGGAAGCCAC CGGCAACAGC
ATCGTCGCGA CGCAGCTCGC CTCCGGCTCC TCCCTGGCGG ACTGGGACAA GTTCGCCCGC
TCCTGGGTGA TCGCCACCGC GCCGGCCAAC GCCGCCTTCG CAGACATCCG TATCCGCTGG
ACCGGCTTCG GCATCCAGCC CTACTGCATG GCCGGCGGGC TGCTGTTCGC CCAGGCGGTC
GCGGGCCAAA CCGAGCCGAG CCCGTATTCC GACGCGGGCG TCACCATGAT CGAGGGCAGG
AACATCCGCA CGGGCTCGAT CTACGGCGAC CGGCTTGTTG CCCGCACCAT CACCGCAGGG
CAGATCGCCA CGGGAACAAT CACGGCCACG GAGATCGCCG GCTCGACCAT CACGGCCGAC
AAGATCGCCG GCCGCACCAT CACGGCCTCG CAGATCGCGA CCGGGACGCT CACCGCCACC
GAGCTGGCGG CGGGCTCGGT GACCACCTCG AAGCTCGCGG TCGCCAGCCT GAACCTCGCC
ATCAACGGCG ACATGGGCAA CCTGAACACG CTGGGCAATC CGGCCGCCTG GAGCAGCGAC
AGCTCCGGCC TTGCCGGCGC CACCATGCTC CCCAGCGGGG TCGACTACCT CTACTGCCCG
GCCGGCCTGC GGGCCATGAA GGTCGCCGCC ACCGCGGTTC CGACCAACCA GGGCTCGATC
GGGCAGGTCT GGTTGAGCCG GGTCGAGACG GACGGCACTC TCTATCCCTT CTCGGTGCGC
GCCGGCACGA CTTACGAGGT CTCGGCCTAC CTCAGTGCCC ACCGCTGCAC CGCCTATGTC
GGGTTGGTCT GGTATGACGC CAACCAAGCC TATATCGGCG AGACCTGGGG CAGTCAGATC
GTCAACTGGG GCGGCAGCAT CTACAACTCC CAGGCGGATT GGGAGCAGTT TGGCCGCTCG
AAGCTGATCG TCATCGCCCC AGCCGGTGCA GTCTACTGCC GGCCTTACGT CCGCTATGGG
ATCACATGGT ATCCGTCAGA CCCTGCACCG CCCTACTGCT TCATCAGCGG TGTGATGCTG
GCGGTCGCAG TGGCAGGGCA GACGGAGGTC TCGCCCTTCT CGCCGGCCGG CCTCACCACC
ATCAACGGCG CCTCGATCCG CACCGGCTCG GTTGCCGCCG ACAAGATCAT CGCGAAGTCG
ATCACCGCAG GCCAGATCGC CACCGGCACC ATCACGGCGA CCGAGATTGC CGGATCCACG
ATCACCGGGG ATCGGATCGC CGGCAACACC ATCACGGGCG GCAACATCCA GGCGGGCAGC
CTGACCGCGC GCGAACTCGC GGCCGGCTCC GTCACCACCT CGAAGCTCGC GGTCTCCAGC
TACAACGTCG CCTACAACAC GGAGCTGGCG CAGAGCACCC GGGGCTGGAG CATCTCCGGC
ACGAGCTGGG GTGCGAACAT GCCCCTGATC TGGCAGGAAA CCCTTTGGGT GCCGGCCGGC
GCGACGGGCC TCCGGCTCAC CTGCGACGTC GCGGCGGCGG CCGTCCCGGC AGGCGGCTAC
TTCGAGCTGT TGCACGGCCG CGCCACCGCC ACCGGCCTCT GGGCCTTCTA CCCCTGCGTG
GGCGGATCCT TCTACGAGTT CAGCGTCTAC GCGACGGGGC ACCGCTGCGC GCCCCAGATG
TATGTCCAGT TCCTCGACGC GGCCGGGAAC CATGTCGCCT ACGCGGGCGG CGTGGTCGGT
GCCCCCAACC AGGGCGTGGC CGGCGGCGCC CTGAAGGACT ACCCCCGCCT CGGGTTCATC
GCCCAGGCTC CGAGCAACGC AACCCAGTTC CGAGTCATGT ATCGGGGCGT CAACATCACG
GGTGACAACC CGATCGTGAT GCTCGCCGGC ATGATGTATG CGGGCGCCCG CGCCGATCAG
ACGGAGTGTT CGCCCTACAC CCCACCCGGG GTCACGCTGA TCGACGGCGG CACCGTCATG
ACGCACACGC TCGCGGCGGA CCGACTCGTC GCACGGTCGA TCACCGCCGG GCAGATCGCG
GTCGGCGCGA TCACGGCAAC CGAGATCGCC GGCTCGACCA TCACGGGCGA CCGCATCGCC
GGGTCGACCA TCACCGGAGA CAAGATCCAG GTCCGCACCA TCACCGCGGC CAATATTGCC
GGCCAGACCA TCACGGCCTG GGAGATCGCC GGCAACACCA TCACGGCAAA CCAGATCGCC
GGCCAGACCA TCCTCGGCTG GAACATCGCC GGCCGCACGA TCTCGGCGGA CAAGCTCGTC
GCCAACTCGA TCACGGCGGG CGAAATCCAG GCCGGCGCCA TCGGCGTCGA TCAGCTCGCG
GCGGGCGCGA TCACGGCCGA CAAGATCGGC GTCGGGCTCA ACTCGACCAA CCTGCTCTAC
AACTCGGACT TCAAGGCGGG CGCGGCCAAT GCGATGCCGC CGGGCGTTAC CGGCTGGGGA
TCGAGCGTCA CGGTCAGCGC CCCGTATGTC GGGCTCAACC AATCGGGCCC CGGCTGGCAG
CCGACCGGCA TGGGCTCGCT CCAGATGTCG TGCGCCGGCA CGCCGCCGGC GGGCCAGATC
CTGGATGCGT ATCTGGCCTA CCCGCGCCAG GACGGTGCGT GGGAGACCAA GTTCCCGGTT
GTCGGGGGCA AGCGGTATGA GGTGTCCGGC TATGTGAGCG CCCACCGCTC GCAGGCCTAC
ATGGTGGTGT CGTGGTTCGA CGCGGGCGGC ATCTATCTCG GGGCTACGAC GACGACCGTC
GTTGAGAACC AGCAGTCGAG CGGCAACCTG AACGCCTGGG CACGCTGCAC CGGCATGGGC
ACCGCGCCGG GTGACGCCGC GACGGCGACC GTCTCGTTGC GCATGGCCTT CAACGGCGGC
AACGGACCCT ACAACTTCTG GAGTGGCATT TACTTCGCCC AGGCGAAGGC CAACCAGACC
CAATACTCGG ACTGGGCGCC GGGCTCCTCG ACCGTGATCT GGGGCGATAC CATCGCCACC
GGCACCATCC ATGCCAACCG GATCACCGCC GGGACGATCA CGGCCGACCG GATCGCCGCG
AACGCCATTA CCGGCGGCCG GATCGCCGCC AACACCCTCA CCGGCTGGCA CGTGGCGGCG
AACTCGATCT ATGCGGACAA GATCGCGATC GGCGGTGGGC AGGCGCTCAC CTCGTGGATG
GGTTCGGACA CCACCAAGAT CAACGGCGGC GCCATTCAGG CCAACTCGAT CCTGGTCAAC
TCCCTGGTCG TCGGCCTGCG CGGCGTCCGC ACGGTGAGCC TCGATTTCTC GGTCGACAAG
ACCACGCGCG TCCTCTCGTG GACCGCAGGC TACGTCCTGT GGATCGATGA CACGGGTGCC
AACCGGGCCG ATCAGGTCGC AGCCGGCAGC TTCAACACGG GCGGCGGCTA CACCTATGTC
TGGTGGAACA AAACCACGCC GGGCCAACTG TATGCCGCCA GGGACAACTG GCCCGACATC
TTCAGCGACA AGAACGCGGT GCTGCTGGCG TCGTATGATG GGTACGCTGG GATCAACAAC
TTCGGCGGCG GCACGATCAT CGACGGATCC CGGATCAACA CCGGCACGAT CACCGCCAAC
CAGATCGCGG CCAACGCCAT CCAGGCCAGC CACATCGCGG CCGGACAGAT CACTGCGGAG
AAGATCGGCG CCGGGCAGGT CACTGCGGAC AAGATCGGGG CCGGCACCAT CACGGCGGGG
GACATCTTCG TCGGGAACAC GCAGTTCGTC CTCCAGGCCA ACGGCGGCGC CCCGCGCATG
TTCGTGCGCG ACGGCGGCGG CAACCTGCGG GTGCTGATTG GCCAGACGAA CGGCTTTACC
CCGCGCGGTG GCGACTGGGG CATCGCGATG TGGGGGAACG ACAACACCTA CATCCTCGGC
CCGGACGGCG TGAACGGCGG CGGCATCTGG GTCAGCTCGA TCGCCGCCGA CAAGCTCTCG
GTGACGCAGC TCTCCGCCAT CACCGCCAAC GTCGGCACGA TGACCGCCGG CCTGCTTCAG
AGCGGCGATG GGCAGATGCA GGTCGACCTC ACCAACAAGC GCATCCTCAT CTGGGGCTGA
 
Protein sequence
MDRIDFDPAP APEDSVRQDA PIRGRKSSSG RGKTGGSGST APDTLFSNAT VRLVDLLGEG 
EITGVVGGLK GVYFNDVPVQ NADGTFNFKG LSADFRTGTP DQSYMPGYPE VETPREVGVK
VTKATPVTAS ISDGEADRAR VIIELPALFL AKNDGSVRQN SVSFRIEARY SGGPWVNQLG
DLTITGKNTS PYFVSYEVAL PRNPAGSSPP WQVRVTRLTD DTDGFNTSQD KWTSQSDLVF
YSLTAIQDAK FSYPHSALVG LTADASNFGS SVPARTYLVD GLLIKVPSNY DPIARTYSGI
WDGTFKEEWT DNPAWVFFDV LWNDRYGLGE FISVESIDKW TLYEIGRYCD VPVADGRGGQ
EPRFRFNAQI STQQDAFDLL QQISAIWRGM AYWSSGAVTA TQDRPDDVRQ LVTPANVIEG
LITYSSSGRK ARHTVALVSW TDPDNLFKPQ IEVVEHGEGI ARYGYNPTKI DLLGCTSRGQ
AHREGLWRLL VENYATQTAT YRAGLDHAVR RPGDIIAIAD PQISTIDAGG RLKAGSTAST
LLLDRPVTLK SGVPYEISVT LPDGSVAERQ IATLAGVDLT EVNVSPALPA VPDAAAVWQI
AGEVVPQLFR IVGIKEVEPH IYEIQALQHE PSIYAAVDDG AAFEPLTIGE FPNVVLAPTN
LSVRESTYFE NNLPRQSLLL SWTAGQPFNS VAYYVTAVKP NGSLVTLPKR STTSADFDDA
ATGEWTFIVQ AEGLNGRLSE AAQIIYTVQG WEGLQGPTVT GLQVKGGGSV FTGRSCTLEW
GLTWPADVRP YEVGYAFRVF DADTTALLHT EIITAAQATY DYEENLNEGG PRRRFRVSVA
ARDAIGRESQ PAVLVVSNPP PGVVVPTATW TTESIAVQYT PPGDPDLRGA LIWVSRTSGF
NPLTTAPVYD GPNTLQFFTA DPDTWYYVRV ALYDDFGKNP AELNISGEIA VRTNDLIIDV
QAPDIPSGLT LATALEVSAT GVASSRIDAV WNPVGSTNMG YFEFEMTEGD GVTNPSWTRD
AADAGQPKFT WRNLKPGQLY AVRVRSVNKT REAVSGWSPI ATITAAKNTN KPGAITNFTV
DAAYRTASLS WTNPSDPDLA AIEVWIGTRD DGADATLFAK VPVPLNFFSD TTLEIVQTRK
YWVRPVNSSG AAGDFVGPKT ATTAALPAAA LQNETIDATK IASSIVAPAV VTSLPDPATW
TGPKLAYNAT DGKLYRLVNG AWTAAVAAVD VGPGLTAAQI ASVNAAAVAG QLTKEQIASI
NAASVVGQIV AAQIESITAA QISGQLTASQ IVSLAATQIS GTLSDSQLAA ISASKLVGQV
VASQIASLAT SQLTGTISDA QIAQLSAAKV AGQLSDAQLA GISASKLVGQ VVASQIASLT
AAQITGQLTA AQITSLTAAQ ITGQIGRTQI ADSAIDTPQL NAGAVSTAKL AAWAVTASKL
AVASLNLATN GGLQQGSSGW VAGPGNTGVN PIVEGIRTDY APAGMRVLSA RYDTAPTTGT
YAEFIYSNPD ASGAGQRIRV TGGALYEVSA YISAHRCSAY VSIIWWDAAG TYITEATGNS
IVATQLASGS SLADWDKFAR SWVIATAPAN AAFADIRIRW TGFGIQPYCM AGGLLFAQAV
AGQTEPSPYS DAGVTMIEGR NIRTGSIYGD RLVARTITAG QIATGTITAT EIAGSTITAD
KIAGRTITAS QIATGTLTAT ELAAGSVTTS KLAVASLNLA INGDMGNLNT LGNPAAWSSD
SSGLAGATML PSGVDYLYCP AGLRAMKVAA TAVPTNQGSI GQVWLSRVET DGTLYPFSVR
AGTTYEVSAY LSAHRCTAYV GLVWYDANQA YIGETWGSQI VNWGGSIYNS QADWEQFGRS
KLIVIAPAGA VYCRPYVRYG ITWYPSDPAP PYCFISGVML AVAVAGQTEV SPFSPAGLTT
INGASIRTGS VAADKIIAKS ITAGQIATGT ITATEIAGST ITGDRIAGNT ITGGNIQAGS
LTARELAAGS VTTSKLAVSS YNVAYNTELA QSTRGWSISG TSWGANMPLI WQETLWVPAG
ATGLRLTCDV AAAAVPAGGY FELLHGRATA TGLWAFYPCV GGSFYEFSVY ATGHRCAPQM
YVQFLDAAGN HVAYAGGVVG APNQGVAGGA LKDYPRLGFI AQAPSNATQF RVMYRGVNIT
GDNPIVMLAG MMYAGARADQ TECSPYTPPG VTLIDGGTVM THTLAADRLV ARSITAGQIA
VGAITATEIA GSTITGDRIA GSTITGDKIQ VRTITAANIA GQTITAWEIA GNTITANQIA
GQTILGWNIA GRTISADKLV ANSITAGEIQ AGAIGVDQLA AGAITADKIG VGLNSTNLLY
NSDFKAGAAN AMPPGVTGWG SSVTVSAPYV GLNQSGPGWQ PTGMGSLQMS CAGTPPAGQI
LDAYLAYPRQ DGAWETKFPV VGGKRYEVSG YVSAHRSQAY MVVSWFDAGG IYLGATTTTV
VENQQSSGNL NAWARCTGMG TAPGDAATAT VSLRMAFNGG NGPYNFWSGI YFAQAKANQT
QYSDWAPGSS TVIWGDTIAT GTIHANRITA GTITADRIAA NAITGGRIAA NTLTGWHVAA
NSIYADKIAI GGGQALTSWM GSDTTKINGG AIQANSILVN SLVVGLRGVR TVSLDFSVDK
TTRVLSWTAG YVLWIDDTGA NRADQVAAGS FNTGGGYTYV WWNKTTPGQL YAARDNWPDI
FSDKNAVLLA SYDGYAGINN FGGGTIIDGS RINTGTITAN QIAANAIQAS HIAAGQITAE
KIGAGQVTAD KIGAGTITAG DIFVGNTQFV LQANGGAPRM FVRDGGGNLR VLIGQTNGFT
PRGGDWGIAM WGNDNTYILG PDGVNGGGIW VSSIAADKLS VTQLSAITAN VGTMTAGLLQ
SGDGQMQVDL TNKRILIWG