Gene MCA2227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA2227 
Symbol 
ID3103285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp2407392 
End bp2417441 
Gene Length10050 bp 
Protein Length3349 aa 
Translation table11 
GC content67% 
IMG OID637171372 
Producthemagglutinin-related protein 
Protein accessionYP_114646 
Protein GI53803747 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACCC GCGAAGCTGC AAACCGCAAA GCCCATCTTC CCCCCGCCGA CCTCGGCCTC 
AAACCGCTCG CCGCCTCCGT CCGTACCGTA CTGGCCGGCG CCTTCCTGGC CGGCGGCGCC
GCGCATGCGG GAAGCCTGCC GGTGCCTGCC GGTGTCTTCG TGTCGAGCGG CAGCGCCGAC
CAGTCGGTCG CCGGCAACCT GATGACGATC AATCAGCACT CGGACCGCGC CATCCTGAAC
TGGAAGAGCT TCGACATCGG CGCCGGAAAC ACGGTGCAGT TCAAGCAGCC CTCGGCCTCG
TCCATCGCCC TCAACCGCAT CTACCAGAAC GATCCCAGCC GCATCTTCGG CAGGCTTTCC
GCCAACGGCC AGGTTTACCT GCTCAACCAG AACGGATTCC TGTTCGGCAA AGGCTCGCAG
GTGGACGTCA ACACCCTGCT GGTCAGCACG CTGAACATCA CCGACGATAC CTTCCAGCGC
GGTATCACCA AGGTCCTGGA CCAGGACGGA CGGGCTGCGC TGGTGGGCGA CGGCAAAGTC
TACCGGGTCG ACGACCAGGG CCAGTTCGTC CTTGACGAGA AGGGGAATCG CGTCAAGACC
GGCATCGAGT TCGCCGAAGG CTCCTCCGTC AAGACCGCCA ATTCCGGCCG CATCATCGCC
GCCGCTCCTT CCGTGGTGAA CAAGGGCGAT CTCTCCTCGC CGGACGGCCA GATTCTCCTC
ATCGCCGCCA CCGACAAGGT CTATCTGCAG GAAGCCGGCA AAGACTCCAC CCTGCGCGGC
CTTCTGGTCG AAGTCGGGAC CGGCGGCGAA GTCGCGAACA TCGGCAGGGC ACTGGCGGAG
CGCGGCAACG TGACGCTGAT GGGATTCGCC GTCAACCAGC AGGGCCGGGT GTCGGCGACC
ACCTCGGTTC GCGTCAACGG CTCGGTGCGG CTGCTGGCAC GGGAAGGTGC CAGCGTGCGG
CGTGAGGGCG ATGCCTGGCT GTTGCAGGCG AACCGCACCA AACGCAGCGC CCCCCTGGAC
GACGGCCTCG GCACCCGGGC CACGGTGACA CTGAAAGGGG GCAGCAAGAC TTCGGCGAAT
CCGGACCTGA ACGACCCGGC CACCGCCGTG GACGGCCAGG CGCAGGATGC TTCCTGGGTG
GAAATCATGG GACACCAGGT CCGGATCGAG AACGGTGCGC AACTGGTGTC CCGCTCGGGC
AAGGTCACGG TGACGGCCAC CGAAAACCCC GCCAACCCCG GCCTCGACAA CGTCAAGAAC
GACGCCCGCG TCTATGTGGA CAAGGGCGCG ACGATCGACG TATCGGGCAT CAAGGACGTC
AGCGTGCCGA TGGAACGCAA CGTGGTGACG GTCGAACTGC GCAGCAATGA ACTGCGGGAC
TCGCCGCTGC AACGCCACGG CGTACTGTAT GGAAAGAAAA TACGGGTCGA CATCCGCAAG
GGCACGCCGA TCGCCGACAT CTCGGGCGAA CTGGAGCGCA TCGCCCGCAC CGTGGCCGAA
CGCAGCACCG CCGGCGGTAC CATCAAACTC ACATCGGAAG GCGACGCCAT CCTCCAGCGG
GGCGCGCTGC TGGACTTCTC CGGCGGTTCG GTGGCCTACC GCTCCGGCTA CATCGATACC
ACCCAGTTGC TGACGCCGGA CGGCAAGACG GTGGACATCG GCAGCGCCGA TCCGAATCAG
ACCTATGCCG GCATCTTCGG CCAGGTCACC CAGAAGTTCA AGGCCTGGAA CGTCACCAAG
ACCTGGGATA TCATCGGCCC CAGGAACCTG GGGCGCTACG AACAGGGCTA TGTGGAGGGC
AAGGCCGCTG GCACGCTCGA CATCAAGGCG GCGGCGCTGG CGCTGGAGGC CGAGATGCGC
GGCGCCGCGA CCGCCGGTCT GCACCAGCGT GAGGCGGGAA CCCAGCCCGC CGGCGGAACC
CTCAAGATCG ACCTGGCGCG CAGCCCCGAC AGTACCCAGT CCGTGATCTT CGGCAGCGCC
CCGGGATCTC TGGGCATCGG CAAAGACCAG CCGTTTCCCC AAGATCCCGA GAAGCCCGGC
CAGCCGGCCG CCCTGGTGCT GTCCGGCAAC AAGCTGCGCG ACTCCGGCAT CATGTACGCC
GATATCAAGA CCAACGGCAA GGTCGCCATC CGCTCCGGCG AGAACCTCGC CATGACCGAC
GGAGGCAGCC TCGCGCTCAC CGGCGGCGAA ATCAAGGTCG ACGGAACGAT CACGGCCCAC
GCCGGCGAAG TCGATCTCTC GACCCGGCTC ACCAGCGCCA CCCAGGGCAA GCTGTCCGGT
GCCATCGACC TGGGCGCAGG CGCCACCATC GACGTCAGCG GCCAGTGGAT CAACGACCGT
CCGGCCGACG CTACCGGCCA CAGCGGGCAG GACCGCAGCC GCGTGCTGGT GAACGGCGGT
ACCGTTTCGG CTAAGGCCGA AGGTGACGTC AATCTCGCCG CCGGCAGCCG CATCGACGTC
AGCGGCGGAG CGCGGCGCAC CGGCAAAGAC AGCATCAAAG CCGGCGACGC CGGGAGCATC
AGCCTGGAAG CCGCGGCCGT CGACGGCTCG GACCTCAAAC TGAAAGGCAC GCTCGAAGGC
TATGCCTTCG CCGGCGGCAA GGGGGGGGCG CTCAGTCTCG TCTCCGACCA GGTGATCCTG
GGCAACGCCA CGGACATCGG GACTGCCAAG GGCGCCGACC CGCTGGTCCT CACCCCGGAT
TTCTTCGGCC AGGGCGGGTT TGCGCATTAC TCCGTAGGCT CCAACAAGAG CGGGGTCACC
GTCAGCGACG GAACCCGTAT CCAACTCAGC GTCCAGAACC GGGTCATCGA TCCGACAGCC
GTCCGCCGCG TCAGCGGCTC GAACTTCCGC GACTTCGCGC AGGTCGAGCT GCTCCCCGAA
CTGTCACGCC AAGCGGGAGA GCTCGATCTG GTGCTGGCCC AAAAGGTCGG GCAAGGCGGA
AAGGACGCTG CTGTGCGCAT TGGCGACGGC GCGGTGATCC ATACCGATGC CGGCGGCAAG
ATTTCGCTGC AATCGGATTC CAGCATCTTC ATGAACGGAA CCCTCGAAGC TGCCGGCGGC
GACGTGGCGA TGACCGTCAC GCCGCCGGCC GGCACCGACC CCGGCTTCAA GGCGGACCAG
GGCATCTGGA TCGGTTCCGG CGCGAAGGTG GACGTGTCCG GCACCGCCCT GCTCTACAGC
GACCGGCCGG ACCACATGAC CGGCAAGGTG CTGGACGGCG GAACCATCAC CCTGCACGCC
GACCGCGGCT TCGTCGTCGC GGACGAGGGC TCGGAACTCT CCGTCTCCGG GACCCAGGGA
CGCCTGGACG TACCGGTCAG GGGACCGAAC GGCGCCATCT CCCAGGAGCG CCGCCAGATC
GCCTCCTCCG GCGGCAGCAT CGCGATCCGC GCCGCCGAGG GCATCCAGCT CTACGGGAAG
CTGGACGGAC GGGCCGGTGA CGGAACCGGC GCCGCGGGCG GCGGACTGTC GCTGGAACTC
AATCCGAACA CCCGGGCCGA ACCCGACGAG ATCGGCCAAG GCCAGACGCC CTTCCCCAAG
GTCCCCAGCG TGATATCGCT GAACCAGTCC GGACCCGCGG GGACAGCCAC GGTCCAGGGC
GAGGCCGTGG CCTCCAACCG CTACGGCCAG GCGGTGCTGA GCGCCGGCCA GGTCGAACGC
GGCGGCTTCG GCGAACTGAC GCTGCGCACA CCGGGCCGCA TCGAACTGGG CAGCGGCCTC
AACCTGCACA CCGAGCGCAG CATCGTCCTC GACGCGCCGG TGCTGGCCTT CGCGCCCGCC
ACCGGCGCCA CCGCCGGCAA GGTGTCGCTG GATTCGGCCT ACGTCGCGCT GGGTTCGACC
CAGACCCGGC CTGGCAACGC CGCTCCGACC ACCGGCAACG GCAGCCTGCA AGTCAACGCC
GGCCTGATCG ACCTGGTCGG CACCACCGCC CTCCAGGGCT TCGACGGCGC CAACCTCAAA
AGCGCCGGCG ACCTGCGGCT GATCGGGGTG CGCACCACCC AGCAGCAGCG GGACTTCCTG
GGCGAGTTCC TGGCGGCCGG CGATCTCACC CTGACCGCCG ACCAGATCTA TCCGTCGACC
CTGAGCGACT TCCGGATCGC GGTGAGAGGC ACGAGCGACG GGACGCTCAC GGTCAAGCCA
GGCGACTCGA AAGGGGGAAC GGTGCTCTCC GCCGGCGGGA AACTGACGCT GGAGGCTCCG
AACATCGTCC AGGGCGGCAC CGTGAAAGCG CCGCTCGGCC AGCTGAATTT CCAGGCATCC
AACCGTCTCG AACTGGCCGC CGGGAGCGTC ACCTCCAATT CGGCCAAGGG CGCGATCATT
CCTTTCGGAC GCACCCAGGG CGGGCTGGAC TGGATCTACC CGCTGGGCGA CCAGAACCTG
GTCTTCACCG CGCCGCCGGA AAAGAAACTG GTGCTCGACG GTCCGAAGGT CGACATCGCC
GACGGCTCGG TGATCGACAC ACGGGGCGGC GGCGATTTGT TCGCCTTCGA ATTCGTGCCG
GGACCAGGCG GTTCCTACGA TCTGCTCGAC CCCACCAGCC AGGGCTACCA GGAAGCCTAC
GCCGTCCTGC CCTCCTTCAA GGGTACGACC GCGCCCTACG ATCCGCTGGA ATCCGCCACC
TCGGGCCTCA AGGTCGGCGA CAGCGTCTAC CTCTCCGGCG GCGGCGGACT CAAGGCCGGC
TATTACGTCC TCCTGCCGGC CCATTATGCG CTGCTCCCCG GCGCCTATCT GGTGACGCCC
GACAAGAACG TCACCAACAT CGTGCCGGGC CTCGACCTGC AGCGTGCCGA CGGCGCTCCG
ATCGTCGCCG GTTACCGCAC CGTTTCCGGC ACGGGCATCC GCGACGCACT CTGGAGCGGC
TTCGCGGTGG AACCCGGCAG CGCCGCCCGG ACCCGCGCCG AATATTCGAC TTACGGCGCC
AACGCATTCT TCTCTGCCAA GGCTGCCAAG GATGAAACCG CGCTCCCCTA TCTGCCGCGC
GACGCGGGCT CCCTGTTCAT CTCGGCGGAG ACGGCGCTGA GTCTGGACGG ACGGGTGTCG
GCCAAAGCCG GCGCCAAGGG CCGCGGCGGC CGCCTCGACA TCGACGCCAC CAACATCGCC
ATCGTCTCCC AAGGCAACGC TGGGCAGGCG ACCGGCAGCG CCATCAGGCT GGTCGCAGAG
AAGCTCAACG GGCTGGGAGT TTCCAGCATC GCCATCGGTG GTGTGCGCAA CACCGTGGAT
GGTACCGTCA CGGTCGACAC CCATGCCTCG ACCATCTCCC TGGAGCAGAA CGCCCGGCTC
AAGGGCTCCG AATTCATCCT CACCGCCAAG GACAACATCA CCCTCGCCAA GGGTTCCGGC
ATCACCGCCG AAGGCCCGTC CAAGAGCGTC GGGACACCGA AGGTCTATAA AGTCGACGGC
GATGCAGCCT TCCTGCGGGT GTCGACGTCG GACCAAGCCG ATCTCCAACG CAGCGGTGTC
TCGGGTCAGA GCGGCTCGAT CCGGATCGGC GCAGGCGCGA CCCTGGCCAC GTCCAACTCC
ATGATCGTCG ATGCCACGGC CAACATGGAT CTTTCCGGTA GACTCCTGGC CCAGGGCGGC
TCGGTATCGC TGGGCGCCAG CCGGATCGGG CTGGGCGCCG ACGCCAGCTT CCAGAACGGC
CTGGCGCTGA GCCAGACGGC ACTGAACGCC CTCCAGGCGA ACGAACTGCG CCTGAACAGC
GGCAGCGACA TCAGCCTGTA CGGCCCGGTC GAACTGACAT CGCAGCGGAT CGTACTGCGC
TCGGGCGGAC TGCTGGGATT CGACAACGCC GGCCAGACCG CAAGCATCAA GGCCGGAGAC
ATCCGTCTGG AGAATCCGCT CGCTGCGAGC ACCAGCCGTA CCGGCACCGG CACCGGCACT
CTCTCCCTGG TCGCCGACAC CGTCGAACTG GGCGGGGGCG CCTATGCACT CGAGGGCTAT
TCCAAAGTGA CGGTCGAAGC CGGCAAGGCC ATCGTCGGCT CGGGTACGGG CACGCTGACC
GCCTCGGCGG ATCTCGACTT CAAAACCCCC GTCGTCACCG GCGACCGCGG GGCGGACACC
CGGATCGACG CCACCGGCCA CGCAGTCGCC ATCCAGTCGT CCGGCGCCGC TCAGGCCGCC
GCCGACGCAC TCGGTGCCCA GTTGTCGATC ACCGCGGACA GCATCCGGAA CGCCGGGACG
ATTGCGCTCA AGTCCGGCGT CGTCAAGCTC GATGCCCTGA AAGGCGACGT GGTCCTTGCC
GCCGGCTCCA ACATCGACGT TTCCGGCCGG GAAGTGAGCA TCGGCAAAGC CAATGTCAAA
ACCGACGGCG GTGCGGTGGA GCTCTCCAGC CGGACCGGCA GCGTCGCTCT GGAATCCGGC
GCCAAGCTCG CGCTCAACGG CAGCAAGGGC GGCGAGCTGC AGGTGTCGGC GGCAGCGGGC
GGATTCCGCT TCGACGGCAG CATCGACGCC CGGGGCACGG AACGCGGCGG CCGTTTCGGT
CTCGATGTCC ATACCCTGGA AAACGGGGGC GACGTCGGCG GAATGGCCGG GAAGCTGGCG
TCGGCCGGAT TCAGCGACGG CATCAGCCTG AGGGCGCGCA CCGGTGACCT GCACCTGAAT
ACCGGCGATA CCCTGGCGGC CCGCACCATC GAACTCGCGG CCGACCAGGG CTCGGTCCGA
ATCGACGGCA GCCTGAAAGC CCAAGGCGAC AACGCCAGCC TGGACCTCAA GGCCGGCGGC
GGCCTGACCC TGGCTGCGGG CGCCGACCTG GAAGCGCACG GCAGTACCAG CCAGGGCGGC
CGGATCGCGT TGGAGTCCAT GGACCCGGGT CCGCAGGGCG GGATCACCGT GGCTGCGGGC
GCCCGAATCG ACGTGAGCGC GACGGACGGT TCCGCCAACG GAACCGTGAA CCTGCGCGCG
CTGCGCACGG GGCAGGACGT CGCCGTCTCG GGGAACCTCG GATCGTCCGT GACCGGGGCC
CGGGAAACCA CGGTGGAAGC CGTCCGTATC TACGATCACA GCGGCACCAT CGGCAGTACC
GACATCGCGG CCTGGAAAAC CGACACCGAC GCCTACATGG CCAACGCCGG CGCCATCGAG
AGCCGTCTGG GGCTGCCTGG CGGCCTCAGA GCCGGCCTGG AGATCCGCAG CAGCGGCGAC
CTGACGCTGG GATCGGCCGG CTGGGACCTG GTCGACTGGC GCTATGGCGG CCGTCCGGGC
GTCCTCACCT TGAAAGCCGC CGGCCAGCTC TCGATCGACG GCAAACTCAG TGACGGCTTC
CGCGATGACC CCAACGGCAT CGACGTTTCC GGCATCCTCG GCCCCGGTGC CACCGTCGCC
GTGAAAGACA TGCTGCAGAC CGGTTCTTCA TGGAGCTACC GGTTGCAGGC CGGCACCGAC
GTGGTGGTAG GCGCCGACGT CGCGGTGCGG ACCGGCACCG GTGACATCGA CGTCGACGCC
GGCCGGGACG TGGTCCTGAC CAATGCCAGC TCATCCATCT ACACTGCGGG CCGTCCGACC
GACACCCAAC GCTACGGCAA TTTCAAAAAC GGCTTCGTGG CCTTCCAGTT CTACGGCGAA
TACCCGGTCG ACGGCGGAGA CATCCATATC AGCGCCGGCC GCGACGTGAT CGGCGCCAAG
ACCGGCCAGT TCTTCGACGG CTGGCTGGTG CGCACCGGCA ACTGGACCGA TGGCACCAGT
CACCAGGGCG AAACGCCGAC GGCCTGGGCC GTGGCGATCG GCGGTCCGGT CGGCACCTCG
GCGCAGCAGG GAAGCTTCCA GCAGAACATC GGCGCACTCG GCGGCGGCAA CATCACGGTC
GAGGCCGGTC GGAACGTGTC CGATCTTTCG GCGGTCATCG CCACCACCGG CAAACAGCTC
GGCACCCCTT CCAAGCCGAA CGATCCCTCC GACACCGGCT TCAACACCAA CGAAGTGCAG
ATTTCCGGCG GCGGGAACCT GACCGTCCGT GCCGGCGGCG ACGTCCTGGG TGGTACTTTC
TACACCGGCA AGGGCATGGG GGAAATCAGC GCGAGGGGTG CAATCAAGGC ATCGACGACC
GGCCTGGGGC CGGTGCTGGC ACTGGGAGAC TCCCGGTTCA GCCTGAACGC CGGCCAGGAC
ATCGAACTGG GCGCGGCCAT CAACCCCACC GTGATCAACA GCGCTACCGC CCGCAACTTC
TTCTTCACCT ATTCCGACCG CAGCGGCATC GCGCTGGAAT CGCTCGCGGG AAACGTCCGC
CTCCAGAACG ACATCCCGGG GATGGTCAGC GCCGTCAACA ATCTGCGCAG CACGAGGAAC
CAGCTCAGCT TCCCCGGCGC TTCGTTGAGC GCGCTCGGCG TGTATCCCGC CTCGCTCGAC
GTCACTGCGC TGCAAGGGGA CATCGTGCTG GAACGCAGCT TCACGACCTA TCCCGCTGCC
CAAGCGAGCT TCAACCTCAT GGCGGGCGGA AACATCGGCA GCGGCTCGGT GGGTGACAAC
GTCAATGTCA CCCAGTCAGA CGCCGACCCC GCCCTGTTGC CCGGGATCGC GCATCCCACC
CGGAGCTGGG ACGATGCTTC GCAGAGGCTG CAACCCTTCG GCGCCGCCAA TCTGATCCAC
GCCCAGGTTC CCGTCCATCG CGGCGACAGC GAGGCGGCAA GGATATACGC CAATGGCAAC
ATCGCCTCCG TCGATCCCCT GCTCCTCGTC CTGCCCAAGG CGGTGGACGT CATGGCGGGC
CGGGATCTGT TCGATGTCAG CCTGCATGTG CAGCACCCCG ACTATGCGAT GTCCACCATC
ACCGCAGGCC GCGACATCCG TTTCACCTCC CCGCGCAACG CCCAGGGCAA TCTGGTGAAC
CTGACCCGCG AAATCCGGCT TGCAGGCCCC GGTCAGCTCT GGGTCAGCGC GGGACGGAAC
ATCGACCTGG GCGCTTCGGA AGGCATTTAC ACCATCGGCA ACACCGAAAA CCGCACCCTG
CCGGACAACG GCGCCTCGAT TACGGTCATG GCTGGCCTGA ACGGCAACCA GGCGCGCTTC
GACAAATTCG CGGAGAAATA CGACCCCACC TCGGCCCGGT ACCGCACCCT CCTGCGCGAC
TACATGCGCC GGCGGACCGG CAACGGCAGG CTCGATTACG CCGGTGCGGT GGACGCCTAT
CGGGCCCTGC CCGGCGACCA GCAGCACGAA TTCCTGCTGG CCATCCTGTT CGAAGAAATC
CGGATCTCCG CCGCCCAGGC CGCCAAGACG GGCAGCAAAT CCGCCTACGA CCGCGGCTTC
GCGGCCATCG ATACCCTCTT CCCGCAAAGC GGCGACACCC ACTACAAGGG TAATCTCAGC
CTGTTCTTCA GCAAGATCCA CACCGTCGAC GGCGGCGACA TCAATCTCCT CGTTCCCGGC
GGCGGTGTGA ATGCCGGTCT GGCCGTTGCC TTCGCCGGCT CGAAGGCCGC CAGCGATCTG
GGCATCGTCG CTCAGCGGCA AGGAGCCGTG AATGCCCTGG TAAACGGCAA TTTCATGGTC
AACCAATCAC GGGTGTTCGC CATGGACGGG GGTGACATCA CCATCTGGTC ATCCAATGGC
AACATCGACG CCGGCCGCGG TGCCAAATCG GCCATCGCCG TCCCGCCGCC GCGGATCACC
TTCGACGAAC GCGGCAACCT GCAGGTCGAA TTCCCACCGG TGGTATCGGG CAGCGGCATC
CGTACCGCAG CCAGCACGGC CCCCGTGCCC GGTGACGTAT TCCTCGCCGC CCCGCGGGGC
GTGGTCGATG CCGGCGAGGC GGGCATCGGC GGCACCAACG TGGCGATCGC GGCGACGGCC
GTACTCGGCG CCAGCAACAT TCAGGTCGGA GGCACCGCCA CCGGCGTACC CAGCACCAAT
GTCAGCGTCC CGGTGGTTCC GGCCGGCGCG GCAGCCGCTG CGGGCGCGGC CACACAGGCC
GCCATGCAGT CCACCGTGTC CGACAGCGAG GAGAAGTCCG AACCCAAGGT GGCGAATTCC
AGCGGCCTCA ACCCGCTCAA AGTCGAGTTG CTGGGATTCG GCGAATGTTC GACGACTGAC
ATCAAGAACG GTTCGCCCGG CTGCACTTGA
 
Protein sequence
MTTREAANRK AHLPPADLGL KPLAASVRTV LAGAFLAGGA AHAGSLPVPA GVFVSSGSAD 
QSVAGNLMTI NQHSDRAILN WKSFDIGAGN TVQFKQPSAS SIALNRIYQN DPSRIFGRLS
ANGQVYLLNQ NGFLFGKGSQ VDVNTLLVST LNITDDTFQR GITKVLDQDG RAALVGDGKV
YRVDDQGQFV LDEKGNRVKT GIEFAEGSSV KTANSGRIIA AAPSVVNKGD LSSPDGQILL
IAATDKVYLQ EAGKDSTLRG LLVEVGTGGE VANIGRALAE RGNVTLMGFA VNQQGRVSAT
TSVRVNGSVR LLAREGASVR REGDAWLLQA NRTKRSAPLD DGLGTRATVT LKGGSKTSAN
PDLNDPATAV DGQAQDASWV EIMGHQVRIE NGAQLVSRSG KVTVTATENP ANPGLDNVKN
DARVYVDKGA TIDVSGIKDV SVPMERNVVT VELRSNELRD SPLQRHGVLY GKKIRVDIRK
GTPIADISGE LERIARTVAE RSTAGGTIKL TSEGDAILQR GALLDFSGGS VAYRSGYIDT
TQLLTPDGKT VDIGSADPNQ TYAGIFGQVT QKFKAWNVTK TWDIIGPRNL GRYEQGYVEG
KAAGTLDIKA AALALEAEMR GAATAGLHQR EAGTQPAGGT LKIDLARSPD STQSVIFGSA
PGSLGIGKDQ PFPQDPEKPG QPAALVLSGN KLRDSGIMYA DIKTNGKVAI RSGENLAMTD
GGSLALTGGE IKVDGTITAH AGEVDLSTRL TSATQGKLSG AIDLGAGATI DVSGQWINDR
PADATGHSGQ DRSRVLVNGG TVSAKAEGDV NLAAGSRIDV SGGARRTGKD SIKAGDAGSI
SLEAAAVDGS DLKLKGTLEG YAFAGGKGGA LSLVSDQVIL GNATDIGTAK GADPLVLTPD
FFGQGGFAHY SVGSNKSGVT VSDGTRIQLS VQNRVIDPTA VRRVSGSNFR DFAQVELLPE
LSRQAGELDL VLAQKVGQGG KDAAVRIGDG AVIHTDAGGK ISLQSDSSIF MNGTLEAAGG
DVAMTVTPPA GTDPGFKADQ GIWIGSGAKV DVSGTALLYS DRPDHMTGKV LDGGTITLHA
DRGFVVADEG SELSVSGTQG RLDVPVRGPN GAISQERRQI ASSGGSIAIR AAEGIQLYGK
LDGRAGDGTG AAGGGLSLEL NPNTRAEPDE IGQGQTPFPK VPSVISLNQS GPAGTATVQG
EAVASNRYGQ AVLSAGQVER GGFGELTLRT PGRIELGSGL NLHTERSIVL DAPVLAFAPA
TGATAGKVSL DSAYVALGST QTRPGNAAPT TGNGSLQVNA GLIDLVGTTA LQGFDGANLK
SAGDLRLIGV RTTQQQRDFL GEFLAAGDLT LTADQIYPST LSDFRIAVRG TSDGTLTVKP
GDSKGGTVLS AGGKLTLEAP NIVQGGTVKA PLGQLNFQAS NRLELAAGSV TSNSAKGAII
PFGRTQGGLD WIYPLGDQNL VFTAPPEKKL VLDGPKVDIA DGSVIDTRGG GDLFAFEFVP
GPGGSYDLLD PTSQGYQEAY AVLPSFKGTT APYDPLESAT SGLKVGDSVY LSGGGGLKAG
YYVLLPAHYA LLPGAYLVTP DKNVTNIVPG LDLQRADGAP IVAGYRTVSG TGIRDALWSG
FAVEPGSAAR TRAEYSTYGA NAFFSAKAAK DETALPYLPR DAGSLFISAE TALSLDGRVS
AKAGAKGRGG RLDIDATNIA IVSQGNAGQA TGSAIRLVAE KLNGLGVSSI AIGGVRNTVD
GTVTVDTHAS TISLEQNARL KGSEFILTAK DNITLAKGSG ITAEGPSKSV GTPKVYKVDG
DAAFLRVSTS DQADLQRSGV SGQSGSIRIG AGATLATSNS MIVDATANMD LSGRLLAQGG
SVSLGASRIG LGADASFQNG LALSQTALNA LQANELRLNS GSDISLYGPV ELTSQRIVLR
SGGLLGFDNA GQTASIKAGD IRLENPLAAS TSRTGTGTGT LSLVADTVEL GGGAYALEGY
SKVTVEAGKA IVGSGTGTLT ASADLDFKTP VVTGDRGADT RIDATGHAVA IQSSGAAQAA
ADALGAQLSI TADSIRNAGT IALKSGVVKL DALKGDVVLA AGSNIDVSGR EVSIGKANVK
TDGGAVELSS RTGSVALESG AKLALNGSKG GELQVSAAAG GFRFDGSIDA RGTERGGRFG
LDVHTLENGG DVGGMAGKLA SAGFSDGISL RARTGDLHLN TGDTLAARTI ELAADQGSVR
IDGSLKAQGD NASLDLKAGG GLTLAAGADL EAHGSTSQGG RIALESMDPG PQGGITVAAG
ARIDVSATDG SANGTVNLRA LRTGQDVAVS GNLGSSVTGA RETTVEAVRI YDHSGTIGST
DIAAWKTDTD AYMANAGAIE SRLGLPGGLR AGLEIRSSGD LTLGSAGWDL VDWRYGGRPG
VLTLKAAGQL SIDGKLSDGF RDDPNGIDVS GILGPGATVA VKDMLQTGSS WSYRLQAGTD
VVVGADVAVR TGTGDIDVDA GRDVVLTNAS SSIYTAGRPT DTQRYGNFKN GFVAFQFYGE
YPVDGGDIHI SAGRDVIGAK TGQFFDGWLV RTGNWTDGTS HQGETPTAWA VAIGGPVGTS
AQQGSFQQNI GALGGGNITV EAGRNVSDLS AVIATTGKQL GTPSKPNDPS DTGFNTNEVQ
ISGGGNLTVR AGGDVLGGTF YTGKGMGEIS ARGAIKASTT GLGPVLALGD SRFSLNAGQD
IELGAAINPT VINSATARNF FFTYSDRSGI ALESLAGNVR LQNDIPGMVS AVNNLRSTRN
QLSFPGASLS ALGVYPASLD VTALQGDIVL ERSFTTYPAA QASFNLMAGG NIGSGSVGDN
VNVTQSDADP ALLPGIAHPT RSWDDASQRL QPFGAANLIH AQVPVHRGDS EAARIYANGN
IASVDPLLLV LPKAVDVMAG RDLFDVSLHV QHPDYAMSTI TAGRDIRFTS PRNAQGNLVN
LTREIRLAGP GQLWVSAGRN IDLGASEGIY TIGNTENRTL PDNGASITVM AGLNGNQARF
DKFAEKYDPT SARYRTLLRD YMRRRTGNGR LDYAGAVDAY RALPGDQQHE FLLAILFEEI
RISAAQAAKT GSKSAYDRGF AAIDTLFPQS GDTHYKGNLS LFFSKIHTVD GGDINLLVPG
GGVNAGLAVA FAGSKAASDL GIVAQRQGAV NALVNGNFMV NQSRVFAMDG GDITIWSSNG
NIDAGRGAKS AIAVPPPRIT FDERGNLQVE FPPVVSGSGI RTAASTAPVP GDVFLAAPRG
VVDAGEAGIG GTNVAIAATA VLGASNIQVG GTATGVPSTN VSVPVVPAGA AAAAGAATQA
AMQSTVSDSE EKSEPKVANS SGLNPLKVEL LGFGECSTTD IKNGSPGCT