Gene M446_5785 details


Gene Information

Locus tag: M446_5785
Symbol:
ID: 6131097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 6345910
End bp: 6360477
Gene Length: 14568 bp
Protein Length: 4855 aa
Translation table: 11
GC content: 73%
IMG OID: 641645893
Product: filamentous haemagglutinin outer membrane protein
Protein accession: YP_001772507
Protein GI: 170743852
COG category:
COG ID:
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
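
The start and end coordinates, gene length, and protein length in this record are mutually consistent, and a short check makes the arithmetic explicit. The sketch below assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


length = gene_length_bp(6345910, 6360477)
print(length, protein_length_aa(length))
```

With the values from this record the check gives 14568 bp and 4855 aa, matching the Gene Length and Protein Length fields.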


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.352944
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGTGC GCTCCCTCGC CCCCCGCCTC CTCGCCGCGC TGCTGGCCAC CACGGCCCTG 
ACCCGGACGG CCCCGGCCCA GACCCTGCCG AGCGGCGGGC AGGTCGTCGC CGGCAGCGTC
AGCATCGGCG CGCCGCGGAA CGGCGCCCTC ACCGTCACCC AGGCCAGCCC GAACGCCATC
GTGAACTGGC AGGGTTTCTC GATCGGCCAG GGCGGCCGGG TCGAGTTCCG GCAGCCGGAC
GCGCAGGCGG CGATCCTCAA CCGGGTGACG GGGACGACGC CCTCGACCAT CGCCGGGCAG
CTCACCGCCA ACGGGCAGGT CTACCTCGTC AACCCGAACG GCATCGCGAT CACCCCGACC
GGCCGGGTCG AGGCGGGGGC CTTCGTGGCC TCCTCGCTGG GCATCACGGA CGGCGACTTC
CTGGCCGGGC GGCGGTCGTT CCGGGGCTCG GGCGCCTCGG CGCCGGTGAC CCAAGCGGGG
ACGCTGACCA TCGCCCGCGG CGGCTACGCG GCGCTGATCG GCGGGGAGGT GGCGAATTCC
GGCGTGATCA CGGTGCCGGC CGGCAAGGTC GGGCTGGGCT CGGGCGAGGA GGCGACTTTG
GACCTCGCCG GGGACGGGTT CCTGCAGGTG GCGGTGCCGA CGCGGCGGGC GGGCGCGGGC
GCGCTGGTGG CCCATTCGGG GGTGATCTCG GCCGAGGGCG GCACCGTGAC CCTGCGGGCG
GCGGCGGCGC GGGAGATGGC GCGGCAGGCC GTGAACCTGT CCGGCGTGGT GGAGGCGCGC
AGCTTCACGA CCCGGGGCGG CACCATCCTG CTCTCGGGCG GGGACGGGCA GGTCGAGGTC
GGGGCGGCGG CGCGGCTCGA CGCCTCGGGC GGCCCGCAAG GGGGCCGGGT GCGGATCCGC
GGCCGGGAGG TTTCGGTGGC GGGCCGGATC GAGGCCTCGG GGGCGCGGGG CGGCCGGGTC
GGGATCCGGG CGGGCGAGAG CCTGGCGCTG AGCGGCGTCC TCGCGGCGGC GGGCCGGTCG
GGTGCGGGCG GGAGCGTGGT GGCGACGGCG CCCGCCATGG CGCTGAGCCT GGCGCGGATC
GACGTGTCGG GGGCGGCCGG GGGCGGGCGG GTGCGGCTCG GGGGCGGGCG CCAGGGCCAG
GGCGGGCTGG CGCACGCCAC GACGCTGAGC GTGGATGCGG GCAGCGAGAT CCGGGCGGAG
GCGACGGCGC GCGGCGCGGG CGGCGACGTG GTGCTGTGGT CGGACGGGCG GACCGACTTT
TCCGGGACGA TCGTGGCGAC GGGCGGCCGC CTGGGCGGGG ATGGCGGCGC GGCGGAGGTG
TCGTCGAAGG GGGTGCTGGC CTATCGGGGC CGCACCGACC TGACGGCGCC GGCGGGTGCG
GCGGGGACGC TGCTGCTCGA CCCCTACGAC CTGACGATCT CGGGCGCCGC CGATTCGGGC
CTGTCGGGCT TCGCCGCGGC GGCGAACGAC AGCGTGCTGA GCGTGGGCAC GCTGCAGACC
GCGCTGGGCA GCGCGAACGT GACGGTGACG ACCGGGACGG GCGGGAGCCA GGCGGGCGAC
ATCACGGTGG CCGACCCGGT GACGTGGAGC GGCGGCACGA CGCTGACGCT GGCGGCGGCG
CGCCACGTGG CGGTCAACGC GAACCTCACC GGGGGGACGG GCGCGCAGAT CGTGCTGCGG
GCGGATGCGG CCGGCAGCGG CACCGGCACG GTGACCTTCG GGTCCGGCGT GCAGGCGACG
GCGTCGGGCG GCGTGGCGCT GTACTACAAC CCGGCCGATT ATGCGACGCC GACGAATTAC
GGGCCGAATG TCGGGACGGG GACGGTTCTG AGCGCCTGGA TGCTGGTGAA TACGGCGCAG
AACCTGCAGG ACATCAGCAA GAACCTGTCG GGCGGCTACG CGCTCGGCCG GGACATCGAT
GCGAGCGAGA CGGCGGGCTG GAACAACGGG AGCGGGTTCG CGCCGCTTGG GAGCGACACC
CGTCGGTTCA AGGGCCTGTT CGACGGCCAG GGTCACGTGA TCACCGGCCT CCTGATCAAC
CGCGGGGGCA CGTCCAATAT CGGTCTGTTC GGTGTCGTGG ATGGGAGCGC GGCGGTCCGC
AACGTCGGGC TCGTCGGCGG CAGCATGACC GGCATCGGCA ATGTCGGCGG GCTGGTCGGG
ACCAATTACG GCAGCATCAG CCAAGCCTCT GCCAGCGTCA GCGTGACGGG CGGCAGCGCT
CTCGGCGGTC TGGTCGGGGC GAATTCGGGC AGCATCACCC AGGCCTACGC CACCGGCAGC
GTGACGGGCG CCAACACACT CGGCGGCCTC GTCGGCAACA ATGCCGGCAG CATCAGCCAA
GCCTACGCGA CCGGCAGCGT GACGGGCGCA AGCACGGTCG GCGGCCTCGT CGGGAACGTC
ACTTCCGGCA CCATCACCCA GGCCTACGCG ACCGGCAGCG TGACGGGCGC AAGCACGGTC
GGCGGCCTCG TCGGGAACGT CACTTCCGGG AGCATCGGCC AGGCCTACGC CAGCGGCCGC
GTGATGGGCA CGGACACCGT CGGCGGCCTG GTCGGCAGCC TGCAATCCGG CGTCACCGTC
ACCGCCTCCT ACTGGACCAC GCAGACCACC GGCCAGGGAA CCAGCGCCGG GGGCATGGGC
ACCGGCCTGA CCACTGTGGG GGCGCGCCAA ACGTCGAGCT ACACCGGCTG GAACTTCACG
ACGGATTGGT ACCAGGCCGG TGACCTGCGG CCGATCGGGC GCTGGGAGGC GGCGCAGCCG
GGGGCGGACG GGGTGATCCC GATCGCCACC CTGCACCAGC TGCAACTGAT GGGGGCGAAT
CTCGCCGGCC GCTACCGGCT CACCGCCGAC ATCGATGCCG GCGCGACGAT GAGGGCGGCG
GCGGGGGCCG ACGCGAGCGG GGTGTTCGGC CCCGGCGGCT TCGTGCCGGT GGGCGATTAC
ACCGCGCCGT TCAGCGGCAG CCTCGACGGC GCGGGCCACG TCATCACCGG GCTGACCATC
GCCCGGTCGA GCCAGAGTTA CGTCGGCCTG ATCGGCCAGC TCGCGCCGGG CGGCGGTCTC
ACGCGGATCG GGCTGGTCGG CGGCAGCGTG TCGGGCTCCT CCAATGTCGG CGGCCTCGCG
GGTTCCAACG CCGGGACGAT CAGCCAAGCC TACTCCACGG CGTCCGTCTC CGCCTCCCGC
AGCGCGGCCG GTGGCCTCGT GGGTTACAAT TCCGGGATGA TCAGCCAAGC CTACGCGACG
GGCAGCGTGT CGGGCTGGAC CCAGATCGGC GGCCTCGTGG GTTACAATTC CGGGCCGATC
AGCCAAGCCT ACGCGACGGG CAGCGTGACG GGCTCGAACC AGATCGGCGG CTTGGTGGCT
TATAACGGCG GGACGGTCAG CGAAAGCTAT GCCTCGGGCC CTCTGAGCGG CACAAGCGAC
ATCGGCGGCC TGATCGGGAC CGGCACCGGC AGCGTGACCT CCTCCTATTG GGACACGAAC
ACGACCCTGC AGGCGAGCAG CGCCGGAGGA GGCACCGGCC TGACCACCGC GCAGGCGCGC
GCCCAGGGCG GCTACGCGGG CTGGAACTTC ACGACGGATT GGTACCAGAC CGCGGACCTG
CGGCCGATCG GGCGCTGGGA GGCGGCGCCG GCGGGCGCGG ACGGGGCGAT CCCGATCGCC
ACCCTGCACC AGCTGCAGCT GGTGGCGACG AATCCCGCCG GCCGCTACCG GCTCGCCGCC
GACCTCGACG CGCGCGCCAC CGCGGGCACC AACGCCGCCG ACCTCTGGGG CGGGGGCGGC
TTCGTGCCGA TCGGGGACGC CTCGACCGGG TTCGCCGGCA CCTTCGACGG CGGCGGCCAT
GCGATCGCCG GGCTCACGAT CAACCGGCCC ACCCTCGACA ACGTCGGCCT GTTCGGCGTC
ACCGGCGCAG GCAGTCTCGT GCGCGGCATC GGCCTCGATG GCGGCAGCGT GACGGGCGGG
AGCACGGTCG GCGGCCTCGT CGGAGCCAAT TCCGGCCGCA TCACCCAGGC CTCCGCGACC
GTCAGCGTGA CGGGCACCCG CACTGTCGGC GGCCTCGTTG GGTACAATAG CGGCAGCATC
ACCCAGGCCT CCGCGACCGG CGGCGTGACG GGCACCCGCA CTGTCGGCGG CCTCGTTGGG
ACCAATGGCG GCAGCATCAC CCAGGCCTAC GCGACCGGCG GCGTGACGGG CACCCGCACT
GTCGGCGGCC TCGTTGGGAC CAATGGCGGC AGCATCACCC AGGCCTACGC GACCGGCGGC
GTGACGGGCA CCAGCACTGT CGGCGGCCTC GTCGGGGCCA GCGCCGGCAG CATCCTCCAG
ACCTACGCCA GCGGCCACGT GACGGGCACG GACGGTGTCG GCGGCCTCGT CGGGTCCAGC
AATGGCACCG TCCTCGCGTC GTTCTGGGAC ACGGAGACCA CCGGTCAGGC GGCCAGCGCC
GGAGGAAGCG GCGCGACGGG TCTGACCACC GCGTCGGCGC GCCAGGCGGC GAGCTACGCG
GACTGGAACT TCACGACGGA TTGGTACCAG GCCGGTGACC TGCGGCCGAT CGGGCGCTGG
GAGGCGGCGC AGCCGGGGGC GGACGGCACC GCCCTCATCA CCAACCTGCA CCAGCTGCAG
CTCGTCGCCG CGAACCTTGC CGGGTCCTAC GTCCTGGCCG CCGATCTCGA CGCGAGTGCC
ACGGCGGGCG CGGGTGCCGC CGACATCTGG GGCCCGGGCG GCTTCGTGCC GATCGGGACC
CCGTCGGCGC GGTTCACGGG CCTGTTCGAC GGCCGCGGCG GCGTGATCCG GGGGCTGACG
ATCAACCGGC CCAGCATCAA CGTAGTCGGC CTGTTCACCG CCACCGGCGC GGGCAGCCTC
GTGCGCGGCA TCGGCCTCGT CGGCGGCAGC GTGACGGGCG GCCTCACCGT CGGCGGCCTC
GTCGGGACCA ATTCCGGCAG CATCACCCAG GCCTACGCGA CCGTCAGCGT GACGGGCACC
CGCAATGTCG GCGGCCTCGT TGGGTACAAT AGCGGCAGCA TCACCCAGGC CTCCGCGACC
GGCCGCGTGA CGGGCGGCGT CAATGTCGGC GGCCTGGTCG GCTTCAACTC CGGCGCCATC
CGGCAGACCT ACGCGAGCGG CCGCGTGACG GGCACGGGCG GTGTCGGCGG CCTCGTCGGG
TCCAGCAATG GCACCGTCCT CGCGTCGTTC TGGGACACGG AGACCACCGG TCAGGCGGCC
AGCGCCGGAG GAAGCGGCGC GGCGGGTCTG ACCCCCGCGT CGGCGCGCCA GGCGGCGAGC
TACGCGGGCT GGAACTTCAC GATGGATTGG TACCAGGCCG GTGACCTGCG GCCGATCGGG
CGCTGGGAGG CGGCGCAGCC GGGCGCGGAC GGGGTGATCC CGATCTCCAC CCTGCATCAG
CTGCAGCTGA TGGGGACGAA TCTCGCCGGC CGCTACCGGC TCACCGCCGA CATCGATGCC
GGCGCGACGA TGAAGGCGGC GGCGGGGGCC GACGCGAGCG AGGTGTTCGG CCCCGGCGGC
TTCGTGCCGG TGGGCGATGA GACCGCGCCG TTCAGCGGCA GCCTCGACGG CGCGGGCCAC
GTCGTCACCG GGCTGACCAT CGCCCGGCCG AGCCAGGATG CTGTCGGCCT GATCGGCAAG
CTCGCGTCGG GGGGCGGTCT CACGCGGATC GGGCTGGTCG GCGGCAGCGT CTCGGGCTCC
TCGCAAGTCG GCAGCCTCGT GGGCACCAAC TCCGGGACGA TCAGCCAAGC CTACTCCACG
GCGTCCGTCT CCGGCAGCCA GACCGGCAGC CAAGTCGGCG GCCTGGTCGG GTTCAGCCGC
GGCTGGATCA GCCAAGCCTA CGCGACGGGC AGCGTGTCGG GGACGAGCAA GGTTGGCGGC
CTCGTCGGGT ACAATTCCGC GACGATCAGC CAAGCCTACG CCAGCGGCAG CGTGACGGGC
GACAGCTTTG TCGGCGGCCT CGTCGGGGGC AACGCTGGCG GCAGCATCCA GCAGACCTAC
GCCAGCGGCC GCGTGACAGG CGGAGTCGGC GTGGGCGGTC TGCTGGGCTT CAGCATGGGC
AACGTCTCGT CATCCTTCTG GGACACGGAG AGCACCGGTC AGGCGACCAG CGCCGGAGGA
AGCGGCGCTT CGGGTCTGAC CACCGCGTCG GCGCGCCAGG CGGCGAGCTA CGCGGGCTGG
AACTTCACGA CGGATTGGTA CCAGGCCGGT GACCTGCGGC CGATCGGGCG CTGGGAGGCG
GCGCAGCCGG GGGCGGACGG CCCCGCCCTC ATCACCACCC TGCACCAGCT GCAGCTCGTC
GCCGCGAACC TCGCCGGGTC CTACGCCCTG GCCGCCGATC TCGACGCGGG CGCCACGGCG
GGCGCGGATG CCGCCGACAT CTGGGGCCCG GGCGGCTTCG TGCCGCTCGG CGGCAGCGCC
GCGGGCTTCT CCGGTCGCTT CGACGGCCGG GGGCGCGCGA TCTCCGGCCT GACCATCAAC
ACGCCGACCT CGGACTTCGT CGGCCTGTTC GGCGCGGTGA GCGCGGCCGG CCGCGTCGCC
CGGATCGGCC TCGCCGGCAG CCGCATCATC GGCCGTGACA AGGTCGGCGC CCTCGCCGGG
TCCAGCGACG GGACGATCTT CCAGGCCTCC GCGACCGGCC GCGTCTCGGG CGGCACCGGC
GTCGGCGGGC TGGTGGGCAG CAGTTCCGGC CGCATCGCCC AGGCCTCCGC GACCGGGAGC
GTCTACGGGT CGGGCACCTC GGTCGGCGGG CTCGTCGGCG CGGTGACCGC CGGCACCCTC
AGCCAGGCCT ACGCGACCGG GGAGGTCACT GGCGGCGCCG ATGTCGGCGG GCTCGTCGGC
AGCAACGGCG GCAGCATCGC CCAATCCTAC GCGTCGGGGC TCGTGGTCGC GCTCGCGGAC
CCGTCCAAGG CCGGCGGCCT GGTCGGGACC GGCACCGGCA GCGTGACCTC CTCCTATTGG
GACACGAACA CGACCCTGCA GGCGAGCAGC GCCGGAGGAG GCACCGGCCT GACCACCGCG
CAGGCGCGCG ACCAGGGCGG CTACGCGGGC TGGAACTTCA CGAGCGACTG GTACCAGGCC
GCGGACCTGC GTCCGATCGG GCGCTGGGAG GCGGCGCAGC CGGGGGCGGA CGGCCCCGCC
CTCATCACCA ACCTGCACCA GCTGCAGCTC GTCGCCGCGA ACCTCGCCGG GTCCTACGCG
CTGGCCGCCG ATCTCGACGC GAGTGCCACG GCGGGCGCGG TCGCCGCCGA CATCTGGGGT
CCGGGCGGCT TCGTGCCGAT CGGGACCCCG TCGGCGCGGT TCTCGGGCCT GTTCGACGGC
CGCGGCGGCG TGATCCGGGG GCTGACGATC AACCGGCCCA CCCTCGACAA CGTCAGCCTG
TTCGGCGTGA CCGGCCCAGG CAGCCTCGTG CGCGGGATCG GCCTCGACGG CGGCAGCGTG
ACGGGCAGGA GCAATGTCGG CGGCCTTGCC GGGAAAAATG TGGCCGGTAC CATCACCCAG
GTCTACGCCA GTGGCAGCGT GACGGGCGAC ACTTTCGTCG GCGGCCTCGT CGGGACCACT
TCCGGCAGCA TCACCCAGGT TTTCGCGACC GGCAGCGTGA CGGGCGCCAG CTATGTCGGC
GGCCTCGTCG GGTACACCTC CGGCGGCGGC ATCACCCAGG CCTATGCCAC CGGTAGCGTG
ACGGGCAGCA AGAATGTCGG CGGCCTCGTC GGGTACGCTG GCGGTAGCAT CACTCAGGCC
TACGCCAGCG GCCGAGTGAC GGGCGACAGC GCGGTCGGCG GCCTCGTCGG GGCTGGCAGT
GGCGCCCCCA TCACCGCGTC GTTCTGGGAC ACGGAGACCA CCGGTCAGGC GACCAGCATC
GGAGGAAGCG GCGCGACGGG TCTGACCACC GCGCAGGCGC GCGACCAGGG CAGCTACGCG
GGCTGGAACT TCACGACGGA TTGGTACCAG ACCGCGGACC TGCGGCCGAT CGGGCTCTGG
GAGGCGGCGC AGCCGGGGGC GGACGGGGCG ATCCCGATCG CCACCCTGCA CCAGCTGCAG
CTGGTGGCGA CGAATCCCGC CGGCCGCTAC CGGCTCGCCG CCGACCTCGA CGCGCGCGCC
ACCGCGGGCA CCAACGCCGC CGACCTCTGG GGCGGGGGCG GCTTCGTGCC GATCTGGGGC
GGCTCGACCG GGTTCGCCGG CACCTTCGAC GGCCGCGGCC ATGTGATCGC CGGGCTCACG
ATCAACCGGC CGACCATGGA GGCTGTCGGT CTGTTCAGTG TTGTGCAGAG CGGCGCGACG
GTCTCCAATG TCGGCCTCGT CGGCGGCAGC GTAACGGGCG GCGACACTGT CGGCGGCCTC
GTCGGGGCCA ATTCCGGCCG CATCACCCAG GCCTACGTCA CCGCCAGCGT AACGGGCGGC
CGCACCGTCG GCGGCTTCGT CGGGAGCAAT TCCGGCACCA TCGCTCAGGC CTACGCCGCC
GGCAGCGTGA CGGGCAGCAG CGCTGTCGGC GGCTTCGTCG GGAATAACTG GAATGGTCTC
GTCACCCAGG CCTACGCCGC CGGCAGCGTG ACGGGCAACA CCACTGTCGG CGGCTTCGCT
GGAAGCAACA GGAGGACCCT CACCCAAGTC TATGCCACCG GCCGCGTGAC GGCCAACAAC
CTTTCCGGCG GCCTCGTCGG GTCCAGCGGT GGCACCGTCC TCGCGTCGTT CTGGGACACG
GAGACCACCG GTCAGGCGAT CAGCGCCGGA GGAAGCGGCG CGAGGGGTCT GACCACCGCG
CAGGCGCGCG ACCAGGGCAG CTACGCCGGC TGGAACTTCA CGCGCGACTG GTACCAGGCC
GGCGACCTGC GGCCGATCGG GCGCTGGGAG GCGGCGCAGC CGGGGGCGGA CGGCATCGCG
ACCGTCACCA ACCTGCACCA GCTCCAGCTC GTCAACGTGA ATCTCGCCGG GTCCTACGCC
CTGGCCGGCG ATCTCGACGC GGGCGCCACG GCGGGCGCGA CCGCCTCCGA CATCTGGGGC
AGCGGCGGCT TCGTGCCGCT CGGCAATGGC ATGGGCCCGT TCACCGGGCG CTTCGACGGC
CGCGACCACC GCATCGCCGG CCTGACGATC AACGCGCCCT CGACCAGCTC GGCGGGCCTG
TTCGGGATCA TCGGTCCCAC GGGCGAGGTC CGCAGCGTCG GGCTGGCCGG CGGCGGTGTC
AGCGCGGCCG GCGATGCCGG GGGCTTGGCC GGAACCAACA AGGGCTTCGT CACCAAGGTC
TTCGCCGACA TCACTGCGCG CGCATCGACC TACGGCCGAG CGGGCGGTCT GGTCGGTTCC
AACGCTGGGT CGGGGGCGCT GCGCGCCGTC TACGCCACCG GCGCCGTGTC CGGCAGCGAC
TTGATCGGCG GCTTGGTCGG CAGCAATGCC GGGATCATCA TCCAGGCCTA CGCCACCGGT
CGGGTGTCCG CCAGGAGCGC GGCCGGCGGT CTGGTCGGCA TCAACTCCGG CACCATCCAG
CAGGCCTACG CGACCGGGAG CGTCACGGGC GGCACGACTC CCGACGCTCC CGTCGGCGGT
CTGGTCGGCA CCAACGCCGG CGCCATCCGG CAGACCTACG CGACCGGCAG CGTGACGGGC
ACCAGCACTG TCGGCGGCCT GGTCGGCAGC CTGCGATCCG GCGCCACCGT CACCGCCTCC
TACTGGGACA CGGAGGCCAC CGGCCAGGCG ACCAGCGCCG GGGGCATGGG CACCGGCCTG
ACCACCGCGC AGATGCTCGA CACCCCCGGC ACCGCCGGCG GGTTCACCGC CACCGCCACG
GCGGTCGGAT GGGATTTCAC GACCGTCTGG GCGCGGCCGA ACGCCTCCGC GGCCCAGTCG
AGCGACGGCA AGACGCACAC CGCCGAACTC TACGCCACCT CGGGCGTGGT GGCGCTCGAT
GCCTCGGCCA GCATGACCTA CGGCGACACG CCGCCGACCC TCGCGCCGAC CGTGTACGGC
TCGGGGAGCG TCTTCGGCAA CGTCGTGTCC GCGCTCCCGA GCGTGACCTC GAGCGTGACC
GCACAGAGCA ATGCGGGAAC CTACGCGATC GGTCTGTCCG GGGGCAGCGG CACCTCCTGG
GGCGGGCGGA CCACCCGGTT CGTTTCGCCC GGCTCGGTCA CGGTGGACCC CAAGACGCTG
ACGGTGTCCC TGACCGGCAG CGTCACCAAG ACCTATGACG GGAGCGCGAG CGCCAGCCTG
ACGGGGCTCG GCCTCAACAT CGTCAGCGGC CGCATCGGCC AGGACGACGT CCAGGTCGCG
GGGGCGAGCG CCTCCTACGC GGATGCCAAG GCGGGCGCGG GCAAGTCCGT GACGGTCTCG
GGCCTGACCC TGTCGGGGGC CGCGGCGGGC AATTACACGC TGGGCACCAC GAACCAGGTC
TCGGCGGCGA TCGGCTCGAT CGACAAAGCG ACGCTGGCGG TCTCTCTGAC CGGCGTGGCG
CGCAAGACCT ACGATGGCAG CGCGAGCGCC GGCCTGACGG GGCTCGGCTT CGACCTCGGC
AGCAGCCGCA TCGGCCAGGA CGATGTCCAG GTCGCGGGGG CGAGCGCGGT CTACGCGGAC
GCCAAAGCCG GGACGGGCAA GTCCGTGACG GTGTCGGGCC TGACCCTGTC GGGGGTGGAC
GCGGACAACT ACACGCTCGC CGCGCCCACG GCCTCGGCGG CGATCGGGAC GATCGACAGG
GCGACGCTGG CGGTCTCGCT GACCGGCTCG GCCAGCAAGA CCTACGACGG CCGCACGAGC
GCCGGCCTGA CGGGGCTCGG CCTCAACATC GTCAGCGGCC GCATCGGCCA GGACGACGTC
CAGGTCGCGG GGGCGAGCGC CTCCTACGCG GATGCCAAGG CGGGCGCGGG CAAGTCCGTG
ACGGTGTCGG GCCTGACCCT GTCGGGGGCG GACGCGGACA ACTACACGCT CGCCGCGCCC
ACGGCTTCGG CGGCGATCGG TACCATCGAC AAGGCGACGC TGACGGTGTC GCTGACCGGC
GTGGCGCGCA AGACCTACGA CGGCCGCGCG AGCGCCGGCC TGACGGGGCT CGGCTTCGAC
CTCGGCAGCA GCCGCCTCGG CCAGGACGAC GTCCAGGTCG CGGGGGCGAG CGCGGTCTAC
GCGGACGCCA AGGCCGGGAC GGGCAAGTCC GTGACGGTCT CGGGCCTGAC CCTGTCGGGG
GCGGATGCCG CCAACTACAC GCTGGGCAGC ACGAGCCAGG TCCAGGGTAC GGTCGGCACG
ATCGACAAGG CGACGCTGGC GGTGTCGCTG ACCGGCGTGG CGCGCAAGAC CTACGACGGC
CGCACGAGCG CCGGCCTGAC GGGGCTCGGC TTCGACCTCG GCAGCAGCCG CTTCGGCCAG
GACGATGTCC AGGTCGCGGG GGCGAGCGCC TCCTACGCGG ATGCCAAGGC CGGGACGGGC
AAGTCCGTGA CGGTGTCGGG CCTGACCCTG TCGGGGGCGG ACGCGGACAA CTACACGCTC
GCCGCGCCCA CGGCCTCGGC GGCGATCGGC ACCATCGACA GGGCGACGCT GGCGGTCTCG
CTCGGCGGCG CGGTGCGCAA GGTCTATGAC GGGACGGTCG CGGCGACGGT GGCGCCGGGC
CAGCTGAGCC TCGGCGGCGT GGTGGGCCAG GACGTCGTGC AGGCGTCGGG CCGCGCGGTC
TACGCGGACG CGAAGGCCGG GACGGGCAAG TCCGTGACGG TGTCGGGCCT GACCCTGTCG
GGGGCGGACG CGGACAACTA CACGCTCGCC GCGCCCACGG TCTCGGCGGC GATCGGCACC
ATCGACAGGG CGACGCTGGC GGTCTCGCTG ACCGGCGCGG CCCGCAAGAC CTACGACGGC
AGCACGGCCG CGACGCTGAG CGCCGCGAAC TACGCGCTCA CGGGCCTCGT CCCCGGCGAC
GCGGTCTCGG TCGCGGGCAG CGCGGTCTAC GCGGATGCCA AGGCCGGGAC GGGCAAGCTG
GTGACGGCGT CGGGCCTGAC CCTGTCGGGC GCGGATGCGG GCAACTACAC GCTGGGCTCC
GCGACCGAGA TCTCGGCGGC CCTCGGCACC ATCGACAGGG CGACGCTGGC GGTCGCGCTC
ACCGGCGCGG TGCGCAAGAC CTACGACGGC AGCGCGCTCG CGATGCTGGG CGCCGGGAAC
TTCGCGCTCG CGGGCGTCGT CGCAGGCGAT GCCGTCACGG TCTCGGGCGC GGGTGGCACC
TACGACACCG GGAATGCCGG GACGAACAAG CTGGTGCGCG CGAGCGGCCT CTCGCTCGCG
GGCGCCGATG CGGGCAATTA CGTCCTGGAG CGGACCTCCC TGTCCGGCGG GATCGGCACG
ATCGACCCGG CCACGCTGAC GGCCTCCCTC CGGGGCAGCG TGAGCAAGAC CTATGATGGC
TCGACCGCCG CGATCCTGGC CGCCGGCAAT TATGCCCTGG GGGGTGTGGT CGGGGCCGAC
GAGGTCAACT TGGTGTGGCC GGCGAACGGT GTTTACGACA CCAAGGACGC CGGCACGGGC
AAGACCGTGA GCGTGTCCGG CATCGCGCTC GCGGGCTCGG CGGCGGGCAA TTACGTCCTG
GCGCAGACCT CGCTGTCGGC CGCGATCGGC AGGATCCTGC CCGCTCCGTT GACGGTCACG
GCCGGCAACG CCGCCAAGAC CTACGACGGG CGGGTCTATT CGGGCGGGAA CGGCGTGTCG
TACGCGGGCC TGGTCGGTGG CGAGGATGCC TCCGTGCTCG GCGGCAGCCT GACCTATGGC
GGGTCTGCCC AGGGGGCGCG GAACGCCGGC AGCTACGCGA TCACGCCGGC CGGCCTGACC
TCCGGCAACT ACGCCATCAC CTACGCACCC GGCACCCTCA CGGTGACCAA GGCTCCGCTG
ACGGTGACGG CCGGCAATGC CGCCAAGACC TATGACGGGC GGGCCTACTC GGGCGGGAAC
GGCGTGTCGT ACGCGGGCCT GGTCGGTGGT GAGGATGCCT CCTTGCTCGG CGGCAGCCTG
ACCTATGGCG GGTCGGCTCA GGGGGCGCGG AATGCCGGCA CTTACGCGAT CACGCCGGCC
GGCCTCACCT CCGGCAACTA CGCCATCAGC TACGCGGATG GCACCCTGAC CGTGACCAAG
GCTGCCCTCC TCGTCACGGC CGGCAACGCC GCCAAGACCT ACGACGGGCG GGCCTACTCG
GGCGGGAACG GCGTATCGTA CGCGGGCCTG ATCGGTGGTG AGGACGCCTC GGTGCTCGGC
GGCAGCCTGA CCTATGGCGG GTCTGCCCAG GGGGCGCGGA ATGCCGGCAC TTACGCGATC
ACGCCGGCCG GCCTGACCTC CGGCAACTAC GCCATCACCT ACGCACCCGG CACCCTGACC
GTGACCAAGG CCCCGCTGAC GGTGACGGCC AACGGCCTGA GCAAGACCTA CGACGGGCAG
GCCTTCTCGG GCGGGAACGG CGTGTCATAC GCGGGCCTGG TCGGTGGTGA GGAGGCCTCG
GTGCTCGGCG GCAGCCTGAC CTATGGCGGG TCTGCCCAGG GGGCGCGGAA CGCCGGCAGC
TACGCGATCA CGCCCGCCGG CCTGACCTCC GGCAACTACG CCATCACCTA CGCACCCGGC
ACCCTCACGG TGACCAAGAC CCCGCTGACG GTGACGGCCA ACGGCCTGAG CAAGACCTAC
GACGGGCAGG CCTTCTCGGG CGGGAACGGC GTGTCGTACG CGGGCCTGGT CGGTGGCGAG
GATGCCTCCG TGCTCGGCGG CAGCCTGACC TATGGCGGGT CGGCTCAGGG GGCGCGGAAC
GCCGGCAGCT ACGCGATCAC GCCGGCCGGC CTGACCTCCG GCAACTACGC CATCACCTAC
GCACCCGGCA CCCTCACGGT GACCAAGGCT CCGCTGACGG TGACGGCCGG CAATGACGCC
AAGACCTATG ACGGGCGGGC CTATTCGGGC GGGAACGGCG TGTCGTACGC GGGCCTGGTC
GGTGGTGAGG ACGCCTCGGT GCTCGGCGGC AGCCTGACCT ATGGCGGGTC TGCCCAGGGC
GCCCGGAATG CCGGCAGCTA CGCGATCACG GCCGCCGGCC TCACCTCCGG CAACTATGGC
ATCACCTTCG TCGAGGGCAT CCTCACGGTC GCGCCGCGGC CGCTGACCGT CGCCGCCGAT
GCCCAGAGCC GAGCCTCCGG CCAGCCCAAC CCCGTCCTCA CCTACGCGGT GTCGGGACTC
GGGCTGGTGG GCGGCGATGG CCTGGCGGGC CAGCTCGCCA CCCCGGCGAC GCCGGACAGC
GCACCCGGCT CCTATCCGAT CACCCAAGGT ACGCTGGCCG CATCACCCGA CTACGCGCTC
ACCTTCCTGA ATGGCACCCT GACCGTGACG GAAGCGGTCG CGGCGGCCGG GTCCGCGCCG
CCGGTCGGTT CGCCCGTCAC CGCCAGCACC GTGACCCAGG TTCTGACCCT CAATCAGAGC
CTGACGCCCT ACACTCCGCC GGTCTTCCAG GGGGCGGGCC TGACCAGCTC CCAGGGAAGC
CCGCTCAGCG ATCCGCGCTT CGACACGCCC GTCGCCTGCC TGAGCCAGGC CGCCTGCTAC
ATCACCCCGG CGGCCCCCCA GACCGGCTCC CCCTCCGCCG GCCGATGA
 
Protein sequence
MPVRSLAPRL LAALLATTAL TRTAPAQTLP SGGQVVAGSV SIGAPRNGAL TVTQASPNAI 
VNWQGFSIGQ GGRVEFRQPD AQAAILNRVT GTTPSTIAGQ LTANGQVYLV NPNGIAITPT
GRVEAGAFVA SSLGITDGDF LAGRRSFRGS GASAPVTQAG TLTIARGGYA ALIGGEVANS
GVITVPAGKV GLGSGEEATL DLAGDGFLQV AVPTRRAGAG ALVAHSGVIS AEGGTVTLRA
AAAREMARQA VNLSGVVEAR SFTTRGGTIL LSGGDGQVEV GAAARLDASG GPQGGRVRIR
GREVSVAGRI EASGARGGRV GIRAGESLAL SGVLAAAGRS GAGGSVVATA PAMALSLARI
DVSGAAGGGR VRLGGGRQGQ GGLAHATTLS VDAGSEIRAE ATARGAGGDV VLWSDGRTDF
SGTIVATGGR LGGDGGAAEV SSKGVLAYRG RTDLTAPAGA AGTLLLDPYD LTISGAADSG
LSGFAAAAND SVLSVGTLQT ALGSANVTVT TGTGGSQAGD ITVADPVTWS GGTTLTLAAA
RHVAVNANLT GGTGAQIVLR ADAAGSGTGT VTFGSGVQAT ASGGVALYYN PADYATPTNY
GPNVGTGTVL SAWMLVNTAQ NLQDISKNLS GGYALGRDID ASETAGWNNG SGFAPLGSDT
RRFKGLFDGQ GHVITGLLIN RGGTSNIGLF GVVDGSAAVR NVGLVGGSMT GIGNVGGLVG
TNYGSISQAS ASVSVTGGSA LGGLVGANSG SITQAYATGS VTGANTLGGL VGNNAGSISQ
AYATGSVTGA STVGGLVGNV TSGTITQAYA TGSVTGASTV GGLVGNVTSG SIGQAYASGR
VMGTDTVGGL VGSLQSGVTV TASYWTTQTT GQGTSAGGMG TGLTTVGARQ TSSYTGWNFT
TDWYQAGDLR PIGRWEAAQP GADGVIPIAT LHQLQLMGAN LAGRYRLTAD IDAGATMRAA
AGADASGVFG PGGFVPVGDY TAPFSGSLDG AGHVITGLTI ARSSQSYVGL IGQLAPGGGL
TRIGLVGGSV SGSSNVGGLA GSNAGTISQA YSTASVSASR SAAGGLVGYN SGMISQAYAT
GSVSGWTQIG GLVGYNSGPI SQAYATGSVT GSNQIGGLVA YNGGTVSESY ASGPLSGTSD
IGGLIGTGTG SVTSSYWDTN TTLQASSAGG GTGLTTAQAR AQGGYAGWNF TTDWYQTADL
RPIGRWEAAP AGADGAIPIA TLHQLQLVAT NPAGRYRLAA DLDARATAGT NAADLWGGGG
FVPIGDASTG FAGTFDGGGH AIAGLTINRP TLDNVGLFGV TGAGSLVRGI GLDGGSVTGG
STVGGLVGAN SGRITQASAT VSVTGTRTVG GLVGYNSGSI TQASATGGVT GTRTVGGLVG
TNGGSITQAY ATGGVTGTRT VGGLVGTNGG SITQAYATGG VTGTSTVGGL VGASAGSILQ
TYASGHVTGT DGVGGLVGSS NGTVLASFWD TETTGQAASA GGSGATGLTT ASARQAASYA
DWNFTTDWYQ AGDLRPIGRW EAAQPGADGT ALITNLHQLQ LVAANLAGSY VLAADLDASA
TAGAGAADIW GPGGFVPIGT PSARFTGLFD GRGGVIRGLT INRPSINVVG LFTATGAGSL
VRGIGLVGGS VTGGLTVGGL VGTNSGSITQ AYATVSVTGT RNVGGLVGYN SGSITQASAT
GRVTGGVNVG GLVGFNSGAI RQTYASGRVT GTGGVGGLVG SSNGTVLASF WDTETTGQAA
SAGGSGAAGL TPASARQAAS YAGWNFTMDW YQAGDLRPIG RWEAAQPGAD GVIPISTLHQ
LQLMGTNLAG RYRLTADIDA GATMKAAAGA DASEVFGPGG FVPVGDETAP FSGSLDGAGH
VVTGLTIARP SQDAVGLIGK LASGGGLTRI GLVGGSVSGS SQVGSLVGTN SGTISQAYST
ASVSGSQTGS QVGGLVGFSR GWISQAYATG SVSGTSKVGG LVGYNSATIS QAYASGSVTG
DSFVGGLVGG NAGGSIQQTY ASGRVTGGVG VGGLLGFSMG NVSSSFWDTE STGQATSAGG
SGASGLTTAS ARQAASYAGW NFTTDWYQAG DLRPIGRWEA AQPGADGPAL ITTLHQLQLV
AANLAGSYAL AADLDAGATA GADAADIWGP GGFVPLGGSA AGFSGRFDGR GRAISGLTIN
TPTSDFVGLF GAVSAAGRVA RIGLAGSRII GRDKVGALAG SSDGTIFQAS ATGRVSGGTG
VGGLVGSSSG RIAQASATGS VYGSGTSVGG LVGAVTAGTL SQAYATGEVT GGADVGGLVG
SNGGSIAQSY ASGLVVALAD PSKAGGLVGT GTGSVTSSYW DTNTTLQASS AGGGTGLTTA
QARDQGGYAG WNFTSDWYQA ADLRPIGRWE AAQPGADGPA LITNLHQLQL VAANLAGSYA
LAADLDASAT AGAVAADIWG PGGFVPIGTP SARFSGLFDG RGGVIRGLTI NRPTLDNVSL
FGVTGPGSLV RGIGLDGGSV TGRSNVGGLA GKNVAGTITQ VYASGSVTGD TFVGGLVGTT
SGSITQVFAT GSVTGASYVG GLVGYTSGGG ITQAYATGSV TGSKNVGGLV GYAGGSITQA
YASGRVTGDS AVGGLVGAGS GAPITASFWD TETTGQATSI GGSGATGLTT AQARDQGSYA
GWNFTTDWYQ TADLRPIGLW EAAQPGADGA IPIATLHQLQ LVATNPAGRY RLAADLDARA
TAGTNAADLW GGGGFVPIWG GSTGFAGTFD GRGHVIAGLT INRPTMEAVG LFSVVQSGAT
VSNVGLVGGS VTGGDTVGGL VGANSGRITQ AYVTASVTGG RTVGGFVGSN SGTIAQAYAA
GSVTGSSAVG GFVGNNWNGL VTQAYAAGSV TGNTTVGGFA GSNRRTLTQV YATGRVTANN
LSGGLVGSSG GTVLASFWDT ETTGQAISAG GSGARGLTTA QARDQGSYAG WNFTRDWYQA
GDLRPIGRWE AAQPGADGIA TVTNLHQLQL VNVNLAGSYA LAGDLDAGAT AGATASDIWG
SGGFVPLGNG MGPFTGRFDG RDHRIAGLTI NAPSTSSAGL FGIIGPTGEV RSVGLAGGGV
SAAGDAGGLA GTNKGFVTKV FADITARAST YGRAGGLVGS NAGSGALRAV YATGAVSGSD
LIGGLVGSNA GIIIQAYATG RVSARSAAGG LVGINSGTIQ QAYATGSVTG GTTPDAPVGG
LVGTNAGAIR QTYATGSVTG TSTVGGLVGS LRSGATVTAS YWDTEATGQA TSAGGMGTGL
TTAQMLDTPG TAGGFTATAT AVGWDFTTVW ARPNASAAQS SDGKTHTAEL YATSGVVALD
ASASMTYGDT PPTLAPTVYG SGSVFGNVVS ALPSVTSSVT AQSNAGTYAI GLSGGSGTSW
GGRTTRFVSP GSVTVDPKTL TVSLTGSVTK TYDGSASASL TGLGLNIVSG RIGQDDVQVA
GASASYADAK AGAGKSVTVS GLTLSGAAAG NYTLGTTNQV SAAIGSIDKA TLAVSLTGVA
RKTYDGSASA GLTGLGFDLG SSRIGQDDVQ VAGASAVYAD AKAGTGKSVT VSGLTLSGVD
ADNYTLAAPT ASAAIGTIDR ATLAVSLTGS ASKTYDGRTS AGLTGLGLNI VSGRIGQDDV
QVAGASASYA DAKAGAGKSV TVSGLTLSGA DADNYTLAAP TASAAIGTID KATLTVSLTG
VARKTYDGRA SAGLTGLGFD LGSSRLGQDD VQVAGASAVY ADAKAGTGKS VTVSGLTLSG
ADAANYTLGS TSQVQGTVGT IDKATLAVSL TGVARKTYDG RTSAGLTGLG FDLGSSRFGQ
DDVQVAGASA SYADAKAGTG KSVTVSGLTL SGADADNYTL AAPTASAAIG TIDRATLAVS
LGGAVRKVYD GTVAATVAPG QLSLGGVVGQ DVVQASGRAV YADAKAGTGK SVTVSGLTLS
GADADNYTLA APTVSAAIGT IDRATLAVSL TGAARKTYDG STAATLSAAN YALTGLVPGD
AVSVAGSAVY ADAKAGTGKL VTASGLTLSG ADAGNYTLGS ATEISAALGT IDRATLAVAL
TGAVRKTYDG SALAMLGAGN FALAGVVAGD AVTVSGAGGT YDTGNAGTNK LVRASGLSLA
GADAGNYVLE RTSLSGGIGT IDPATLTASL RGSVSKTYDG STAAILAAGN YALGGVVGAD
EVNLVWPANG VYDTKDAGTG KTVSVSGIAL AGSAAGNYVL AQTSLSAAIG RILPAPLTVT
AGNAAKTYDG RVYSGGNGVS YAGLVGGEDA SVLGGSLTYG GSAQGARNAG SYAITPAGLT
SGNYAITYAP GTLTVTKAPL TVTAGNAAKT YDGRAYSGGN GVSYAGLVGG EDASLLGGSL
TYGGSAQGAR NAGTYAITPA GLTSGNYAIS YADGTLTVTK AALLVTAGNA AKTYDGRAYS
GGNGVSYAGL IGGEDASVLG GSLTYGGSAQ GARNAGTYAI TPAGLTSGNY AITYAPGTLT
VTKAPLTVTA NGLSKTYDGQ AFSGGNGVSY AGLVGGEEAS VLGGSLTYGG SAQGARNAGS
YAITPAGLTS GNYAITYAPG TLTVTKTPLT VTANGLSKTY DGQAFSGGNG VSYAGLVGGE
DASVLGGSLT YGGSAQGARN AGSYAITPAG LTSGNYAITY APGTLTVTKA PLTVTAGNDA
KTYDGRAYSG GNGVSYAGLV GGEDASVLGG SLTYGGSAQG ARNAGSYAIT AAGLTSGNYG
ITFVEGILTV APRPLTVAAD AQSRASGQPN PVLTYAVSGL GLVGGDGLAG QLATPATPDS
APGSYPITQG TLAASPDYAL TFLNGTLTVT EAVAAAGSAP PVGSPVTAST VTQVLTLNQS
LTPYTPPVFQ GAGLTSSQGS PLSDPRFDTP VACLSQAACY ITPAAPQTGS PSAGR
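
Both sequences are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any downstream use. The following is a minimal sketch of that cleanup, together with a GC-fraction calculation of the kind behind the GC content field; the helper names are hypothetical, and the snippet takes only a short excerpt of the gene sequence as input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, copied as formatted in the record.
excerpt = "ATGCCCGTGC GCTCCCTCGC"
cleaned = clean_sequence(excerpt)
print(len(cleaned), round(gc_fraction(cleaned), 2))
```

Applying the same two functions to the full gene sequence would be the way to compare against the reported GC content.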