Gene Mext_1662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1662 
Symbol 
ID5831750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1858023 
End bp1868786 
Gene Length10764 bp 
Protein Length3587 aa 
Translation table11 
GC content69% 
IMG OID641367461 
Productheme peroxidase 
Protein accessionYP_001639132 
Protein GI163851089 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2931] RTX toxins and related Ca2+-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCATGG CCGTTAAGCT CAATCTTCAA GATCTGACGT TCATCCTGAA GCAGATCAAG 
ATCGCCGAAG CGCATGCGAG CGGCATCAAG CTCACGGAAC TGCGCCTCGA TGCCGCCGGC
ACGCTTCTCA CGGATCGCGG GCTCTACGAC GCCACCGGCA ACTGGCTGGG CGATACCGCG
GCGCCGAAGG CGATTGCGGA TCCCCACGTG CCCTACGGGC TGCGCACGGT CGACGGCACC
TACAACAACC TCGTGCCCGG CCGCGAGACC TGGGGCTCGT CCGGCCAGCC GATGCCGCAA
CTCTTCGAGC CGACCTACCT CAACGACGCG GACGGGGACA CGATGGCGCT CGGACCGGGC
GCTCCGGTCA TCACGAACAC CAATTACGGG CTGCCCGGCT CGGTCGCCGA TGCCGACCCG
CGGATCATCT CGAACCTCGT GGTCGATGCC ACGCTCGACA ACCCGGCCGC CATCGCCGCG
GCGCTCCGGA TCGCCGGTTC GGAGAACGTC ATCGCCGATC AGCGCGCCAT CACGGCAGCG
CACGAGGCCC TGAAGGCCGC CCAGGCCGCG AGCCCGGCCG GCGATCACGC GGTCCTGCAA
TCGAACCTCG ATGCGCTGCT GGAGCAGACA GGCGTCACCG TCACCAACGG CTCCATCGAC
GTCCTCAACG TCTCGCCCGA CGAGGGGCTG TCGAAGCCGT TCAACGCCTG GATGACCTTC
TTCGGCCAGT TCTTCGACCA CGGCCTCGAT CTAATCTCCA AGGGCGGGAA CGGCACCGTC
TACGTCCCGC TCGCCGCCGA CGACCCGCTC GTGCTCGGAC AGGACGGCCT CGCCGGCACC
GCCGACGACC TCGCGCCGCA TCTGCGCTTC ATGACGCTGA CGCGCGCCGC ACAGGTCGAG
GGTTCGCAGC GCAATGTTAC GACACCCTTC GTCGACCAGA ACCAGACCTA CACCTCGAAC
GCCTCCCACC AAGTTTTCCT GCGCGAATAC GCACTCGTGG ACGGTCGGCC CGTCGCGACC
GGGCGCCTGC TCGGCGGTGC GGACGGAGGC CTCGCCACCT GGGCGGACGT CAAGTTCCAG
GCGCGGACGA TCCTCGGCAT CGAACTGACG GACGCCGACG TCTCGGCCGT CCCTCAATTG
CTGGTGGACG CCTACGGCGA GTTCGTCCGC AGTGTCAACG GCTTGCCGCA GGTGATGGTC
GGGGTGGGGC CTGGGGGGCA GGCCGTCTAT GCCAGCGGCA GCCTCGCCGA GCCGCTGAAG
CTCTCGGCGA TCCAGCTTCC CGTCGGCACC GTGCTGGTGG GTCCCAACGG CGCGCAGAAC
GTCATCGAGG CCGGCGAGAC GGTCGCAGCG GCCCGCACCT TCAACGCGTT CCTCGACGAC
ATCGCCCACA ACGCGGTGCC GGTCGCCGTG AACGGCGTGC TCCGGCCCGA TGCCGACGCG
CTCACGGGCA ATGCGGTTCA GATGAACCCG CAGACCGGCC GCAATCTCGA ATACGACAAC
GAACTTCTCG ACCGGCACTT CGTCACCGGC GACGGGCGCG GCAACGAGAA TATCGGCCTC
ACCGCGGTCC ACCACATCTT CCATTCCGAG CACAACCGGC AGATCGACGC CCATAAGCTT
ACGATCCTGC AGTCGGGCGA TCTCGCCTTT ATCAACGATT GGCTGGCAAC CGACATCGCG
GCGCTGCCGG GCAACTTCGC GCAGATGACG GCGCTGGGTC AGCTCGCCTA CGCCAATACG
CTGTCCTGGG ACGGTGAGCG GCTATTCCAG GCGGCGCGCT TCGCCACCGA GATGCAGTAC
CAGCACCTCG TCTTCGAGGA ATTCGCCCGC AAGATCCAAC CGCTGGTCGA TCCGTTCGTG
TTCAACCCGG TGACCGAGAT CGACCCGTCG ATCTTCGCCG AGTTCGCCAA TACGGTCTAC
CGCTTCGGCC ACTCGATGCT GACCGAGAAC ATGCCGCGGC TCGGGCCGGA CGGGCAGGCG
CTCGACGCCG ACCTCGGCCT GATCGACGCC TTCCTCAATC CGTTGGCCTT CGACAACGAC
GGCGGCCTGT CCCACGACGA GAGCGCGGCC GCGATCATGC GCGGCATGAC CATCGAACGC
GGCAGCGAAA TCGACGAGTT CGTCGTCGGT GCCCTGCGCA ACAATCTGCT CGGCCTGCCG
CTCGATCTCG CCGCCATCAA CATCGCCCGC GGCCGCGATA CGGGCACGCC GACGCTCAAC
GAGGCGCGGG CTCAGCTCTA CGCCGCGACG GGCTCGACCT TCCTCACCCC CTACACGAGC
TGGGTGGAGA TGGCGGCCAA CCTGAAGAAC CCGCTCTCGG TGGTGAACTT CATCGCCGCC
TACGGCACCC ACGGCACCGT TGTCGCCGCG ACGACGCTCG CGGCCAAGCG CGACGCGGCC
ATGGCCCTGG TCTTCGGCGG CGAGGGCGCG CCGACGGACC GCCTCGACTA CCTCAATTCG
CGCGGGAGCT GGGCCGGGCG GGAAACGGGC TTCGGTGCGG TCGATCTCTG GATCGGCGGC
CTGGCCGAGA AGCAGATGCC GTTCGGCGGC ATGCTGGGCT CGACCTTCAA CGCCATCTTC
GAAGCCCAGA TGGAGAACCT GCAGGATGCC GACAGGTTCT ACTATCTCAG CCGCGTCCAG
GGGCAGAACT TCCTCAACGA GTTGGAGCAG AACTCCTTCT CGAAAATCAT GCTCGCCAAT
TCGAGCCTGT CGCTGCCGGG GCCTGACGGC ATCCGCGGCA CGGCGGACGA CATCGTGCCC
CGCCATATCG GTGTTGATGC CTTCGCCGAT TACGACTTCG AGCTGGAGGT GAACGCCGCC
AACCAGCTCG ACCAGAACGG CGCCGCCCCC GGCCGCGACC CGACCGGAAA CGACCCCGTG
CTGGAGGCGA TGGGCCTCGG CAAGGTCGTG CGCGACGATC CCGGCACCGC GGCCGACGAG
GGCGCGAGCG GCTTCCACGC CTCGGTCAAC GCCCTGGTCC GACGCTACGG CGCGGACGGG
AGCCCGACGG GCGCGCTCGT GGACGGCAGC GAGGATGGGG GCGGCGACGC CGGCAGCCCC
GTCACCTGGG CCGATCTCAA GGCAAATGCG GCCAAGCTCG GCATCGCGCT GACCCAGGCG
GACATGCTGG ACGCGCCGGT GCTCCGGATC GGTGCCGATG GGCGCCTCGC CTTCGCTCCC
GGCTCGTCCG TCCCCGAGGC GGTGGCGGTC GCCAACGGCA GCTTCGAGGG CTTGGCGCTG
GTCGCCGGCC AGGAGGGCGT GATCCTCGAC GGCAACGGCA ACTACACCAC GACGAACCCG
GCCGGCTGGA CCATCGCCGG GGGCGTCGGC GGCCTCTTCG CGCCGGCCGA CGCGGTCGTC
GACCCCGCCG GCCGCGACGG CGCCAACGTC GTGTGGCTGC GCGGCGGCGC CACGCTCTCG
CAGGAGGGCG GGACGACCCT CCAGGCCGGC GTCGGCTACA CCTACAGCTT CAAGGTCGGG
GACCGCACCG ACTTCACCTG GCCGGGCGCC GAGGCGCGCC TCGTGGCGGT CGGCGGGGCG
ACCCCCGTGA CCCTCGGGAC GCTGACCCTG ACGGAGCCGG CCGACGGGCA GTGGGGCACC
TTCACCCTCG CAACGGGTGT TGTCCCGTCG GCGCTTGCGG GCCTTCACCT TCGCCTGGAG
ATCCGCAACA CCGGCAGCGG CGATGCGCAG ATCCTCGTGG ACGACATCGA ACTCGTGCGG
ACGGCGCCGG CCTACCGGTC CGATCTGACC CCGGCGCAGA CGCCCGGCTA CGACCCGGCC
GCCGACCCGT TCCTGCGCGA TGGCGCCGGC AACGTCCTGC GCACCGGGCA GTCCATCGCC
AGCCCGGCCG CCGACCTCGA CGCAACCGTC GTCGATCCGG CCGCGCTCAA TCAGCCGTTC
GCCACCGGGC ACTACCTCCG CTTTACCGGC GGCGAGCACG TGCTGGTGGG CGGCACCGAC
GGCAACGACA CCATCATCAC CGATTTCGGC GATGACGGAA TCTGGGGGGA TGCAGGCGAC
GACCGGATCG AGGCGGGCGC AGGCGTCGAT CTCGTGAACG GCGGGGCGGG CGACGATATC
ATCACCGATT CCGGCGATAC CGGCGACTTC CTCAAGGGCG AGGACGGCAA CGACGTCATC
GCCAACTCGA ACGGCATCGA CATCCTGATG GGCGGGCGCG GCAAGGACGC GATCTTCGTC
GGCGTCGATG CGACCGAAGT CTTCGCCGGC GAGGGCGACG ACTTCGTGAT CGGTGGCGAC
GACGCCGACC TTCTGATGGG CAACGAGGGC GACGACTGGA TGGAGGGTGG CGGCGGCTTC
GACACCACCG CCGGCGACAA CTCGGAACTG TTCTTCAACT CGGCCATCAA GGGCCACGAC
GTCATGTTTG CCGGGGGCGA CGAGCATGAT TTCGACGGCG AGTCCGGCGA CGACATCATG
GTCCAGGGCG AGAGCGTGAT GCGCAACGAG GGCATGTTCG GCTTCGATTG GGCGATCTAC
AAGGGCAACC AGATCGCGGC CAATGCCGAC ATGCGCATCC CGATCTTCAC CACCGAAGAA
GCCGACATCC TACGCAACCG CTTCGACAAG ACGGAGGGGC TCTCCGGCTG GCGGCTGAAC
GACACGCTCA TCGGCGACGA CCGGACCGCG GCCGCCAACG CGGATGCCGA GGCGCCTGCC
GGCGCGCCGA TTACGGCGGC AAACGAGGGC GTGTTCTTCA ACGACGGGCT CGATGCGGCC
GGCATCGCCC GGATCGCGGG GCTCGACCAG ATCGTGTCGC TTGCTTCCGG ACAGCAGTTC
TTCGAGGCCG GCAACATCCT GCTGGGCGGC GCGGGGAGCG ATACGCTCCA GGGCAACGGC
GGCAACGACA TCCTCGACGG CGACCGCTGG CTGAACGTGC GCATCAGCAT CCGCAATCCT
GCGGATGCCG GCCAGGAGAT GGCGACGGCC GACAGCCTGA AGCACGTCTT CGACGACAGC
GCCGCCAATC AGGCGCGCGG CTGGGCCGGA AAGTCGCTCT TCGAGCTGAT GATCGACCGG
GTGATCAGCC CGACCCAGCT CGCAATCGTC CGCGAGGTGA TCACCACCGG AGCGACGGCG
GCGGACGTGG ACACCGCCGT GTTCAACGAC ATCCGGGCGA ACTACACGAT CACCCGCGCG
GCGAACGGCA CGCTCACCGT CACCCACACC ACGCTGACCA ATCCCACCGT CGATGACGGC
ACGGATACGC TGCGCAACAT CGAGAAGCTG CGTTTCGCGG ACGGCACGGC GGACGTCGCG
CTGGTTCTCA ACCAGCCCTT CGACAGCCTC ACCATCAGAC CGTTCGACGC GGACGGCGAC
GATTCCTCGA CACTCGTCGC GACGCTGGTG AACCGTGTGA ACGCGACCAC CCGGCCCGTG
ACCCTGCAAT GGCAGGTACT GGCCGACAAC GGGCAGTGGC GCAACGTGAC GGGCGCGGAC
GGCCAGGTCA CGAACGGGGG CACCTTCTTC ACGCCGACCG GAGCGACGGG CGTCGAGGTC
CGGGTGGTCG CGAACTGGAC CAGCACGGTC GCCGGCAATA CCGGCCTGCA GCAGACGGCC
TCCATCCAGT CCGCCTTCGT CGGAACTGCC GCCGCCGAGG ACATCACGGG TTCCGCGACC
CCGAACGTCA TCCTGGGGCG CGACGGCGAC GACGACATCG AGGGCGATGT CGGCAACGAT
GCCATCTACG CCGGCAGCGG CGACGACCGG GTCGATGGCG GCGAGGGCGA CGACACCCTC
CTCGGCAACG ACGGCGCCGA CACCCTGATC GGCCGGACCG GCAACAACAC GCTCGACGGC
GGCGACGACG ACGACCAACT CTCCGGCGGG CACGGCAACG ACCGTCTGAT CGGCGGCGCC
GGCACCGACA CCGCCATCTA CTCGGGACCG ATTGCGGCCT ACAGCTTCGA GCGGAACGCG
CAGGGCGAGG TCGTCGTCAG CGACAACCTC GGAGCGGAGG GCGACGGCGT CGATACGCTC
ACCACGATCG AGCAGATCCA GATGGGCAAC GACCTCACGC CCTACGCGCT GGTCGCCAAC
GGCACGGCGG CTATCGACAT CGTGGTCGGG ACGGCCGGCA ACAACACGCT CAGCGGCGGC
GCGGGCAACG ACCTCGTCTT CGCGGGAGCC GGCAACGACA ACGTCCTGTG GCGGACCGGC
GACGGCCGCG ACTTCGTCGA TGGCGGCGCC GGCACGGACA GCTTCCGGAT CATGAACGGG
ACCGGTCCCG TCCAGCAGCT CACGCTGGCC CAGGCCCGGG CGCAGTTCGC GAACCTGTCG
TTCCGTGACG ACACGCAGAC CGTGGTCGTG CGCAACGGGA TCGTCATCGC CGAGCTCAAG
AACGTCGAGC AGGTCGCCGT GAACGCCGTC GCGACCGGGG CGCCCGTCGT CACGGACCCG
ACGCCGACAA ACGGGCTCGT CTCGCCGACG GAGGGCCAGC CGCTCGGCGC GCTGGTCGCC
GCGATCCAGG ACGCCGACGG TCTTGGGACC TTCTCGCTGC GCTGGCAGCA ATCGGGCGAT
AACGGCCAGA CCTGGACGGA TATCGCCGGC AACGCCGCCG GCACGCTCAA CTACACGCCC
GGCCAGGCCC AGGTCGGGGA CGTGTTGAGG CTGCGGGTGA GCTTCACCGA CGGCGCGGGA
AATCCGGAAG AGCTGTTCTC GGCGCCGACC GGCGTCGTCG GCGACACCTT CACGGGCACG
GCGCTCAACA GGACCTTCAA CGGGACGGCC GGTGACGACA TCGCCAACGG CGCCGACACC
GCCCTCTTCG GGATCCAGCC GAACGACACG ATGAACGGTG GCGCCGGCAA CGACATCCTG
AACGGCCGCG GCGGCAATGA CACGTTCATT CAAGTCAGCA CCGACGGGCG CGACCGCGTC
GATGGCGGGG CTGGCACCGA TACCTACCAG TTGAACGGCG CCGCGGGGCC TGAGACCTTC
CGCATCTACA GCCGGTCGGC TTGGCTTCAG GTCGCCGGCA ATACGGAGGC GCAGCTCGCC
GCTTCGACCG AGATCGTGAT CACCCGCAAC GGCACGGGTG CCGCCGCGAT CGTCGCCGAA
CTCGACAACG TCGAGGAGAT CCGCGTCAAC ACGCTCCAAG TCACGTCACC CGGCGGCCAG
AACGGCGGTG CCAATGGCGG CGACACGATC CAGGTGATCG GCAGCTTCAC CGGCACCAGC
CTCAACTTCA ACACCATCAC CATCGACGGC TCATCGGGCG ACGACACCGT GGACATGGCC
GCCCTCGCCT CGGCCCACCG CATCGTCTTC CGCTCGAACG GGGGGCACGA CACCATCGTC
GGCACCCTGC GGCCGCAGGA CGTGATCGAA CTGCCGGCCG GCTCGAACCG GGCGGACTAC
GTCGCGGCCG CGGGCGCCAA CGGCATAACG ACCCTCTCCA ACGGGTCGCA TACCATCACC
TTCTCGGCCG CGGGCGGCAT GCCACGGATC ACGGTCGATG CAGGCGCCGA CGGAGGGGAC
GGCGAGGGCG TCACCGGCGC CTTCGCCTAC ACGGCAGCCG ACATCGATGG GCTGGAGGCG
CTGGTGCGCG GCCAGCGCCC TGACAATGCG GGCGACGACG ACGTGCCGAC GGGCTACCGC
GAGCTCAGCG GGCACGGCAA CAACCTCGAC CACCCGACCT GGGGCAGCGC CGACCAAGCC
TTCATCCGCC TGACCCAGGC CCGCTACGGC GAGGCCGACG CCAACGGCAA CCGGGCCATC
AACCCGATCT TCGACGGCCT CGACGCGCGC ACCATCAGCA ACATCCTCGG CACGCAGGAA
GCCGGCCTGC CCAAGGCCGG AAACGACGCC AACATCTTCT TCATGGCGAT GGGCCAGTAC
ATCGACCACG GGCTCGACTT CCTGCCCAAG GGCGGCAACG GCTCGATCGT GATCGGTGCG
CCCGGCGGCG GGGCACCGGG CTCCAACAAC CCGGCCGACC TCACCCGCGG CACCGTCATG
GCCGTCGATG CGAACGGCGT GCCGCAGCAC AAGAATCAGA CCTCCCCCTA CATCGACCAG
AACCAAGCCT ACGGCTCCAA CGCGCTGGTC GGCCAGTTCC TGAGAGAGAG CGACGGGGCG
CAGGGCGTCG GCATGCGGCT CCTGGCGGGG GCACCGGACC CGTCCAACCC CGCCTTCAAC
CTGCTGCCGA CGCTGCGCGA GTTGGTCAAC CACCACTGGC AGGCCGACAC CATCTTCGCC
GGACCGGACG GGCCGATCAG CTTCCGGACC TATTACACGA ACTTCGCCCT GTCCGAGGGG
GTGACCGGCA CCCTGTTCAA CACCGAGACC GGCGCCTTCG ACCCGCAGGT GCTCACGAAG
CTCGTCGGAA ACTTCATGGG CTCGGGCCAT CCACTGCTGC TCGACACCAA CCCCTTCATC
AGCGTCCTCG ACCACTTCGT GGCCGGCGAC GGGCGGGCCA ACGAGAACTT CGCCCTCACC
TCGATCCACA CGGTCTGGGC GCGCAACCAC AACTACCACG TCGAGAAGCT GCTGGAATCC
GGCTTCGAGG GCACGCCCGA GCAGGTGTTC CAGGCCGCCA AGATGGTCAA CGAGGCTGAG
TATCAGCGTG TCGTCTTCGA CGAGTACCTG GAGACGCTGA TCGGCGGTCT GCGCTCGGAC
GGCACGCACG GCTTCGAGGC CTATGATCCG AGCGTGGATG TCGCGATCAG CCACGAGTTC
GCGGCGGCGG TGTTCCGGTT CGGCCACTCT CTGATCGGGC AGACGCTGAA TGTGAAGGGC
GCCGACGGCG AGACCGTTCC GGTCAGCCTG TTCGACGCCT TCCTCAACCC GAGCAACGAT
CCCTCGGTCT TCACCGCGCC GCTGCCCCCC GGCTACGTGC CGCAGCCCGG CTACGCCCAG
TACGGCGTCG GCGGCATCAT CGGCGGCACC ATCGAGCAGG CGGCCGAGGA CGTCGACTTC
AACATCGTCG ATGCGGTTCG CAACGACCTC GTGCGCATTC GGGCCGACCT GTTCGCCTTC
AACGTGGCCC GCGGCTGGGA CGTGGGCCTC GGCACCCTCA ACCAGGTCCG GGCCGATCTG
GCGGCCTCCA CCAATCCCTA TATCCGGGAC GCGGTGGGCT TCGCCGGCGG CGACCTCTCG
CCCTACGCCT CCTGGGAGGA CTTCCAGGCG CGCAACAGCC TCAGCGACGC GGTGATCGCG
CAGTTCCGGC AAGCCTACCC GGACCTCGTC CTCGCCGCCG CGGACATCGC GGCCTTCCAG
GCGATCAACG GCGACATCGC CATCGCCATG CAGGCCGACG GGACCGGTGT CGTGAAGGGG
ATCGACCGGC TCGATCTTTG GGTCGGCGGG CTTGCCGAGA AGCACATCAA CAACGGCGTC
GTCGGCCAGA CCTTCTGGGT CGTGCTGCAC GAGCAGTTCG ACCGACTGCA GGACGGCGAC
CGCTTCTACT ACCTCGAGCG CTTCGACAAC TTCGACTTCT ACGAGAACGT CATCGACGGC
CAGGGCTTCT CGGACATCGT CGCCCGGAAC ACCGGCCTGA CGGTCCTGCC GGAACACATC
TTCGAGCTGT CCGACGAGGA CGGGCCGGGG ACGGAGCCCG GCGATGACGA TGACGACGGC
GACACCGATC CCGTGGGTGG CGATCCCGAC GAGGACGAGG ACGAGGACGG CCCGACCGAT
CCGGTCGGCG GTGGCGGCGA TGCCGGTGGG GACGAGGACG ACGGCGTGAC CGACCCTGTC
GGCGGCGGGG ACGATCCCGG TGACGACGAG GACGACGGGC AAGGAGACGG AGACGGGACG
ACCGACCCGG TCGGCGGGGG TGACGGCGAC GAGGACGGGC CCGGGACCGG TCCCGGCACC
AATCCGCCCG TCAACCACGT GCCCGGCGTG ATCGCCGGCG GCGCGAACGG AGACGTCCTC
AACGGCACGG CGGGGGCGGA TACCATCCTC GGCCTGGATG GCGACGACAA CATCCTCGCC
GGCGGCGGTG CCGACGTGGT CCGGGCCGGT GCGGGCAACG ACTTCGTCGA TGCTGGCGAG
GGCCGGGACG TGGTGTTCGC CGGGGACGGC GACGACGATG TCCTCAGCGG CGGCGGCGCC
GACATGGTGT ACGGCGACGG CGGCAACGAC CGCATCCTCG CCGGAGCCGG CAACGACCTC
GTGACCGCCG GCGCCGGCCG GGACACGGTC ATCGGCGGCG AGGGCGACGA CCTGTTCGTG
GCCGAGACCG GTGACGGCGA CGACACCTAC TGGGGCGACG AGATGGGGGG CGGGCTGGGG
AGCGACACGC TCGACATGTC GGCCATCACC GCCAACATCG CGGTCAACCT CGGGACGGGT
CTGGCTGGGC GCGGCAGTGC GACCAGCACC CAGTCGGGTC GCGACGTGCT CTGGGGCGTC
GAGAACGTCG TCACGGGGTC CGGCAACGAC GACATCACGG CGAGCGACGC CGCGAACGTG
ATGGATGGCG GCGAGGGCAG CGACACCTAC CGCTTCGGCT CGGCCGCCGC GGCGAACGGC
GACACCATCG AGGGCTTCCG GCCGGGCGAC AAGATCGACC TCAGCGCGAT CGACGCCGAT
GCCGGGCAGG CCGGGAACCA AGCCTTCACG CTCGCGACCG GCGCGGCCTT CACGGGGGTC
GGACAGCTCC TGGTGACGCA GGAGACGCGC GACGACGGAG ACTACGTCGT CGTCCAGGGG
AACACGGCCG GCGACGCCTC CCCGGAATTC AAGCTCGCCA TCAAGGGCAA TACGGCGCCC
ACGGCCGCCG ACTTCACGCT CTGA
 
Protein sequence
MAMAVKLNLQ DLTFILKQIK IAEAHASGIK LTELRLDAAG TLLTDRGLYD ATGNWLGDTA 
APKAIADPHV PYGLRTVDGT YNNLVPGRET WGSSGQPMPQ LFEPTYLNDA DGDTMALGPG
APVITNTNYG LPGSVADADP RIISNLVVDA TLDNPAAIAA ALRIAGSENV IADQRAITAA
HEALKAAQAA SPAGDHAVLQ SNLDALLEQT GVTVTNGSID VLNVSPDEGL SKPFNAWMTF
FGQFFDHGLD LISKGGNGTV YVPLAADDPL VLGQDGLAGT ADDLAPHLRF MTLTRAAQVE
GSQRNVTTPF VDQNQTYTSN ASHQVFLREY ALVDGRPVAT GRLLGGADGG LATWADVKFQ
ARTILGIELT DADVSAVPQL LVDAYGEFVR SVNGLPQVMV GVGPGGQAVY ASGSLAEPLK
LSAIQLPVGT VLVGPNGAQN VIEAGETVAA ARTFNAFLDD IAHNAVPVAV NGVLRPDADA
LTGNAVQMNP QTGRNLEYDN ELLDRHFVTG DGRGNENIGL TAVHHIFHSE HNRQIDAHKL
TILQSGDLAF INDWLATDIA ALPGNFAQMT ALGQLAYANT LSWDGERLFQ AARFATEMQY
QHLVFEEFAR KIQPLVDPFV FNPVTEIDPS IFAEFANTVY RFGHSMLTEN MPRLGPDGQA
LDADLGLIDA FLNPLAFDND GGLSHDESAA AIMRGMTIER GSEIDEFVVG ALRNNLLGLP
LDLAAINIAR GRDTGTPTLN EARAQLYAAT GSTFLTPYTS WVEMAANLKN PLSVVNFIAA
YGTHGTVVAA TTLAAKRDAA MALVFGGEGA PTDRLDYLNS RGSWAGRETG FGAVDLWIGG
LAEKQMPFGG MLGSTFNAIF EAQMENLQDA DRFYYLSRVQ GQNFLNELEQ NSFSKIMLAN
SSLSLPGPDG IRGTADDIVP RHIGVDAFAD YDFELEVNAA NQLDQNGAAP GRDPTGNDPV
LEAMGLGKVV RDDPGTAADE GASGFHASVN ALVRRYGADG SPTGALVDGS EDGGGDAGSP
VTWADLKANA AKLGIALTQA DMLDAPVLRI GADGRLAFAP GSSVPEAVAV ANGSFEGLAL
VAGQEGVILD GNGNYTTTNP AGWTIAGGVG GLFAPADAVV DPAGRDGANV VWLRGGATLS
QEGGTTLQAG VGYTYSFKVG DRTDFTWPGA EARLVAVGGA TPVTLGTLTL TEPADGQWGT
FTLATGVVPS ALAGLHLRLE IRNTGSGDAQ ILVDDIELVR TAPAYRSDLT PAQTPGYDPA
ADPFLRDGAG NVLRTGQSIA SPAADLDATV VDPAALNQPF ATGHYLRFTG GEHVLVGGTD
GNDTIITDFG DDGIWGDAGD DRIEAGAGVD LVNGGAGDDI ITDSGDTGDF LKGEDGNDVI
ANSNGIDILM GGRGKDAIFV GVDATEVFAG EGDDFVIGGD DADLLMGNEG DDWMEGGGGF
DTTAGDNSEL FFNSAIKGHD VMFAGGDEHD FDGESGDDIM VQGESVMRNE GMFGFDWAIY
KGNQIAANAD MRIPIFTTEE ADILRNRFDK TEGLSGWRLN DTLIGDDRTA AANADAEAPA
GAPITAANEG VFFNDGLDAA GIARIAGLDQ IVSLASGQQF FEAGNILLGG AGSDTLQGNG
GNDILDGDRW LNVRISIRNP ADAGQEMATA DSLKHVFDDS AANQARGWAG KSLFELMIDR
VISPTQLAIV REVITTGATA ADVDTAVFND IRANYTITRA ANGTLTVTHT TLTNPTVDDG
TDTLRNIEKL RFADGTADVA LVLNQPFDSL TIRPFDADGD DSSTLVATLV NRVNATTRPV
TLQWQVLADN GQWRNVTGAD GQVTNGGTFF TPTGATGVEV RVVANWTSTV AGNTGLQQTA
SIQSAFVGTA AAEDITGSAT PNVILGRDGD DDIEGDVGND AIYAGSGDDR VDGGEGDDTL
LGNDGADTLI GRTGNNTLDG GDDDDQLSGG HGNDRLIGGA GTDTAIYSGP IAAYSFERNA
QGEVVVSDNL GAEGDGVDTL TTIEQIQMGN DLTPYALVAN GTAAIDIVVG TAGNNTLSGG
AGNDLVFAGA GNDNVLWRTG DGRDFVDGGA GTDSFRIMNG TGPVQQLTLA QARAQFANLS
FRDDTQTVVV RNGIVIAELK NVEQVAVNAV ATGAPVVTDP TPTNGLVSPT EGQPLGALVA
AIQDADGLGT FSLRWQQSGD NGQTWTDIAG NAAGTLNYTP GQAQVGDVLR LRVSFTDGAG
NPEELFSAPT GVVGDTFTGT ALNRTFNGTA GDDIANGADT ALFGIQPNDT MNGGAGNDIL
NGRGGNDTFI QVSTDGRDRV DGGAGTDTYQ LNGAAGPETF RIYSRSAWLQ VAGNTEAQLA
ASTEIVITRN GTGAAAIVAE LDNVEEIRVN TLQVTSPGGQ NGGANGGDTI QVIGSFTGTS
LNFNTITIDG SSGDDTVDMA ALASAHRIVF RSNGGHDTIV GTLRPQDVIE LPAGSNRADY
VAAAGANGIT TLSNGSHTIT FSAAGGMPRI TVDAGADGGD GEGVTGAFAY TAADIDGLEA
LVRGQRPDNA GDDDVPTGYR ELSGHGNNLD HPTWGSADQA FIRLTQARYG EADANGNRAI
NPIFDGLDAR TISNILGTQE AGLPKAGNDA NIFFMAMGQY IDHGLDFLPK GGNGSIVIGA
PGGGAPGSNN PADLTRGTVM AVDANGVPQH KNQTSPYIDQ NQAYGSNALV GQFLRESDGA
QGVGMRLLAG APDPSNPAFN LLPTLRELVN HHWQADTIFA GPDGPISFRT YYTNFALSEG
VTGTLFNTET GAFDPQVLTK LVGNFMGSGH PLLLDTNPFI SVLDHFVAGD GRANENFALT
SIHTVWARNH NYHVEKLLES GFEGTPEQVF QAAKMVNEAE YQRVVFDEYL ETLIGGLRSD
GTHGFEAYDP SVDVAISHEF AAAVFRFGHS LIGQTLNVKG ADGETVPVSL FDAFLNPSND
PSVFTAPLPP GYVPQPGYAQ YGVGGIIGGT IEQAAEDVDF NIVDAVRNDL VRIRADLFAF
NVARGWDVGL GTLNQVRADL AASTNPYIRD AVGFAGGDLS PYASWEDFQA RNSLSDAVIA
QFRQAYPDLV LAAADIAAFQ AINGDIAIAM QADGTGVVKG IDRLDLWVGG LAEKHINNGV
VGQTFWVVLH EQFDRLQDGD RFYYLERFDN FDFYENVIDG QGFSDIVARN TGLTVLPEHI
FELSDEDGPG TEPGDDDDDG DTDPVGGDPD EDEDEDGPTD PVGGGGDAGG DEDDGVTDPV
GGGDDPGDDE DDGQGDGDGT TDPVGGGDGD EDGPGTGPGT NPPVNHVPGV IAGGANGDVL
NGTAGADTIL GLDGDDNILA GGGADVVRAG AGNDFVDAGE GRDVVFAGDG DDDVLSGGGA
DMVYGDGGND RILAGAGNDL VTAGAGRDTV IGGEGDDLFV AETGDGDDTY WGDEMGGGLG
SDTLDMSAIT ANIAVNLGTG LAGRGSATST QSGRDVLWGV ENVVTGSGND DITASDAANV
MDGGEGSDTY RFGSAAAANG DTIEGFRPGD KIDLSAIDAD AGQAGNQAFT LATGAAFTGV
GQLLVTQETR DDGDYVVVQG NTAGDASPEF KLAIKGNTAP TAADFTL