Gene Mchl_1979 details


Gene Information

Locus tag: Mchl_1979
Symbol:
ID: 7118679
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 2051560
End bp: 2062323
Gene length: 10764 bp
Protein length: 3587 aa
Translation table: 11
GC content: 69%
IMG OID: 643524732
Product: Animal heme peroxidase
Protein accession: YP_002420757
Protein GI: 218529941
COG category:
COG ID:
TIGRFAM ID:
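
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that the coordinates are 1-based and inclusive, that the gene is unspliced (as the record states), and that the reported gene length includes one terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2051560
END_BP = 2062323


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 10764, matching the gene length field
print(protein_length(length_bp))  # 3587, matching the protein length field
```

Under these assumptions the computed values agree with the 10764 bp and 3587 aa reported in the record.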


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGTTA AGCTCAATCT TCAAGATCTG ACCTTCATCC TGAAGCAGAT CAAGATCGCC 
GAAGCGCATG CGAGCGGCAT CAAGCTCACG GAACTGCGCC TCGATGCCGC CGGCACGCTT
CTCACGGATC GGGGTCTCTA CGACGCCACC GGCAACTGGC TCGGCGATGC CGCGGCGCCG
AAGGCGATTG CGGATCCCCA CGTGCCCTAC GGGCTGCGCA CGGTCGACGG CACCTACAAC
AACCTCGTGC CCGGCCGCGA GACCTGGGGC TCGTCCGGCC AGCCAATGCC GCAACTCTTC
GAGCCGACCT ACCTCAACGA CGCGGACGGG GACACGATGG CGCTCGGACC GGGCGCTCCG
GTCATCACGA ACACCAATTA CGGGCTGCCC GGTTCGGTCG CCGATGCCGA CCCGCGGATC
ATCTCGAACC TCGTGGTCGA TGCCACGCTC GACAACCCGG CCGCCATCGC CGCGGCGCTC
AGGATCGCCG GTTCGGAGAA CGTCATCGCC GATCAGCGCG CCATCACGGC AGCGCACGAG
GCCCTGAAGG CCGCTCAGGC CGCGAACCCG GCCGGCGATC ACGCGGTCCT GCAATCGAAC
CTCGATGCGC TGCTGGAGCA GACCGGCGTC ACCGTCACCA ACGGCTCCAT CGACGTCCTC
AACGTCTCGC CCGACGAGGG GCTGTCGAAG CCGTTCAACG CCTGGATGAC CTTCTTCGGC
CAGTTCTTCG ACCACGGCCT CGATCTGATC TCCAAGGGCG GGAACGGCAC CGTCTACGTC
CCGCTCGCCG CCGACGACCC GCTCGTGCTC GGACAGGACG GCCTCGCTGG CACTGCCGAC
GACCTCGCGC CGCATCTGCG CTTCATGACG TTGACGCGCG CCGCGCAGGT CGAGGGTTCG
CAGCGCAATG TCACGACACC CTTCGTCGAC CAGAACCAGA CCTACACCTC GAACGCCTCC
CACCAAGTGT TCCTGCGCGA ATACGCACTC GTGGACGGTC GGCCCGTCGC GACCGGGCGC
CTGCTCGGCG GCGCGGACGG AGGCCTCGCC ACCTGGGCGG ACGTCAAGTT CCAGGCGCGG
ACGATCCTCG GCATCGAACT GACGGACGCC GACGTCTCGG CCGTCCCGCA ATTGCTGGTG
GACGCCTACG GCGAGTTCGT CCGCAGTGCC AACGGCTTGC CGCAGGTGAT GGTCGGGGTG
GGGCCTGGGG GGCAGGCCGT CTACGCCAGC GGCAGCCTCG CCGAGCCGCT GAAGCTCTCG
GCGATCCAGC TTCCCGTCGG CACCGTGCTG GTGGGTCCCA ACGGCGCGCA GAACGTCATC
GAGGCCGGTG AGACGGTCGC AGCGGCCCGC ACCTTCAACG CGTTCCTCGA CGACATCGCC
CACAACGCGG TGCCGGTCGC CGTGAACGGC GTGCTCCGGC CCGATGCCGA CGCGCTCACG
GGCAATGCGG TTCAGATGAA CCCGCAGACC GGCCGCAATC TCGAATACGA CAACGAGCTT
CTCGACCGGC ACTTCGTCAC CGGCGACGGG CGCGGCAACG AGAATATCGG CCTCACCGCG
GTCCACCACA TCTTCCATTC CGAGCACAAC CGGCAGATCG ACGCCCATAA GCTCACGATC
CTGCAGTCGG GCGATCTCGC CTTCATCAAC GATTGGCTGG CGACCGACAT CGCAGCGCTG
CCGGGCAACT TCGCGCAGAT GACGCCGCTG GGTCAGCTCG CCTACGCCAA TACGCTGTCC
TGGGACGGTG AGCGGCTGTT CCAGGCGGCG CGCTTCGCCA CCGAGATGCA GTACCAGCAC
CTCGTCTTCG AGGAATTCGC CCGCAAGATC CAGCCGCTGG TCGATCCGTT CGTGTTCAAC
CCGGTGACCG AGATCGACCC GTCGATCTTC GCCGAGTTCG CCAATACGGT CTACCGCTTC
GGTCACTCGA TGCTGACCGA GAACATGCCG CGGCTCGGGC CGGACGGGCA GGCGCTCGAC
GCCGACCTCG GCCTGATCGA CGCCTTCCTC AATCCGTTGG CGTTCGACAA TGACGGCGGC
CTGTCCCACG ACGAGAGCGC GGCCGCGATC ATGCGCGGCA TGACCATCGA ACGCGGCAGC
GAAATCGACG AGTTCGTCGT CGGCGCCCTG CGCAACAACC TGCTCGGCCT GCCGCTCGAT
CTCGCCGCCA TCAACATCGC CCGCGGCCGC GATACGGGCA CGCCGACGCT CAACGAGGCG
CGGGCCCAAC TCTACGCCGC GACGGGCTCG ACCTTCCTCA CCCCCTACAC GAGCTGGGTG
GAGATGGCGG CCAACCTGAA GAACCCGCTC TCGGTGGTGA ACTTCATCGC CGCCTACGGG
ACCCACGGCA CCGTTGTCGC CGCGACGACG CTCGCGGCCA AGCGCGACGC GGCCATGGCC
CTGGTCTTCG GCGGCGACGG CGCGCCGACG GACCGCCTCG ACTACCTCAA TTCGCGCGGG
AGCTGGGCCG GACGGGAAAC GGGCTTCGGT GCGGTCGATC TCTGGATCGG CGGCTTGGCC
GAGAAGCAAA TGCCGTTCGG CGGCATGCTG GGCTCGACCT TCAACGCCAT CTTCGAAGCC
CAGATGGAGA ACCTGCAGGA TGCCGACAGG TTCTACTATC TCAGCCGCGT CCAGGGGCAG
AACTTCCTCA ACGAGTTGGA GCAGAACTCC TTCTCGAAAA TCATGCTCGC CAATTCGAGC
CTGTCGCTGC CGGGGCCTGA CGGCATCCGC GGCACGGCGG ACGATATCGT CCCCCGCCAT
ATCGGCGTCG ATGCCTTCGC CGATTACGAC TTCGAGCTGG AGGTGAACGC CGCCAACCAG
CTCGACCAGA ACGGCGCCGC CCCCGGCCGC GACCCGACCG GAAACGACCC CGTGCTGGAG
GCAATGGGCC TCGGCAAGGT CGTGCGCGAC GATCCCGGCA CCGCGGCCGA CGAGGGCGCG
AGCGGCTTCC ACGCCTCGGT CAACGCCCTG GTCCGGCGCT TCGGCGCGGA TGGGAGCCCC
ACGGGCGCGC TCGTGGACGG CAGCGAGGAT GGGGTCGGCG GCGCCGGCAG CCCCGTCACC
TGGGCCGATC TCAAAGCGAA TGCGGCCAAG CTCGGCATCG CGCTGACCCA GGCGGACATG
CTGGACGCGC CGGTGCTCCG GATCGGTGCC GATGGGCGCC TCGCCTTCGC TCCCAGCTCG
TCCGTCCCCG AGGCGGTGGC GGTCGCCAAC GGCAGCTTCG AGGGCTTGGC GCTGGTCGCC
GGCCAGGAGG GCGTGATCCT CGACGGCAAC GGCAACTACA CCACGACGAG CCCGGCCGGC
TGGACCATCG CCGGGGGCGT CGGCGGCCTC TTCGCGCCGG CCGACGCGGT CGTCGACCCC
GCCGGCCGCG ACGGCGCCAA CGTCGTGTGG CTGCGCGGCG GCGCCACGCT CTCGCAGGAG
GACGGGACGA CCCTCCAGGC CGGCGTCGGC TACACCTACA GCTTCAAGGT CGGGGACCGC
ACCGATTTCA CCTGGCCGGG CGCCGAGGCG CGCCTCGTGG CGGTCGGCGG GGCGAACCCG
GTGACCCTCG GAACGCTGAC CCTGACGGAG CCGGCCGACG GGCAGTGGGG CACCTTCACC
CTCGCAACGG GTGTTGTCCC GTCGGCGCTT GTGGGCCTTC AGCTTCGCCT GGAGATCCGC
AACACCGGCA GCGGCGATGC GCAGATCCTC GTGGACGACA TCGAACTCGT GCGGACGGCG
CCGGCCTACC GGTCCGATCT GACCCCGGCG CAGACGCCTG GCTACGACCC GGCCGCCGAC
CCGTTCCTGC GCGATGGCGC CGGCAACGTC CTGCGCACCG GGCAGTCCAT CGCCAGCCCG
GCCGCCGACC TCGACGCAAC CGTCGTCGAT CCGGCCGCGC TCAATCTGCC GTTCGCCACC
GGGCACTACC TCCGCTTCAC CGGCGGCGAG CACGTGCTGG TGGGCGGCAC CGACGGCAAC
GACACCATCA TCACCGATTT CGGCGATGAC GGAATCTGGG GGGATGCGGG CGACGACCGG
ATCGAGGCGG GCGCAGGCGT CGATCTCGTG AACGGCGGAG CGGGTGACGA CATCATCACC
GATTCCGGCG ATACCGGCGA CTTCCTCAAG GGCGAGGACG GCAACGACGT CATCGCCAAC
TCGAACGGCA TCGACATCCT GATGGGCGGG CGCGGCAAGG ACGCGATCTT CGTCGGCGTC
GATGCGACCG AAGTCTTTGC CGGCGAGGGC GACGACTTCG TGATCGGTGG CGACGACGCC
GACCTTCTGA TGGGCAACGA GGGCGACGAC TGGATGGAGG GTGGCGGCGG CTTCGACACC
ACCGCCGGCG ACAACTCGGA ACTGTTCTTC AACTCGGCCA TCAAGGGCCA CGACGTCATG
TTTGCCGGGG GCGACGAGCA TGATTTCGAC GGCGAGTCCG GCGACGACAT CATGGTCCAG
GGCGAGAGCG TGATGCGCAA CGAGGGCATG TTCGGCTTCG ATTGGGCGAT CTACAAGGGC
AACCAGATCG CGGCCAATGC CGACATGCGC ATCCCGATCT TCACCACCGA AGAAGCCGAC
ATCCTACGCA ACCGCTTCGA CAAGACGGAG GGGCTCTCCG GCTGGCGGCT GAACGACACG
CTCATCGGCG ACGACCGGAC CGCGGCCGCC AACGCGGATG CCGAGGCGCC TGCCGGCGCG
CCGATCGCGG CGGCAAACGA GGGCGTGTTC TTCAACGACG GGCTCGACGC GGCCGGCATC
GCCCGGATCG CGGGGCTCGA CCAGATCGTG TCGCTTGCTT CCGGACAGCA GTTCTTCGAG
GCCGGCAACA TCCTGCTGGG CGGCGCGGGC AGCGATACGC TCCAGGGCAA CGGCGGCAAC
GACATCCTCG ACGGCGACCG CTGGCTGAAC GTGCGCATCA GCATCCGCAA TCCCGCGGAT
GCCGGCCAGG AGATGGCGAC GGCCGACAGC CTGAAGCACG TCTTCGACGA CAGCGCCGCC
AATCAGGCGC GTGGCTGGGC CGGAAAGTCG CTCTTCGAGC TGATGATCGA CCGGGTGATC
AGCCCGACCC AGCTCGCAAT CGTCCGCGAG GTGATCACCA CCGGAGCGAC GGCGGCGGAC
GTGGACACCG CCGTGTTCAA CGACATCCGG GCGAACTACA CGATCACCCG CGCGGCGAAC
GGCACGCTCA CCGTCACCCA CACCACGCTG ACCAATCCCG CCGTCGATGA CGGCACGGAT
ACGCTGCGCA ACATCGAGAA GCTGCGTTTC GCGGACGGCA CGGCGGACGT CGCGCTGGTT
CTCAACCAGC CCTTCGACAG CCTCACCATC AGACCGTTCG ACGCGGACGG CGACGATTCC
TCGACACTCG TCGCGACGCT GGTGAACCGT GTGAACGCGA CCACCCGGCC CGTGACCCTG
CAGTGGCAGG TGCTGGCCGA CAACGGGCAG TGGCGCAACG TGACGGGCGC GGACGGCCAG
GTCACGAACG GGGGCACCTT CTTCACGCCG ACCGGAGCGA CAGGCGTCGA GATCCGGGTG
GTCGCGAACT GGACCAGCAC GGTCGCTGGC AATACCGGCC TGCAGCAGAC GGCCTCCATC
CAGTCCGCCT TCGTCGGAAC TGCCGCCGCC GAGGACATCA CGGGTTCCGC GAGCCCGAAC
GTCATCCTGG GGCGCGACGG CGACGACGAC ATCGAGGGCG AGGTCGGCAA CGATGCCATC
TACGCCGGTA GCGGCGACGA CCGGGTCGAT GGCGGCGAGG GCGACGACAC CCTCCTCGGC
AACGATGGCG CCGACACCCT GATCGGCCGG ACCGGCAACA ACACGCTCGA TGGCGGCAAC
GACGACGACC AACTCTCCGG CGGGCACGGC AACGACCGTC TGATCGGCGG CGCCGGCACC
GACACCGCCA TCTACTCGGG ACCGATTGCG GCCTACAGCT TCGAGCGGAA CGCGCAGGGC
GAGGTCGTCG TCAGCGACAA CCTCGGAGCG GAGGGCGACG GCGTCGATAC GCTCACCACG
ATCGAGCAGA TCCAGATGGG CAACGACCTC ACGCCCTACG CGCTGGTCGC CAACGGCACG
GCGGCCGTCG ACATCGTGGT CGGGACGGCC GGCAACAACA CGCTCAGCGG CGGCGCGGGC
AACGACCTCG TCTTCGCGGG AGCCGGCAAC GACAACGTCC TGTGGCGGAC CGGCGACGGC
CGCGACTTCG TCGATGGCGG CGCCGGCACG GACAGCTTCC GGATCATGAA CGGGACCGGT
CCCGTCCAGC AGCTCACGCT GGCCCAGGCC CGGGCGCAGT TCGCGAACCT GTCTTTCCGT
GACGACACGC AGACCGTGGT CGTGCGCAAC GGGATCGTCA TCGCCGAGCT CAAGAACGTC
GAGCAGGTTG CCGTGAACAC CGTCGCGACC GGGGCGCCCG TCGTCACGGA CCCGACGCCG
ACGAACGGGC TCGTCTCGCC GACGGAGGGC CAGCCGCTCG GCGCGCTGGT CGCCGCGATC
CAGGACGCCG ACGGTCTTGG GGCCTTCTCG CTGCGCTGGC AGCAATCGGG CGATAACGGC
CAGACCTGGA CGGACATCGC CGGCAACGCC GCCGGCACGC TCAACTACAC GCCCGGCCAG
GCCCAGGTCG GGGACGTGCT GAGGCTGCGG GTGAGCTTCA CCGACGGCGC GGGAAATCCG
GAAGAGCTGT TCTCGGCACC CACCGGCATC GTCGGCGACA GCTTCACAGG CACAGCGCTC
AACAGGACCT TCAACGGGAC GGCCGGTGAC GACATCGCCA ACGGCGCCGA CACCGCCCTC
TTCGGGATCC AGCCGAACGA CACGATGAAC GGTGGCGCCG GCAACGATAT CCTGAACGGC
CGCGGCGGCA ACGATACGTT CATTCAAGTC AGCACCGACG GGCGCGACCG CGTCGATGGC
GGAGCCGGCA CCGATACCTA CCAGTTGAAC GGCGCCGCGG GGCCTGAGAC CTTCCGCATC
TACAGCCGGT CGGCTTGGCT TCAGGTCGCG GGCAATACGG AGGCGCAGCT CGCCGCTTCG
ACCGAGATCG TGATCACCCG CAACGGCACG GGTGCCGCCG CGATCGTCGC CGAACTCGAC
AACGTCGAGG AGATCCGCGT CAACACGCTC CAGGTGACGT CGCCCGGGGG CCAGAACGGC
GGTGCCAATG GCGGCGACAC GATTCAGGTG ATCGGCAGCT TCACCGGCAC CAGCCTCAAC
TTCAACACCA TCACCATCGA CGGGTCATCG GGCGACGACA CCGTGGACAT GGCCGCCCTC
ACCTCGGCCC ACCGCATCGT CTTCCGCTCG AACGGGGGGC ACGACACCAT CGTCGGCACC
CTGCGGCCGC AGGACGTGAT CGAACTGCCG GCCGGCTCGA ATCGGGCGGA CTACGTCGCG
GCCGCGGGCG CCAACGGCAT GACGACCCTC TCCAACGGGT CGCACACCAT CACCTTCTCG
GCCGCGGGCG GCATGCCGCG GATCACGGTG GATGCGGGTG CCGACGGAGG TGACGGCGAG
GGCGTCACCG GCGCCTTCGC CTACACGGCA GCCGACATCG ACGGGCTGGA GGCGCTGGTG
CGCGGCCAGC GCCCTGACAA TGCGGGCGAC GACGACGTGC CGACGGGCTA CCGCGAGCTC
AGCGGCCACG GCAACAACCT CGACCACCCG ACCTGGGGCA GCGCCGACCA AGCCTTCATC
CGCCTGACCC AGGCCCGCTA CGGCGAGGCC GACGCCAACG GCAACCGGGC CATCAACCCG
ATCTTCGACG GCCTCGACGC GCGCACCATC AGCAACATCC TCGGCACGCA GGAGGCCGGC
CTGCCCAAGG CCGGAAACGA CGCCAACATC TTCTTCATGG CGATGGGCCA GTACATCGAC
CACGGGCTCG ACTTCCTGCC CAAGGGTGGC AACGGCTCGA TCGTGATCGG TGCGCCCGGC
GGCGGGGCAC CGGGCTCCAA CAACCCGGCC GACCTCACCC GCGGCACCGT CATGGCCGTC
GATGCGAACG GGGTGCCGCA GCACAAGAAC CAGACCTCCC CCTACATCGA CCAGAACCAA
GCCTACGGCT CCAACGCGCT GGTCGGCCAG TTCCTGCGGG AGAGCGATGG GGCGCAGGGC
GTCGGCATGC GGCTCCTGGC GGGGGCACCG GACCCGTCCA ACCCCGCCTT CAACCTGCTG
CCGACGCTGC GCGAGTTGGT CAACCACCAC TGGCAGGCCG ACACCATCTT CGCCGGACCG
GACGGACCGA TCAGCTTCCG GACCTACTAC ACGAACTTCG CCCTGTCCGA GGGGGTGACG
GGCACCCTGT TCAACACGGA GACCGGCGCC TTCGACCCGC AGGTGCTCAC GAAGCTCGTC
GGAAACTTCA TGGGCTCGGG CCATCCGCTG CTGCTCGACA CCAACCCCTT CATCAGCGTC
CTCGACCACT TCGTGGCCGG CGACGGGCGG GCCAACGAGA ACTTCGCCCT CACCTCGATC
CACACGGTCT GGGCGCGCAA CCACAACTAC CACGTTGAGA AGCTGCTGGA ATCCGGCTTC
GAGGGCACCC CCGAGCAGGT GTTCCAGGCC GCCAAGATGG TCAACGAGGC GGAGTATCAG
CGCGTCGTCT TCGACGAGTA CCTGGAGACG CTGATCGGCG GTCTGCGCTC GGACGGCACG
CACGGCTTCG AGGCCTATGA TCCGAACGTG GATGTCGCGA TCAGCCACGA GTTCGCGGCG
GCGGTGTTCC GGTTCGGCCA CTCCCTGATC GGGCAGACGC TGAATGTGAA GGGCGCCGAC
GGCGAGACCG TTCCGGTCAG CCTGTTCGAC GCCTTCCTGA ACCCGAGCAA CGATCCCTCG
GTGTTCACCG CGCCGTTGCC TCCCGGCTAC GTGCCGCAGC CCGGCTACGC TCAGTACGGC
GTCGGCGGCA TCATCGGCGG CACCATCGAG CAGGCGGCCG AGGACGTCGA CTTCAACATC
GTCGATGCGG TCCGCAACGA CCTCGTGCGC ATTCGGGCCG ACCTGTTCGC CTTCAACGTG
GCCCGCGGCT GGGATGTGGG CCTCGGCACC CTCAACCAGG TCCGGGCCGA TCTGGCGGCC
TCCACCAATC CCTACATCCG GGACGCGGTG GGCTTCGCCG GCGGCGACCT CTCGCCCTAC
GCCTCCTGGG AGGACTTCCA GGCGCGCAAC GGCCTCAGCG ACGCGGTGAT CGCGCAGTTC
CGGCAAGCCT ACCCGGACCT CGTCCTCGCC GCCGCGGACA TCGCGGCCTT CCGGGCGATC
AACGGCGACA TCGCCATCGC CATGCAGGCC GACGGGACCG GTGTCGTGAA GGGTATCGAC
CGGCTCGATC TCTGGGTCGG CGGGCTTGCC GAGAAGCACA TCAACAACGG CGTCGTCGGC
CAGACCTTCT GGGTCGTGCT GCACGAGCAG TTCGACCGAC TGCAGGACGG CGACCGCTTC
TACTACCTCG AGCGCTTCGA CAACTTCGAC TTCTACGAGA ACCTCGTCGA CGGCCAGGGC
TTCTCGGACA TCGTCGCACG GAACACCGGC CTGACGGTCC TGCCGGAACA CATTTTCGAG
CTGTCCGACG AGGACGGGCC GGGGACGGAG CCCGGCGATG ACGACGACGA CGACGACGGC
GTCACCGATC CCGTGGGTGG TGATCCCGAC GAGGACGAGG ACGGCCCGAC CGATCCGGTC
GGCGGTGGCG GCGATGCCGG TGGGGACGAG GACGAGGACG ACGGCGTGAC CGACCCTGTC
GGCGGCGGGG ACGATCCCGG CGACGACGAG GACGACGGGC AAGGAGACGG AGACGGGACG
ACCGACCCCG TCGGCGGGGG TGACGGCGAC GAGGACGGGC CCGGGACCGG TCCCGGCACC
AATCCGCCCG TCAACCACGC GCCCGGCGTG ATCGCCGGCG GCGCGAACGG AGACGTCCTC
AACGGCACGG CGGGGGCGGA TACCATCCTC GGCCTGGACG GCGACGACAA CATCCTCGCC
GGCGGCGGTG CCGACGTGGT CCGGGCCGGC GCGGGCAACG ACTTCGTGGA TGCCGGCGAG
GGCCGGGACG TGGTGTTCGC CGGGGACGGC GATGACGATG TCCTCAGCGG CGGCGGCGCC
GACATGGTGT ACGGCGATGG CGGCAACGAC CGCATCCTCG CCGGAGCCGG CAACGACCTC
GTGACCGCCG GCGCCGGCCG GGACACGGTC ATCGGCGGCG AGGGCGACGA CCTGTTCGTG
GCCGAGACCG GTGACGGCGA CGACACCTAC TGGGGCGACG AGATGGGGGG CGGGCTGGGA
AGCGACACGC TCGACATGTC GGCCATCACC GCCAACATCG CGGTCAACCT CGGGACGGGT
CTGGCCGGGC GCGGCAGTGC GACCAGCACC CAGTCGGGGC GCGACGTGCT CTGGGGCGTC
GAGAACGTCG TCACGGGGTC CGGCAACGAC GACATCACGG CGAGCGACGC CGCGAACGTG
ATGGATGGCG GCGAGGGCAG CGACACCTAC CGCTTCGGCT CGGCCGCCGC GGCGAACGGC
GACACCATCG AGGGCTTCCG GCCGGGCGAC AAGATCGACC TCAGCGCGAT CGACGCCGAT
GCCGGGCTGG CCGGGAACCA AGCCTTTACC CTCGCGACCG GCGCGGCCTT CACGGGGGTC
GGACAGCTCC TGGTGACGCA GGAGACGCGC GACGACGGAG ACTACGTCGT CGTCCAGGGG
AACACGGCCG GCGATGCCTC CCCGGAATTC AAGCTCGCCA TCAAGGGCAA TACGGCGCCC
ACGGCCGCCG ACTTCACGCT CTGA
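
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The following is a minimal sketch of how the GC content field could be recomputed from such a listing. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the rounding the database applies to its reported percentage is not known.

```python
def clean_sequence(text: str) -> str:
    """Join blocked sequence text into one uppercase string without whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage on the first line of the listing; paste the full listing to
# compare against the GC content field of the record.
first_line = "ATGGCCGTTA AGCTCAATCT TCAAGATCTG ACCTTCATCC TGAAGCAGAT CAAGATCGCC"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```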
 
Protein sequence
MAVKLNLQDL TFILKQIKIA EAHASGIKLT ELRLDAAGTL LTDRGLYDAT GNWLGDAAAP 
KAIADPHVPY GLRTVDGTYN NLVPGRETWG SSGQPMPQLF EPTYLNDADG DTMALGPGAP
VITNTNYGLP GSVADADPRI ISNLVVDATL DNPAAIAAAL RIAGSENVIA DQRAITAAHE
ALKAAQAANP AGDHAVLQSN LDALLEQTGV TVTNGSIDVL NVSPDEGLSK PFNAWMTFFG
QFFDHGLDLI SKGGNGTVYV PLAADDPLVL GQDGLAGTAD DLAPHLRFMT LTRAAQVEGS
QRNVTTPFVD QNQTYTSNAS HQVFLREYAL VDGRPVATGR LLGGADGGLA TWADVKFQAR
TILGIELTDA DVSAVPQLLV DAYGEFVRSA NGLPQVMVGV GPGGQAVYAS GSLAEPLKLS
AIQLPVGTVL VGPNGAQNVI EAGETVAAAR TFNAFLDDIA HNAVPVAVNG VLRPDADALT
GNAVQMNPQT GRNLEYDNEL LDRHFVTGDG RGNENIGLTA VHHIFHSEHN RQIDAHKLTI
LQSGDLAFIN DWLATDIAAL PGNFAQMTPL GQLAYANTLS WDGERLFQAA RFATEMQYQH
LVFEEFARKI QPLVDPFVFN PVTEIDPSIF AEFANTVYRF GHSMLTENMP RLGPDGQALD
ADLGLIDAFL NPLAFDNDGG LSHDESAAAI MRGMTIERGS EIDEFVVGAL RNNLLGLPLD
LAAINIARGR DTGTPTLNEA RAQLYAATGS TFLTPYTSWV EMAANLKNPL SVVNFIAAYG
THGTVVAATT LAAKRDAAMA LVFGGDGAPT DRLDYLNSRG SWAGRETGFG AVDLWIGGLA
EKQMPFGGML GSTFNAIFEA QMENLQDADR FYYLSRVQGQ NFLNELEQNS FSKIMLANSS
LSLPGPDGIR GTADDIVPRH IGVDAFADYD FELEVNAANQ LDQNGAAPGR DPTGNDPVLE
AMGLGKVVRD DPGTAADEGA SGFHASVNAL VRRFGADGSP TGALVDGSED GVGGAGSPVT
WADLKANAAK LGIALTQADM LDAPVLRIGA DGRLAFAPSS SVPEAVAVAN GSFEGLALVA
GQEGVILDGN GNYTTTSPAG WTIAGGVGGL FAPADAVVDP AGRDGANVVW LRGGATLSQE
DGTTLQAGVG YTYSFKVGDR TDFTWPGAEA RLVAVGGANP VTLGTLTLTE PADGQWGTFT
LATGVVPSAL VGLQLRLEIR NTGSGDAQIL VDDIELVRTA PAYRSDLTPA QTPGYDPAAD
PFLRDGAGNV LRTGQSIASP AADLDATVVD PAALNLPFAT GHYLRFTGGE HVLVGGTDGN
DTIITDFGDD GIWGDAGDDR IEAGAGVDLV NGGAGDDIIT DSGDTGDFLK GEDGNDVIAN
SNGIDILMGG RGKDAIFVGV DATEVFAGEG DDFVIGGDDA DLLMGNEGDD WMEGGGGFDT
TAGDNSELFF NSAIKGHDVM FAGGDEHDFD GESGDDIMVQ GESVMRNEGM FGFDWAIYKG
NQIAANADMR IPIFTTEEAD ILRNRFDKTE GLSGWRLNDT LIGDDRTAAA NADAEAPAGA
PIAAANEGVF FNDGLDAAGI ARIAGLDQIV SLASGQQFFE AGNILLGGAG SDTLQGNGGN
DILDGDRWLN VRISIRNPAD AGQEMATADS LKHVFDDSAA NQARGWAGKS LFELMIDRVI
SPTQLAIVRE VITTGATAAD VDTAVFNDIR ANYTITRAAN GTLTVTHTTL TNPAVDDGTD
TLRNIEKLRF ADGTADVALV LNQPFDSLTI RPFDADGDDS STLVATLVNR VNATTRPVTL
QWQVLADNGQ WRNVTGADGQ VTNGGTFFTP TGATGVEIRV VANWTSTVAG NTGLQQTASI
QSAFVGTAAA EDITGSASPN VILGRDGDDD IEGEVGNDAI YAGSGDDRVD GGEGDDTLLG
NDGADTLIGR TGNNTLDGGN DDDQLSGGHG NDRLIGGAGT DTAIYSGPIA AYSFERNAQG
EVVVSDNLGA EGDGVDTLTT IEQIQMGNDL TPYALVANGT AAVDIVVGTA GNNTLSGGAG
NDLVFAGAGN DNVLWRTGDG RDFVDGGAGT DSFRIMNGTG PVQQLTLAQA RAQFANLSFR
DDTQTVVVRN GIVIAELKNV EQVAVNTVAT GAPVVTDPTP TNGLVSPTEG QPLGALVAAI
QDADGLGAFS LRWQQSGDNG QTWTDIAGNA AGTLNYTPGQ AQVGDVLRLR VSFTDGAGNP
EELFSAPTGI VGDSFTGTAL NRTFNGTAGD DIANGADTAL FGIQPNDTMN GGAGNDILNG
RGGNDTFIQV STDGRDRVDG GAGTDTYQLN GAAGPETFRI YSRSAWLQVA GNTEAQLAAS
TEIVITRNGT GAAAIVAELD NVEEIRVNTL QVTSPGGQNG GANGGDTIQV IGSFTGTSLN
FNTITIDGSS GDDTVDMAAL TSAHRIVFRS NGGHDTIVGT LRPQDVIELP AGSNRADYVA
AAGANGMTTL SNGSHTITFS AAGGMPRITV DAGADGGDGE GVTGAFAYTA ADIDGLEALV
RGQRPDNAGD DDVPTGYREL SGHGNNLDHP TWGSADQAFI RLTQARYGEA DANGNRAINP
IFDGLDARTI SNILGTQEAG LPKAGNDANI FFMAMGQYID HGLDFLPKGG NGSIVIGAPG
GGAPGSNNPA DLTRGTVMAV DANGVPQHKN QTSPYIDQNQ AYGSNALVGQ FLRESDGAQG
VGMRLLAGAP DPSNPAFNLL PTLRELVNHH WQADTIFAGP DGPISFRTYY TNFALSEGVT
GTLFNTETGA FDPQVLTKLV GNFMGSGHPL LLDTNPFISV LDHFVAGDGR ANENFALTSI
HTVWARNHNY HVEKLLESGF EGTPEQVFQA AKMVNEAEYQ RVVFDEYLET LIGGLRSDGT
HGFEAYDPNV DVAISHEFAA AVFRFGHSLI GQTLNVKGAD GETVPVSLFD AFLNPSNDPS
VFTAPLPPGY VPQPGYAQYG VGGIIGGTIE QAAEDVDFNI VDAVRNDLVR IRADLFAFNV
ARGWDVGLGT LNQVRADLAA STNPYIRDAV GFAGGDLSPY ASWEDFQARN GLSDAVIAQF
RQAYPDLVLA AADIAAFRAI NGDIAIAMQA DGTGVVKGID RLDLWVGGLA EKHINNGVVG
QTFWVVLHEQ FDRLQDGDRF YYLERFDNFD FYENLVDGQG FSDIVARNTG LTVLPEHIFE
LSDEDGPGTE PGDDDDDDDG VTDPVGGDPD EDEDGPTDPV GGGGDAGGDE DEDDGVTDPV
GGGDDPGDDE DDGQGDGDGT TDPVGGGDGD EDGPGTGPGT NPPVNHAPGV IAGGANGDVL
NGTAGADTIL GLDGDDNILA GGGADVVRAG AGNDFVDAGE GRDVVFAGDG DDDVLSGGGA
DMVYGDGGND RILAGAGNDL VTAGAGRDTV IGGEGDDLFV AETGDGDDTY WGDEMGGGLG
SDTLDMSAIT ANIAVNLGTG LAGRGSATST QSGRDVLWGV ENVVTGSGND DITASDAANV
MDGGEGSDTY RFGSAAAANG DTIEGFRPGD KIDLSAIDAD AGLAGNQAFT LATGAAFTGV
GQLLVTQETR DDGDYVVVQG NTAGDASPEF KLAIKGNTAP TAADFTL
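
The protein sequence above corresponds to a translation of the gene sequence under translation table 11. As a rough sketch, the check below translates the first and last blocks of the gene listing and compares them with the ends of the protein listing. It assumes the standard codon assignments, which table 11 shares for sense and stop codons (table 11 differs mainly in the start codons it permits), and it does not handle ambiguous bases. The `translate` helper is hypothetical and written only for this illustration.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(blocked_dna: str) -> str:
    """Translate blocked DNA text codon by codon, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three blocks and last line of the gene sequence in this record.
print(translate("ATGGCCGTTA AGCTCAATCT TCAAGATCTG"))  # MAVKLNLQDL
print(translate("ACGGCCGCCG ACTTCACGCT CTGA"))        # TAADFTL
```

Both fragments agree with the start (MAVKLNLQDL) and end (TAADFTL) of the protein listing, which suggests the two sequences are in register.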