Gene Mvan_0269 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_0269 
Symbol 
ID4647567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp293453 
End bp304483 
Gene Length11031 bp 
Protein Length3676 aa 
Translation table11 
GC content70% 
IMG OID639803778 
Productbeta-ketoacyl synthase 
Protein accessionYP_951124 
Protein GI120401295 
COG category[C] Energy production and conversion
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases
[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR00128] malonyl CoA-acyl carrier protein transacylase
[TIGR00517] acyl carrier protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.180962 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATCCG CTGAACATCC GAACGGACCG GCCCCGCGTT TCGCCATCGT CGGCTACGCA 
GCGCGTTTCC CGGGCGCCCC GGATGCCGAC GCGTTCTGGG ACGTGCTGCG CGAAGGCCGC
GACGCGATAT CGGACGTCCC GAGGGACCGC TGGGATGCCG AGGAGTTCTT CGACCCGGAA
CCCGGCGCTC CCGGCAAGGT GGTGACCCGT CGTGCCGGTT TCGTCGATGA CGTGACGGGG
TTCGACGCGC CGTTCTTCGG CATGTCGACC CGAGAGGTCA GGATGATGGA CCCCCAGCAT
CGGCTGTTGC TGGAGACGGC GTGGCGGGCG GTGGAACACT CCGGGACCGC GCCGACGGCC
CTGGCCGGTA GCAACACCGG GGTGTTCGTC GGTCTGGCCA CCCACGATTA CCTCGGAATG
GCCTCGGACG AACTGACCTA CCCCGAGATC GAGGCCTACA TGGCCATCGG GACGTCGAAC
GCGGCGGCGG CGGGCCGGAT CAGCTACCGG CTGGGGCTGC AGGGTCCCGC GGTCGCCGTC
GACACCGCGT GCAGCTCGTC GCTGGTGGCG ATCCATCAGG CCTGCCAGGC GCTGCGCCTG
GGCGAATGTG ACCTCGCGCT GGCCGGCGGC GCGAACGTCC TGCTCACCCC GGCGACCATG
ATCACGTTCT CCAACGCGCA CATGCTCGCA CCCGACGGCC GGTGCAAGAC GTTCGACGCG
GCCGCCGACG GGTACGTGCG CGGTGAGGGT TGCGGCGTCA TCGTCATCAA GCGTCTGGAG
GACGCGGTCC GCGACGGTGA CCGAATCCGC GCGGTGATCC GCGGCAGCTC GATCAACCAG
GACGGCGCAT CGGGCGGCTT GACGGTTCCC AACGGCGTTG CCCAGCAACG TGTTATCGCC
GATGCGCTGA AGCGTTCCGA CCTCGAGCCC AAGGATGTCG GATACCTGGA AGCCCATGGG
ACCGGGACCT CGCTGGGCGA CCCGATCGAG GCGCAGGCCG CCGGTGCGGT GCTCGGCGCC
GGACGTGAAC CCAGCCGGCC GCTGTTGATC GGGTCGGCCA AGACGAACAT CGGGCACCTG
GAAGCGGCCG CCGGGATCGC GGGCGTCATC AAGGTCATCC TGTCGCTGGA GCACCGGACG
CTGCCCAAGC ACCTCAACTT CGAGAACCCG TCGCCGCACA TTCCGTGGGA CCGGCTCGCG
GTGGAGGTGG TCAAGGAGAC CATCCCCTGG GAGCGCGACG GCGGTCCCCG CATCGCGGGG
GTCAGCTCGT TCGGGTTCGC CGGGACCAAC GCGCACGTCA TCCTCGAAGA AGCGCCCGAG
CAGGCGGCGG CGCCCGGTCC GGTGCAGCAG CCCGACCGCC GCTTCAGCCT GCTGCCGCTG
TCCGCGCGGA CTCCCGCCGC CCTCACGCAG CTCGCCGGGC AGTACCGCGA CTGGTGGACC
GCTCACCCGG AGGCCACCCT GGCCGACGTG TGCTTCACCG CCGGCGCGGG GCGCGCCCAT
CTGGAACACC GGGCCGCGTT GGTGGTCAAT TCCGCCGAGT CCGCCGTCGA ACTGCTCGGT
GCGCTCGCCG ACGACCGCCC GGCGCCCGGC CTGGTTCGCG GCGAGTCGCA TGACACACCG
AAGACGGCGT GGCTCTTCAC CGGCCAGGGT AGCCAGTACC CGGGGATGGC ACGGGAGTTG
TACGACACCG AGCCGGTGTT CGCCGAGACG CTGAACCGGT GCGCCGAGGC AGTGGCCGAT
GTTCTCGAAA AGCCCTTGCT CGACGTCATT TTCGATGCAG AGGCGGACGA CGCCGAGAAG
ACCCTGCAGA ACACCTGCTA CGCGCAGCCC GCCCTGTTCG CCGTCGAGAT GGGGTTGGCC
CGTCTCTGGC AGTCCTGGGG TTTCGAACCC GACGTGGTGC TCGGGCACAG CGTCGGCCAG
TATTCGGCGG CCTGCGTCGC GGGCGTGTTC AGCCTCACCG ACGGCGCGCG GCTGATGGCC
GAACGCGGGC GCCTGTTCGG AAGCCTGCCC GAAGGTGGCC GCATGGTCGC GGTGTTCACC
ACCGCCGAGC GGGCCGAAGC CATGACCGAC GAGTTCCCGA GCCTGTCGGT CGCCGCTTAC
AACGGCACCA ACACCGTATT GTCCGGGCCC GCAGGGGATC TGGAGAAAGC GGTGGCCACG
CTTTCGGCCG ACGGTGTCCG GTGCGACTGG CTGGACACCA GCCACGCGTT CCACTCGGCG
CTGCTGGACC CGATTCTCGA CGACTTCGAG TCGTATGCGA ATCAGTTCGA TTTCGCGGCG
CCGCAACGGA TCCTGATCGA CAACCGCACC GGTACCGCGC TGGGCCGGAG CGTGCAGCTC
GATGGCACCT ACTGGCGCAG ACACGCACGT CAACCGGTGG AGTTCGCCAA GAGCGTACGC
ACCCTCGCCG AGATGAACTG CAAGCTTCTG GTGGAGATCG GGCCTCGACC GGTGCTGACC
GCCGCGGCCC TTGGGGCATG GCCCGACCCC GCCACCGCGC CGAGGGTGAT CGCGTCCCTG
CGCCGAACCG CGGCCGACCA CCGGCAGATC ACCGAAGCCG TCGCCGACGC ATACGTGTTG
GGCCACCTGC CCGAGTTCGG CGCGTTTCGA GACGCGGGCG CGCGAAAGCT GGACCTGCCC
ACGTATCCGT TCGAGCACCG CCAGTACTGG TTCAGAGACA ACCAGGATCC CGACGGGTCC
GAGGCGCTGC AGCCGCGCAC CCCGAGCACC GAGGCCGTCC GGCTCCTCGA GGACGGCCGC
ATCGAGGAGC TCGCGACACT GCTCGACGGC GCCGCCGGCG ACCAGCAGAC CCTGGACGTG
CTGAGCAGGC TTGCCGCACA GCACAATCAG CAACGCAGCA GCCAATCGAT CACGAACGAC
CGTTACGAGA TCCGCTGGGA GCAGTCCGCG GCGGCGCTCT CGGGCGCGGA GACCGACGAG
GTGTCGTCCT GGATCCTTGT CGGCGACGAC ACCGAGGCGA CCCGGCCGCT GGTCGACGTG
CTCAGCGCAC GCGGGCAGCG GTACCGGATC TTCGGCCTGC CGGTGTCCGA CGCGGACGAG
GAACAACTCG GCGAAGCGCT ACGCCTTGCG GCAGCGGAGG ATTCGACGCT GCGCATCGTC
CATGTCGGCG GGCTCGATTC CGACCCCGAC CCCGGTACGG CACCCTCGAT GCGGTCACTG
TTGCGGATGC AACACCGGAT CCTGGGCGGA ACCCGGCGAC TGTTCCGGGC TGCGGCCGCC
GCGGAACTGC GCACCCCCAT CTGGGTGGTG ACCCGTCGGG CGCAGCACGT CACCGCCACC
GACACGGTTG CCCCGGACCA GAGCTGCCTG TGGGGTTTCG GTCGTGCCGC CGCCCTGGAA
CTTCCGCAGT TGTGGGGCGG ACTCGCAGAT CTGGCCGACG GCACCGCCGA CGAGTGGTCC
CGACTGCTCA ACCGGATCAC GGCACCGCAC GGGCCGGCCG TCAGGGAAGA CCAGATCGCG
CTGCGCGATC ACGCGGTGTA TGTGCCCCGG CTGGTTCGAC GGGCCGGCCG GCCGGCCGGC
AAACCGCTGC AGCTGCGCGA CGACGCCACG TATCTGGTGA CCGGCGGTCT GGGCTCGATC
GGGCTGGAGA TCGCCGGATA TCTCGCCGCG CACGGCGCCC GCCACCTGGT GCTGACCAGC
CGGCGCGCAC CCGGTGATGC CACGCAGCAA CGTATCGACG CGCTGGGCGC ACAACACAGT
TGCCAGATCC GGGTCGTCGC CGCCGACGTC GCCGACGCGC ACGACGTCGC GCGCCTGCTG
GCAGCCGTGG CCGCGGAGCT ACCGCCGTTG GCCGGCATCG TGCACGCCGC GGGCGAGATC
GGCACCACCC CGCTGGTCAG CCTGGAAGAC AGCGAAGTGG ATCGCGTCTT CGCCGGGAAG
GTATGGGGTG CTTGGCATTT GAGCGAAGCG GCGGCTGACC TGCAGCTCGA CTTCTTCCTC
GGCACCTCGT CCATCGCATC GGTGTGGGGC GGGTACGGGC AGACCGCCTA CGGCGCCGCC
AACGCCTTCC TCGACGGGCT TGCCTGGCGG CTGCGCGAGC AGGGCATCTC CGGCGTCAGC
ATCAACTTCG GCCCGTGGTC GGCCGGCATG GCCGACGCGG AGTCGCGTGC CCGACTGGAC
CAGCGCGGAG TCCGGACCTT GTCACCCGCC GACGCATTGG CGGGGCTGGC CGACGTGGTG
ACAGCCGCGG AGGCACCGGG CCCTGCGCAG GGGATCGTCG CCCGCATCGA CTGGGAGCGT
TTCCTTCCGC TGTACCAGCA GGCGGGGCGG CGGGCGTTCC TGGCGGAGTT GGAGCGCGAG
GTGCCGGACA CCGCCCCGGC TGCCACGCCG TCGGGCAGGA CCGAACTGGT CGAACGGCTC
ACCAACGCCC CGGTGCAGCA ACGCAAGAAG CTGCTGACCG ACTACCTGCG TACCGCGGTG
GCCGAGATCA CCCGGGTGGA TGCCACGGAG ATCCGCGAGG ACGCGGGGTT CTTCGACCTC
GGCATGGATT CGCTGATGGC CGTCGAACTG CGGCGCCGCA TCGAACAGGG CGTGGGCCGC
GAGATCCCGG CCACCCTGGC GATGGATCAT CCGCGCCTGT CCGACGTGGC CGACTATCTG
CTCGGCGAGG TGCTCGGGCT TGCCGAGCAG GCACCCGCCA AGGCGGGGTC GCAGCCGGCG
TCGGCGGCGG CGAACCGCAC GGACGAACCG ATCGCGATCG TCGCGGTGTC GTGCCGCTTC
CCCGGCGCAC CGGACCCGGA GGCCTTCTGG GAGGTACTGG CCGGCGGTGT CGACGCGATC
CGCGAGGTCC CCGAGGACCG GTTCGACATC GACGAGTTCT ACGATCCGGA TCCCGACGCC
GCGGGCAAGA CCTACACGCG TTTCGGCGGA TTCCTGGACG GTATCGACGG ATTCGATCCC
GAGTTCTTCG GCATCTCCCC GCGTGAGGCC GTCTGGATCG AGCCGCAGCA GCGGTTGATG
CTCGAAACGG TGTGGGAGGG CCTGGAGAGG GCTGGGCTCG CGCCTGCCGA CCTGCGAGGC
AGCCGCACCG GAATCTTCGT GGGCGTGGCC GCCAACGAGT ATGCGCATCT GCTGTCGTCG
GAGTCGATCG AGAAGATCGA ACCCCACTTC ATCACCGGCA ACGCGCTCAA CGCCATCTCC
GGTCGGGTCG CGTTCGCGTT GGGACTCGAA GGCCCGGCGG TGGCGGTCGA CACCGCATGC
AGTTCGGCGC TGGTCGCCGT CCACCAGGCA TGCCAGGCAC TGCATTCCGG CGACTGCGAC
CTGGCGCTGG CCGGCGGCGT GAACGTCCTG CTGAGCCCGG TGACGAGCGT CGCCGCATCC
CGCGCCCGGA TGCTGTCCCC CGTCGGGCGG TGCAAGACCT TCGACGCCTC CGCCGACGGC
TACGTGCGCA GCGAAGGCTG CGGGATCCTG GTGCTCAAGA GGCTCGGCGA CGCGGTGCGC
GACGGCGACC GGGTGTGCGC GGTCATTCCC AGCAGCGCGG TGAACCAGGA CGGCGCTTCC
AGCGGCCTGA CTGTGCCCAA TGGTGGTGCA CAGCAACGCC TTATCGGGAT GGCGCTGGCG
CGCGCCGGCC TTTCGGGCGG GGATGTCGAC TACCTCGAGG CGCACGGGAC AGGCACCCCG
CTGGGTGATC CGATCGAGGT GCAGGCGGCC GCGGCCGCCT ACGGCGCCTC ACGTGACGCG
GACCGCCCGC TGCTGATGGG ATCGGTGAAG ACCAACATCG GCCACCTCGA GTCGGCTTCC
GGGGCAGCGG GTCTGATCAA GGTTGTGCTG TCGCTTCAGC ACAACCTGCT GCCGCAGAGC
CTGCACTTCG AGAATCCGTC GCCGCACATC CCGTGGGATT CGCTGCCGGT GCGGGTGGTG
GACAAGGCGA TTCCGTGGCA GGCCGACGGC AGGCCGCGGC GCGCCGGGGT CAGCTCGTTC
GGGTTCACCG GCACGAACGC GCATGTGCTG ATCGAGGAGG CGCCACTCCC GCAGGCGGCC
GACCCGGTCG AGGAACCGGA CACCCGCGCG CTGCCCGTCG GTGTGCTCGC GCTGTCCGCC
CGGTCACCGG AGGCGCTGAC GGCGCTGGCG CAGCGCTACG AGGCCTGGCT GAGCGCCCAC
CCCGACGCCG ATCTCGCCGA CGTCTGCCGC ACCGCCGGAA CGGGCCGGTC GCATTTCGAG
CACCGGGCCG CGCTGGTCGT CGATTCGGTC GCGGCCGCGC GCGAGGGCCT GGCCGAACTG
GCGCAGAACC GGCTGCGGCC CGGCGTCGTA CGTGGCGAAC ACACCCACCA CCCCACCACG
GCGTGGCTGT TCACCGGACA GGGCAGCCAG TTCCCCGGGA TGGCCCGTGA ATTGTTCGAG
ACCGAACCGG TTTTCGCCGA CGCCGTGACA CGCTGCGCGG ACGCAGTCAA GGACATACTG
CCGCGCCCAT TGCTGGAGGT GTTGTTCGCC GCCGATCGGG AATCCGGGGA AGCGTTGCGG
CACACGTCGT TCGCCCAGCC GGCGATCTTC GCCGTGGAGA TGGGGCTGGC CCGGCTGTGG
CAGTCGTGGG GCATCGAGCC CGACGTGGTG CTCGGGCACA GCGTGGGCCA GTACGCGGCG
GCCTGCGTGG CCGGCGTGTT CAGCCTCGAA GACGGGGCAC GGCTGATGGC CGAACGCGGC
CGGATGTTCG GAAGCCTGCC CGAGGGCGGG CGGATGGTCG CGGTGTTCAC CGACGCGAAG
CTCGTCGAGG AGATCGCAGG AGACTTCCCG CGGGTGTCGG TCGGCGCCTA CAACGGACCC
AACACCGTGC TCTCGGGCCC TGGCGAGGAT CTGGAACAGA TCGTGGACAG GTTCGGCGAC
GACGGTGTCC GTTGCACCTG GCTGCAGACC AGCCACGCCT TCCACTCGGA GCTACTGGAT
CCGGTGCTCG ACGAATTCGA GTCCTATGCG GCACAGTTCC AGTTCGCCAC CCCGACCTTG
CCGCTGGTCT GCAACCGTAC CGGCACCGTG CTCACGGCCC AGACCCCCCT CGACGCCCAG
TACTGGCGGC GGCACTCCCG CCAACCGGTG CAGTTCGCCG AAAGTGTGCG CACCGTGGCG
GCGCTGGGAT GTTCGGTGCT GATGGAGATC GGTCCGCAGC CGGTGTTGAC CGGGTCCGCG
GTGCAGGTCT GGCCGGAGCA CCTGGCCGCA CCCCGGGCGA TCGTCTCGCT CCGCAAGGGT
GTCAGCGACC GCCGCCAGAT CACCGAGGCG CTGGCCGCGG CCTACGTGGG CGGCCACCGG
CCCGATTTCG GTGCGCTGTA TCGCCGGCCG GGTCGCGCAG TCGCGTTGCC CACGTATCCG
TTCCAGCGTC GCCGGTTCTG GCCCAAGACG TCCGGAATCA CCACCGATGG TCCGGCGGTC
TCCGGCATCC TCGGCAGCGC CAAGGACCTT GCCAGCGGTG ACACGGTCTA CACCAGCAGA
TGGTCGGTCA GATCGCAGCC GTGGCTCGCC GACCACGTCA TCTACGGCAC CGTCGTCGTC
CCCGGCGCCA CCTACGCGGC GATGGCGCTG GCCGCGGTCG GCACCCCGGC CCGGGTGAAG
GACGTCTTCT TCTACGAGCC GATCATCCTG CCCGAGAAGG CTTCCCGCGA GGTGCAGCTG
ACTCTTCACC CGTCCGAGGA CGGCGGACAG AAGTTCCAGG TGCACAGCCG GGAGTACGGC
GAACGCGGCA CCGAATGGTC ACTGAACGCC GAAGGCACCG TGGCCAGCGG TGTCGACGAG
AACCCCCAGG CCGAGCCGTC AGGTCCCGTC GACGAGGCCA TCGAGCGACT GAACCGGATG
CGCCCACAGG ACCTGTTCGA GACGTTCGCC GACATGGAGC TGGCATGGGG CCCGACCTGG
TCCGGTTCCC TGAAATCGCT GTGGCTCGGC GAGGGTGAGG CGATCGGCGA CATCCTCGTC
GGTGAAGAAC TCGCCGAGCA ACTCGGCACC GAGCCGATGC ACCCGGTGCT GATGGATCTG
TGCACCGGCG TCGCGTTCCC GGCGTTCCCT GCGCTCCTCG CGGCCGAACA GGGCGTCAGC
GACCTGTTCC TCCCGCTGCG CTACGGCCAG GTGACGTTGC GGGACAAGAT GCCACGCAGG
TTCTACTGCC GCGCCACCTG GCACACCAGC GAACTCGACA GCGAGACCCA GGTTTTCGAT
CTCGACTTCC TCGACCGCGA TGGCCGTCAC CTCGGCGGGA TTCGCGAGTT CACGGTCAAA
CGCGCCCCCC GCGAGGCGTT ACTCCGCGGC CTCGGCGGCG ACGCCACCCG CCTGCTCTAC
ACCCTCGGCT GGCACGAAGT GCCGGCACCG GCATCCGGTG ACGCCGCGCT GAACGGCAAC
TGGTTGATCG CCGGGTTCGA CGAACTGGCA GCCGGCGTGC CTGGCTGCAT CCCGTTCGAC
CGGAGCACCG ATCCGGAACC CCTGGGACAG CTGCTGGCAC AGGCGCACGA ACGCGGGATC
GGCTTCTCCG GCGTCGTCTG GCGCGCCGCG GCGCCGAGCG CCGACGAGTC GAGCACCGCG
ATGGCAGCGC GGATCGAGAC CGAGATCGCG GACCTGCTCA GCGCGGTGCA CACGGTGCAG
AACGGCGCCG TGAAGCTGCC CGGCGGACTG TGGATCGTCA CCGAACGCGC CGTGGCCACC
GAATCCGGTG AACCCGTCGA TCCGGTGCAG GCCGCACTGT GGGGATTCGG GCGCACCACC
ATCAACGAGG AACCGGCGCT GCGCTGCAGG CTGGTCGACG TCGACGGATC ACCGGAGGCC
GTCCTGGCGC TGGCCGGCCT GCTGGCCGCT CCGGTCGACG AGCCGGAACT CGCTGTGCGC
CAAGGGAAGT TGCTGGCCTC GCGGTTGCTG CCGTGGGCGC GCAGCGGTCA CCTCACGGTG
CCCCGGTCCG CCGACTACGT GCTGGCCCCC ACCGAACGCG GAGCGATCGA CAACCTGCGT
CTGACCGAGA CGGACGTGGC ACCCCCGGCA GAGGGTTACG TGCAGGTGCG GGTGGAGGCC
GCGGGCCTCA ACTTCCGCGA TGTGCTCAAC GTCCTCGGCC TCTACCCGGG CGACCCCGGA
CCGATCGGCG GCGACTTCGC GGGCACCGTC ACGCAGTTGG GCAGCGGCGT CACCGGGCTC
GAAGTGGGCC AGCGCGTCTA CGGTTTCATG CAGGGCGCCT TCGCCAGCCG GTTCAACGTG
CCCGCACAGC TGCTGGCCCC GATTCCCGAC GGGATCGGCG CGGTGGACGC GGCGACCATT
CCCGCCGCGG CGCTCACGGC CCGCCTCGCG TTCGACTGGG CGCAGCTCAA GCCCGGCGAC
CGTGTGCTCA TCCATGCCGC CAGCGGCGGC GTCGGGCTGG CCGCCATCCA GATGGCGCAG
CAGCACGGCG CCGTCGTCTT CGCCACGGCC AGCACCTACA AACGCGCCAC GCTGCGCAAG
CTGGGAGTCG ATCACGTCTA CGACTCGCGC ACCACGGAAT TCGCCGACCA GATCCTGGCC
GACACCGACG GCGAGGGCGT CGACGTCGTC CTCAACAGCC TCACCAACGA GGGCTTCATC
GAGGCGACCG TGCGGGCCAC CGCACAGAAC GGCCGGTTCG CCGAGATCGC CAAACGCGAC
ATCTGGACAC CGGAGCAGAT GGCGGCGGCC CGACCCGACA TCGCCTACGA GATCGTGGCG
CTGGACACGG TGACCCTGCT GGAGCCCGAA CGCATCCGGG GCCTGCTCGG TGAGGTGTCC
GACGGGCTGG GCAAGGCCGA TGGCTCCTCG CGCGCCGAAT GGGTGCCGCT GCCCGCCGAG
ATCTACCCGC TGACCGAGGC CAGGGCCGCG TTCCGCCGCA TGCAGCAGGC CCGGCACGTC
GGCAAGATCG TGTTGCAGAT GCCGAAACCG TTGCAGCCGC GCGCCGATCG CAGTTACCTG
ATCACCGGCG GTCTCGGTGC GATCGGTCTG CACACGGCGG CATACCTGGC CCAACTCGGC
GCCGGTGACC TCGTGTTGAC CAGTCGGCGC GCGCCCGACG CGGACGCGCG GCGCGCGATC
GAGGAGATCA CCGAGCGGTA CAAGTGCCGC GTGCACACCT TCTGCGCCGA CGTCGGGGAC
GAGTCCCAGG TGGCCGAGCT GCTGGCGCGA ATCCGCGCGG AGCTGCCGCC GCTGGCCGGG
GTGGCACATC TCGCGGGCGT GCTCGACGAT GCGCTGCTCT CCCAGCAGAG CCTGGAGCGC
TTCCGGACGA CGTTGGCTCC CAAGGCGTTC GGTGCCTGCC ACCTCGATCG CCTGACCGCG
GGCGACGATC TGGACTTCTT CATCATGTCC TCGTCGGTGT CCAGCCTGTT CGGTTCGCCC
GGCCAGGCCA ACTATGCGAC GGCCAATGCA CTGCTCGACG GCCTGACCGC GCACAGACGC
GCCCGGGGCC TGCCGGCCAC GGGCGTCAAC TTCGGTCCAT GGGCCCAGGG CGGGATGGCT
TCGTCGGAGG CCGCGACCGC CAACATCGGT GCGCAGGGCC TGGTTCCGCT GGAACCGTCG
GCAGCGCTGG GCGCCCTTGC CGAGGTGCTC GCCAACGGAA CCGGGCAGGC GGCCGTGCTC
AAGGCCAACT GGCAGCGCGC CGCGAAGGTG CTGGGAAGTT CCCGGCCACC GATTCTCGAT
CTGGTGCTGC CTCGCCCGGA GGGCGAGGTG GCCGGGGACA GCGAACTGCT ACGGCAGCTG
CAGGAGATAC CCGTCGCGCA GCGGGCGGGA TTCGTGACCG AATTCCTGCA GCGGGAGGTG
CAGAACTTCC TGCGGCTCGC GCAGCCCCCG GCCGCCACGA GCCGGTTCCT GGACCTGGGC
ACGGACTCAC TGATGGCGAT CGAGCTTCGC AACCGGTTGC ACAGCCAGTT CGGCGGTGCG
TTCACCATCA ACGCGACCGC GGTGTTCGAC TATCCGACCA TCGGGGGGCT CGCCGAGTAC
CTGGTGGGTC AGCTACCCGA CTCCGACGCT GCGGCGGGCG AGACGCCGGC CACCGAGACG
TCACCGATTG ACCACAATCA CCCTGACCGG GAAGCTGATT CGCCGGGGTA G
 
Protein sequence
MGSAEHPNGP APRFAIVGYA ARFPGAPDAD AFWDVLREGR DAISDVPRDR WDAEEFFDPE 
PGAPGKVVTR RAGFVDDVTG FDAPFFGMST REVRMMDPQH RLLLETAWRA VEHSGTAPTA
LAGSNTGVFV GLATHDYLGM ASDELTYPEI EAYMAIGTSN AAAAGRISYR LGLQGPAVAV
DTACSSSLVA IHQACQALRL GECDLALAGG ANVLLTPATM ITFSNAHMLA PDGRCKTFDA
AADGYVRGEG CGVIVIKRLE DAVRDGDRIR AVIRGSSINQ DGASGGLTVP NGVAQQRVIA
DALKRSDLEP KDVGYLEAHG TGTSLGDPIE AQAAGAVLGA GREPSRPLLI GSAKTNIGHL
EAAAGIAGVI KVILSLEHRT LPKHLNFENP SPHIPWDRLA VEVVKETIPW ERDGGPRIAG
VSSFGFAGTN AHVILEEAPE QAAAPGPVQQ PDRRFSLLPL SARTPAALTQ LAGQYRDWWT
AHPEATLADV CFTAGAGRAH LEHRAALVVN SAESAVELLG ALADDRPAPG LVRGESHDTP
KTAWLFTGQG SQYPGMAREL YDTEPVFAET LNRCAEAVAD VLEKPLLDVI FDAEADDAEK
TLQNTCYAQP ALFAVEMGLA RLWQSWGFEP DVVLGHSVGQ YSAACVAGVF SLTDGARLMA
ERGRLFGSLP EGGRMVAVFT TAERAEAMTD EFPSLSVAAY NGTNTVLSGP AGDLEKAVAT
LSADGVRCDW LDTSHAFHSA LLDPILDDFE SYANQFDFAA PQRILIDNRT GTALGRSVQL
DGTYWRRHAR QPVEFAKSVR TLAEMNCKLL VEIGPRPVLT AAALGAWPDP ATAPRVIASL
RRTAADHRQI TEAVADAYVL GHLPEFGAFR DAGARKLDLP TYPFEHRQYW FRDNQDPDGS
EALQPRTPST EAVRLLEDGR IEELATLLDG AAGDQQTLDV LSRLAAQHNQ QRSSQSITND
RYEIRWEQSA AALSGAETDE VSSWILVGDD TEATRPLVDV LSARGQRYRI FGLPVSDADE
EQLGEALRLA AAEDSTLRIV HVGGLDSDPD PGTAPSMRSL LRMQHRILGG TRRLFRAAAA
AELRTPIWVV TRRAQHVTAT DTVAPDQSCL WGFGRAAALE LPQLWGGLAD LADGTADEWS
RLLNRITAPH GPAVREDQIA LRDHAVYVPR LVRRAGRPAG KPLQLRDDAT YLVTGGLGSI
GLEIAGYLAA HGARHLVLTS RRAPGDATQQ RIDALGAQHS CQIRVVAADV ADAHDVARLL
AAVAAELPPL AGIVHAAGEI GTTPLVSLED SEVDRVFAGK VWGAWHLSEA AADLQLDFFL
GTSSIASVWG GYGQTAYGAA NAFLDGLAWR LREQGISGVS INFGPWSAGM ADAESRARLD
QRGVRTLSPA DALAGLADVV TAAEAPGPAQ GIVARIDWER FLPLYQQAGR RAFLAELERE
VPDTAPAATP SGRTELVERL TNAPVQQRKK LLTDYLRTAV AEITRVDATE IREDAGFFDL
GMDSLMAVEL RRRIEQGVGR EIPATLAMDH PRLSDVADYL LGEVLGLAEQ APAKAGSQPA
SAAANRTDEP IAIVAVSCRF PGAPDPEAFW EVLAGGVDAI REVPEDRFDI DEFYDPDPDA
AGKTYTRFGG FLDGIDGFDP EFFGISPREA VWIEPQQRLM LETVWEGLER AGLAPADLRG
SRTGIFVGVA ANEYAHLLSS ESIEKIEPHF ITGNALNAIS GRVAFALGLE GPAVAVDTAC
SSALVAVHQA CQALHSGDCD LALAGGVNVL LSPVTSVAAS RARMLSPVGR CKTFDASADG
YVRSEGCGIL VLKRLGDAVR DGDRVCAVIP SSAVNQDGAS SGLTVPNGGA QQRLIGMALA
RAGLSGGDVD YLEAHGTGTP LGDPIEVQAA AAAYGASRDA DRPLLMGSVK TNIGHLESAS
GAAGLIKVVL SLQHNLLPQS LHFENPSPHI PWDSLPVRVV DKAIPWQADG RPRRAGVSSF
GFTGTNAHVL IEEAPLPQAA DPVEEPDTRA LPVGVLALSA RSPEALTALA QRYEAWLSAH
PDADLADVCR TAGTGRSHFE HRAALVVDSV AAAREGLAEL AQNRLRPGVV RGEHTHHPTT
AWLFTGQGSQ FPGMARELFE TEPVFADAVT RCADAVKDIL PRPLLEVLFA ADRESGEALR
HTSFAQPAIF AVEMGLARLW QSWGIEPDVV LGHSVGQYAA ACVAGVFSLE DGARLMAERG
RMFGSLPEGG RMVAVFTDAK LVEEIAGDFP RVSVGAYNGP NTVLSGPGED LEQIVDRFGD
DGVRCTWLQT SHAFHSELLD PVLDEFESYA AQFQFATPTL PLVCNRTGTV LTAQTPLDAQ
YWRRHSRQPV QFAESVRTVA ALGCSVLMEI GPQPVLTGSA VQVWPEHLAA PRAIVSLRKG
VSDRRQITEA LAAAYVGGHR PDFGALYRRP GRAVALPTYP FQRRRFWPKT SGITTDGPAV
SGILGSAKDL ASGDTVYTSR WSVRSQPWLA DHVIYGTVVV PGATYAAMAL AAVGTPARVK
DVFFYEPIIL PEKASREVQL TLHPSEDGGQ KFQVHSREYG ERGTEWSLNA EGTVASGVDE
NPQAEPSGPV DEAIERLNRM RPQDLFETFA DMELAWGPTW SGSLKSLWLG EGEAIGDILV
GEELAEQLGT EPMHPVLMDL CTGVAFPAFP ALLAAEQGVS DLFLPLRYGQ VTLRDKMPRR
FYCRATWHTS ELDSETQVFD LDFLDRDGRH LGGIREFTVK RAPREALLRG LGGDATRLLY
TLGWHEVPAP ASGDAALNGN WLIAGFDELA AGVPGCIPFD RSTDPEPLGQ LLAQAHERGI
GFSGVVWRAA APSADESSTA MAARIETEIA DLLSAVHTVQ NGAVKLPGGL WIVTERAVAT
ESGEPVDPVQ AALWGFGRTT INEEPALRCR LVDVDGSPEA VLALAGLLAA PVDEPELAVR
QGKLLASRLL PWARSGHLTV PRSADYVLAP TERGAIDNLR LTETDVAPPA EGYVQVRVEA
AGLNFRDVLN VLGLYPGDPG PIGGDFAGTV TQLGSGVTGL EVGQRVYGFM QGAFASRFNV
PAQLLAPIPD GIGAVDAATI PAAALTARLA FDWAQLKPGD RVLIHAASGG VGLAAIQMAQ
QHGAVVFATA STYKRATLRK LGVDHVYDSR TTEFADQILA DTDGEGVDVV LNSLTNEGFI
EATVRATAQN GRFAEIAKRD IWTPEQMAAA RPDIAYEIVA LDTVTLLEPE RIRGLLGEVS
DGLGKADGSS RAEWVPLPAE IYPLTEARAA FRRMQQARHV GKIVLQMPKP LQPRADRSYL
ITGGLGAIGL HTAAYLAQLG AGDLVLTSRR APDADARRAI EEITERYKCR VHTFCADVGD
ESQVAELLAR IRAELPPLAG VAHLAGVLDD ALLSQQSLER FRTTLAPKAF GACHLDRLTA
GDDLDFFIMS SSVSSLFGSP GQANYATANA LLDGLTAHRR ARGLPATGVN FGPWAQGGMA
SSEAATANIG AQGLVPLEPS AALGALAEVL ANGTGQAAVL KANWQRAAKV LGSSRPPILD
LVLPRPEGEV AGDSELLRQL QEIPVAQRAG FVTEFLQREV QNFLRLAQPP AATSRFLDLG
TDSLMAIELR NRLHSQFGGA FTINATAVFD YPTIGGLAEY LVGQLPDSDA AAGETPATET
SPIDHNHPDR EADSPG