Gene Mmcs_0244 details


Gene Information

Locus tag: Mmcs_0244
Symbol:
ID: 4109090
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 271988
End bp: 283069
Gene Length: 11082 bp
Protein Length: 3693 aa
Translation table: 11
GC content: 70%
IMG OID: 638029369
Product: beta-ketoacyl synthase
Protein accession: YP_637421
Protein GI: 108797224
COG category: [C] Energy production and conversion
    [Q] Secondary metabolites biosynthesis, transport and catabolism
    [R] General function prediction only
COG ID: [COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases
    [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID: [TIGR00128] malonyl CoA-acyl carrier protein transacylase
    [TIGR00517] acyl carrier protein
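
The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon (assumption).
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(271988, 283069)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length and Protein Length fields.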


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCACA GGGGTCGTGC ACTGACCGCA GACGCAAGGG AGGACACGAA CGGCATGGAG 
TCTGCCGCAT ATCCGCACGT GCCGGCCAAC CGATTTGCCA TCGTCGGCTA CGCGGCTCGG
TTTCCAGGGG CACAGGACGC AGACGAGTTC TGGGACCTTT TGCGGGACGG TCGCGAGGCG
ATTTCGGAAG TCCCCCAAGA CCGCTGGAAT GTCGACGAAT TCTTCGACCC GGAACCGGGG
GCGCCCGGTA AGGTCGTGAC CCGCCGTGCG GGGTTCGTCG ACGACGTGAC CGGGTTCGAC
GCGCCGTTCT TCGGCATGTC GACGCGCGAG GTCAGGTTGA TGGACCCGCA GCACCGGCTG
CTGCTCGAGA CGGCGTGGCG TGCGGTCGAA CATTCGGGCA CCGCGCCAAC GGATCTGGCC
AACAGCAACA CCGGAGTCTT CGTCGGTCTG GCCACCCACG ACTACCTGGG CATGGCTTCG
GACGAGCTCA GCTACCCCGA GATCGAGGCC TACATGGCCA TCGGTACGTC GAATGCGGCG
GCGGCAGGCC GGATCAGCTA TCGGCTGGGC CTGCAGGGGC CCGCGGTCGC CGTCGACACC
GCGTGCAGCT CGTCGCTGGT GGCCATCCAC CAGGCGTGCC AGGCGCTGCA ACTGGGCGAA
TGCGACCTCG CGCTGGCCGG CGGTGCGAAC GTCCTGCTCA CCCCCGCCAC CATGATCACG
TTCTCCAACG CGCACATGCT CGCACCCGAC GGCCGGTGCA AGACCTTCGA CGCCGCCGCG
GACGGTTACG TACGCGGTGA GGGCTGCGGC GTCATCGTCA TCAAACGCCT CGAGGACGCG
CTGCGCGACG GCGACCGGAT CCGGGCGGTG ATCCGCGGCA GCGCAATCAA CCAGGACGGT
GCTTCCGGCG GTCTGACCGT GCCGAACGGT GTTGCCCAGC AGAGGGTTAT CACCGATGCG
CTCAAACGAG CGGGTGTTCG GGCCGGGGAT GTCGGATATC TGGAGGCGCA CGGCACCGGA
ACCTCGCTGG GCGACCCGAT CGAGGCGCAG GCCGCGGGTG AGGTGCTCGG CGCCGGACGC
GAACCGAGCC GGCCGCTGCT GATCGGCTCG GCGAAGACGA ACATCGGGCA CCTGGAAGCG
GCCGCGGGGA TCGCCGGCGT CATCAAGGTG ATCCTGTCGC TCGAACACGA GACCCTGCCG
AAGCACCTCA ACTTCACCAC CCCGTCGCCG CACATCCCGT GGGACCGGCT CGCGGTGAAG
GTCGTGACGG AGTCCACGCC CTGGGAGCGC AACGGCCGGC CGCGCATCGC CGGGGTGAGC
TCGTTCGGGT TCGCCGGCAC CAACGCCCAC GTCATCCTCG AAGAGGCACC CGAGTCGACC
GTCACGCCCG CGGCGACCCC CGAACCCGGC GAGCGGTTCA GCCTCCTTCC CCTCTCGGCG
CGGACACCGG CCGCCCTGGT GCGGATCGCC GATCGGTACC GCAGCTGGAT GACCACCCAT
CCGGAGGCCA CGCTGGCCGA CGTCTGCTTC ACCGCGAGCA CGGCGCGCGC GCACCTGGAG
CAGCGAGCCG CACTGGTGGT CGATTCGCGG GAGTCGGCGG TCGAACTGCT CGGTGCGCTG
GCCGACGACC GACCGGCACC CGGCCTGGTG CGGGGCGAAT CCCATGACAC ACCGAAGACG
GCGTGGCTGT TCACCGGGCA GGGCAGCCAG TACCCGGGCA TGGCGCGGGA GCTGTTCGAC
ACCGAACCGG TGTTCGCCGA GACGATGAAG CAGTGTGCGG CCGCGGTCGC CGATGTGCTC
GAAAAGCCGT TGCTCGACGT GATATTCGAT GCCCACGGTC CAGAGGCTGC CGAGACGTTG
CGGCAGACCT CGTACGCGCA GCCCGCCCTG TTCGCCGTCG AGATGGGCCT GGCCCGGCTC
TGGCAGTCGT GGGGCTTCGA ACCCGATGTG GTGCTGGGCC ACAGCGTCGG CCAGTACGCG
GCGGCCTGCG TCGCGGGTGT GCTCGGCCTC GAAGACGGTG CGCGGCTGAT GGCAGAACGC
GGCCGCCTGT TCGGCAGCCT GCCCGCGGGT GGCCGCATGG TCGCGGTGTT CACCGCCGCC
GAACGGGTCG AGAACATCAC CGACGAGTAC CCGCGCCTGT CGGTCGCGGC CTACAACGGC
GCCAACACCG TATTGTCCGG CCCGGCAGGC GATCTGGAAA AGGCCGTCGC CACGCTGGCG
GCCGACGGTA TCCGGTGCGA CTGGTTGGAC ACCAGCCACG CTTTCCACTC GGCGCTGCTC
GACCCCATCC TCGACGAGTT CGAGTCGTAC GCGCAGCAGA TCGAATACGC CGCCCCGCAA
CGGGTTCTGA TCGACAACCG CACCGGCGCC GCACTCGGCT GGAGCGCGAA ACTCGACGGC
ACCTATTGGC GCAGGCACGC GCGCCAACCC GTGGAGTTCG CCAAGAGCGT GCGCACCCTT
GCCGAACTGA ACTGCAAGGT CCTGCTCGAG ATCGGTCCGC GACCGGTGCT CACGGCCGCA
GCCCTTTCGG CCTGGCCCGA CCCCGCCACC GCGCCGCGGG CGATCGCCTC GCTGCGGCGC
AACACCGCCG ACCACCGGCA GATCACCGAA GCCCTCGCCG ACGCCTACGT CCTGGGCCAC
CGGCCCGATT TCGCGGCGAT TCAGCGGCCG GACGCGCACA AGCTCGACCT GCCGACGTAT
CCGTTCGAGC ACCGCCAGTA CTCGTTCCGG GACAACCAGG CGGAACGTCC CGAACAGGCA
GGCCCGCCGA TCCATCAGGC CGCGCGCACC GAGGCGGTCG GCCTCCTCGA GGACGGCCGG
ATCGAGGAAC TCGCGGCCCT GCTCGGCGAC ACCGACGGTG ATCGGCAGAC CTTCGACGTG
CTGAGCAAGC TTGCGGCACA ACACAACCAA CAGCGTACGA GTCAGTCGAT CACCGATGAC
CGCTACGAGA TCCGCTGGGA GGAGCTCACC ACCGCATCGC CGGCGGCGAC CGGCGAGCCG
TCCACGTGGA TCATCGTCGG CGACGACACC GACGCCGCCC GTCCGCTGAT CGACGCGGTG
ACCGCCCGCG GCGACCGCCA CCGGCTCGTC GGGTCACCGG TGTCCGACGC CGACGAGGCG
TCCCTCGCGG ATGCGTTGCG CGCCTCAGTG GACGACGCAT CGGCGCCAGG GGGCGCTGTC
CGCATCCTGC ACATCGCGGC CCTCGACGCC ACCACCGCAC CGTCGATGCG GACCCTGCTG
AGCATGCAGC ACCGGATCCT CGGCAGAACC CGACGGCTCT TCCATGCCGT GACCACCACC
GGACTGCGCA CCCCGATCTG GCTGATCACC CGTGGCGCAC AACGAGTCAC GGCCACCGAC
ACCGTCGCGC CCGATCAGAG CGCGTTGTGG GGATTCGGAC GGGCCGCGTC GCTGGAGCTT
CCGCACCTGT GGGGCGGGCT GGCCGACCTG CCGACCGGTC CCGGTGCGAG CGAAGACGAA
TGGTCCCGGC TCCTCGACCG GATCGCCGCG CCGCGACACT CGGATGTCAC CGAAGACCAG
GTCGCGCTGC GCGACGGCGC AGTCCACGTG CCCCGGCTGG TCCGGCGGAC CGGACAGCCC
AGCGGTGCGC CGCTGCGGCT GCGCAGCGAC GCAACGTATC TCGTGACCGG CGGGCTGGGT
GCGATCGGCC TGGAGATCGC GGGATACCTG GCCGCGCACG GTGCCGGGAA CATCGTGCTG
ACCAGCAGGC GCGCACCCGG CGATGCCGCG CAACAGCGCA TCGACGCGCT GCGCGACAAG
TTCGGCTGCG CGATCCGGGT GGCCACCGCC GACGTCGCCG ACGCGCACGA CGTGGCACGC
CTGTTGGCGG GTGTGCAGGC CGAGCTACCG CCGTTGGCCG GCATCGTCCA CGCCGCCGGT
GAGATCGGCA CCACCGCACT GAGCGCGATG GACGACGAAT CCCAGCAAGC CGAGGTCGAT
CGCGTATTCG CCGGAAAGGT CTGGGGCGCC TGGCATCTCA GCGAGGCGGC GGTCGACCTG
CAGCTCGACT TCTTCATCAG CACCTCGTCG ATCGCCTCGG TCTGGGGTGG GTTCGGTCAG
ACCGCCTACG GGGCGGCGAA CGCCTTCCTC GACGGACTGG CCTGGCGCCT GCGCGAACAG
GGTATCGCCG CGACCAGCGT CAACTTCGGT CCGTGGTCGG CGGGGATGGC CGACGCGGAG
TCCCGCGCGC GCCTCGAGCA GCGCGGAGTC CGGACCCTCG ACCCGGCCGA TGCACTGGCC
GGCCTGGCCG ACGTCGTGGC GGGTCCTGCG ACTCAGGGCG TGATCGCGCG GATCGACTGG
GCCCGTTTCC TGCCGCTGTA CCAGCAGGCG GGCAGGCGCG CGTTCCTGAC CGAGTTGGAG
CGCGAGGCGC CGGTCGCGTC GACGGATGCG GCGCCGGCGG TGACGGCATC CGGGAAGACC
CCACTGGTCG AGCGGCTCTC GGGTGCCCCG GTGCAGCAGC GCAAGAAGCT GCTCACCGAC
TATCTGCGGG ACGCGGTCGC CGAGGTGACA CGCGTCGATT CCGCCGAGAT CCGCGAGGAC
GCAGGGTTCT TCGACCTCGG GATGGATTCG CTGATGGCCG TCGAACTGCG GCGCCGCCTC
GAACAGGGTG TCGGCAAGGA GATCCCGGTC ACCCTGGTGA TGGACCATCC CCGCCTGTCC
GACGCGGCCG ACTACCTGCT CGGCGAGGTG CTCGGCCTGG CCGAACAGGC GCCCGCCGAA
TCCCGGCCCG CGCTGGCGTC TCTGGCCTCC GAGCGCACAG ACGACCCGAT CGCGATCGTC
GCGGTCTCGT GCCGGTTCCC CGGCGCTCCC GACCCGGAAG CCTTCTGGGA TCTGCTCTCC
GGTGGTGTCG ACGCGATCCG GGAGGTCCCG GAGGATCGGT TCGACATCGA CGAGTTCTAC
GACCCGGATC CGGACGCCGC AGGCAAGACC TACACGCGCT TCGGCGGATT CCTCGACGGG
ATCGACGGAT TCGACCCCGA GTTCTTCGGC ATCTCCCCCC GTGAGGCCGT CTGGATCGAA
CCGCAGCAGC GACTGATGCT CGAAACGGTC TGGGAGGGCA TCGAAAGAGC CGGCCTCTCC
CCGGCGGACC TGCGGGGCAG CCGGACCGGG GTCTTCGTGG GCGTGGCCGC CAACGAGTAC
GCCCACCTGC TGTCGTCGGA GTCGATCGAG AAGATCGAGC CCCACTTCAT CACCGGTAAC
GCGCTCAACG CCATCTCCGG CAGGGTCGCC TTCGCGCTGG GCCTCGAGGG TCCGGCGGTC
GCGGTGGACA CCGCGTGCAG TTCGGCGTTG GTGGCCGTCC ACCAGGCCGT TCAGGCACTG
CACTCCGGGG ACTGCGACCT GGCAGTGGCC GGCGGGGTGA ACGTCCTGCT CAGCCCGGTG
ACGGTGGTCG CCGCCTCGCG CGCGCGGATG CTCTCCCCCG TCGGTCGGTG CAAGACCTTC
GACGCCTCCG CCGACGGCTA CGTGCGCAGC GAAGGCTGCG GCATCCTGGT GCTCAAGCGA
CTCAGCGACG CGGTGCGCGA CGGAGACCGG GTCTGCGCGG TCATCCCCGG CACCGCTGTG
AACCAGGACG GCGCCTCCAG CGGTCTGACC GTGCCCAACG GTGGTGCGCA GCAACGCCTC
ATCAAGACCG CACTGACCCG CGCCGGCCTG ACGGGCGGTG ACGTCGACTA CCTCGAGGCA
CACGGGACGG GCACCCCGCT GGGTGATCCG ATCGAGGTGC AGGCGGCCGC CGCCGCCTAC
GGCGCCTCCC GTGACGCGGA CCGGCCACTG CTGATGGGGT CGGTCAAGAG CAACATCGGT
CACCTCGAAT CCGCCTCGGG CGCAGCAGGT CTGATCAAGG TCGTGTTGTC GCTGCAGCAC
GGCGTGCTGC CGCAGAGCCT GCACTTCGAC AACCCGTCGC CGCACATCCC GTGGGATTCG
CTGCCGGTGC GGGTCGTCGA CGAGGCGGTG CCGTGGCAGC CCAACGGCAG GCCGCGGCGC
GCCGGGGTGA GTTCCTTCGG GTTCACCGGC ACGAACGCGC ACGTGTTGGT AGAAGAGGCG
CCGCAGGCAC CGGTGTCCGA GGACCAGGAG TCCGACACCG ACGATCAGCC CGTCCACGTC
CTCCCGCTGT CTGCCCGGTC ACCGGAGGCA CTGGTGGCGT TGGCCCGGCG TTACGACTCG
TGGCTGGGCG CGCACCCGGA CGCCGACCTC GCCGGCGTGT GCTTCACGGC CGGTACGGGT
CGTTCGCATT TCGAACATCG CGCGGCGATG GTCGTGGATT CGGTCGCCAG CGCCCGCCAG
GGTCTGGCCG ACCTGGCCGA CAACCGCACC CGCCCCGGTG TCGTGCGGGG TGAGCACACG
AACCGCCCGA CGACCGCGTG GTTGTTCACC GGACAGGGCA GCCAGTACCC CCGGATGGCG
CGCGAGTTGT TCGACGCCGA ACCGGTTTTC GCGGAAACCG TGACGCGATG TGCGGACGCG
GTCGACGGTA TGCTGCCGCT TCCGCTGCTC GAGGTGCTGT TCGCCGCTGA CCGCGAGACC
GCCGAACGGT TGCGGCACAC CTCGTACGCG CAGCCCGCGC TGTTCGCGGT CGAGATGGGC
CTGGCCCGGC TGTGGCAGTC GTGGGGTGTC ACACCCGACG TGGTGCTGGG GCACAGCGTG
GGCCAGTACG CCGCGGCGTG CGTGGCCGGG GTGTTCAGCC TCGAGGACGG CGCCCGGCTG
ATGGCCGAAC GCGGCCGGAT GTTCGGCAGC CTGCCCGAGG GCGGGCGGAT GGTGGCGATC
TTCGCCGACG CCAAACAGGT CGAGCAGATC GCCGGTGAGT TCGCGCGCGT GTCCGTCGGC
GCCTACAACG GACCCAACAC CGTGCTCTCC GGTCCCGGTG AGGATCTGGA ACAGATCGTC
GCCCGGTTCG CCGACGAGGG GATCCGCTGC ACCTGGTTGG AGACCAGTCA TGCCTTCCAC
TCCGAACTGC TCGATCCGGT GCTCGACGAG TTCGAGTCCT ACGCGGCGCA GTTCGAGTTC
GCCGCCCCGA CGATGCCGGT GGTGTGCAAC CGGACCGGCG CGGTACTGAC CGGGCAGACC
CCGCTCGATG CGCAGTACTG GCGGCGGCAT TCGCGCCAGC CGGTGCAGTT CGCCGAGAGT
GTGCGCACCG TCGCGACGCT CGGCTGTTCG GTGCTGATGG AGATCGGTCC GCAACCCGTG
CTGACCGGGG CCGCGGTGCA GATCTGGCCG GAGCACCTGG CCGCTCCGCG GGCGATCGCC
TCGCTGCGCA AGGGCGTCGG CGATCGGCGT CAGATCGCCG ATGCGCTGGC CGCAGCCTAT
GTCGGCGGCC TCCGGCCCGA TTTCGCTGCG CTGCAAGGTC AGCCGCATCA CCGGCTCGAA
CTGCCCACGT ATCCTTTCCA ACGTCGACGG TTCTGGCCCA AGACGTCGAG TATCACGGTG
GACGGTCCGG CGACGTCCGG AATCCTGGGC AGTGCCAAGG ATCTCGCGTC CGGCGACACC
GTCTACACCA GCAGGCTGTC CGTCAAGTCG CAGCCGTGGC TGTCCGACCA CGTCATCTAC
GGCACGGTCG TCGTCCCCGG AGCCACCTAT GCGGCGATGG CGTTGGCCGC GGTCGGTACC
CCGGCGCGGG TGAAGGACGT GTTCTTCTAC GAACCGATCA TCCTGCCCGA GAAGAGTTCT
CGCGAAGTGC AGCTGACCCT GCACATACTC GGCGACGGTG AGCAGAAGTT CCAGGTGCAC
AGCCGTTCGT ACGGTGTGCG GGACGCCGAG TGGTCGTTGA ACGCCGAAGG CACTGTGGTG
CGGGGTGTCG ACGACGCGCC GGTGTCGCAG GATGATCCGG TCGACGAGGC GATCGAGCGG
TGCAACCGCA TGCGTCCGCA GGAACTGTTC GAGACCTTCG CCGACATGGA ACTGGCGTGG
GGTCCGACCT GGTCCGGATC CCTGAAGTCG CTGTGGCTCG GTGACGGTGA GGCGATCGGT
GACGTCCTCG TCGGCGCGGA ACTCGCCGAA CAACTCGGCA GCGAGCCGAT CCACCCGGTG
CTGATGGACC TGTGCACCGG CGTCGCGTTC CCGGCGTTCC CGGCGCTTCT CGCCGCCGAG
CAGGGGGTGA GCGATCTGTT CCTGCCGCTG CGGTACGGGC AGGTGACGGT GCAGGAGAAG
ATGCCTCGGC GGTTCTACTG CCGCGCCAAG TGGCACCACA GCGAACTGGA CAGTGAGACC
CAGGTCTTCG ACCTCGACTT CATCAGTCGG GACGGCCGCC CCCTCGGCGG TATCCGCGAG
TTCACGGTCA AACGCGCACC TCGCGAGGCG CTCCTACGCG GCCTGGGCGG CGACGCCACC
CGGCTGCTCT ACACCCTGGG CTGGCACGAG GTGCCGTTGC CTGCGGTGGA TCCGGCTGCC
CCCAACGGCA ACTGGCTGAT CGCCGGGTTC GACGAACTGG CCGCCTCGGT CCCCGGATGC
ATCCCCTTCG ACCGGACGAC GGATCCGGAG CCGCTCGGCC AGCTGCTCAC CCAGGCACAC
GAGCGCGGTA TGGCGTTCTC CGGCGTCGTA TGGCGTGCCG CCGCACCGAA GCCGGACGAG
TCGAGCGCCG ATGTCGCCGC GCGGATCGAG ACCGAGATCG CCAACCTGCT CAGCGCGGTG
CACGCCGTGC AACGCGGTGA GGTGAAGCTG CCAGGGGGTC TGTGGATCGT CACCGAGCGG
GCGGTGGCCT GCGAATCCGG CGAACCGGTC GACCCGGTGC AGGCGGCGCT GTGGGGCTTC
GGGCGTACCA CGATCAACGA GGAGCCGGCG CTGCGCTGCA AACTCGTCGA CTGCGACGGA
TCCCCGGAAG CGGTCGAGGC GCTGAGTGCC CTGCTCACCA CGCCGGTCGA CGAGCCGGAA
CTGGCACTGC GCCAAGGGAA GTTGCTCGCG TCGAGGTTGT TGCACTGGGC GCGCAGCGGT
CATCTCACGG TGCCGCGATC GACCGACTAC GTCCTGGCGC CCACCGAACG CGGCGCGATC
GACAACCTGC GGCTCACCGA GACGGAGGTG CCGCCGCCGG CCGAGGGCTA CGTGCAGGTG
GAGGTGGAAG CCGCAGGCCT GAACTTCCGC GACGTGCTCA ACGTCCTCGG GCTCTACCCC
GGTGACCCGG GACCGATCGG CGGCGACTTC GCCGGTGTCG TCACGCAATT GGGTGACGGG
GTCGGTTCGG GGCGAGCGGA GCGACGGGAT GGAATCAAGC TCGAGGTGGG TCAGCGCGTC
TACGGCTTCA TGCAGGGCGC GTTCTCGAGC CGGTTCAACG TGCCGGCCCA GTTGCTCGCG
CCGATCCCCG ACGGGGTGGG CGCGGTCGAG GCTGCCACGA TTCCCGCTGC GGCGCTCACG
GCCCGCCTCG CGTTCGACTG GGCGCAACTC GAGCCCGGCG ACCGGGTGCT CATCCACGCT
GCCAGCGGTG GCGTCGGACT GGCCGCCATC CAGCTGGCCC AGCAGCACGG CGCCGTCGTG
TTCGCCACCG CGAGCACCTA CAAGCGCGCG ACGCTGCGCA AGATGGGTGT GGAGTACGTC
TACGACTCGC GCAGTACGGA TTTCGCCGAC CAGATCCTGG CCGACACCGA CGGCGCAGGC
GTCGACGTGG TGCTCAACAG CCTGACCAAC GAGGGTTTCG TCGAGGCGAC CGTGCGCGCC
ACCGCGCAGA ACGGCCGGTT CGCCGAGATC GCCAAGCGCG ACATCTGGAC GCACGAGCAG
ATGGCGGCGG CCCGTCCCGA CATCTCCTAC GAGATCGTGG CTCTCGATAC GGTGACCATT
CAGGAGCCCG AGCGCATCCG CGGGCTGCTC GGCGAGGTGT CGGACGGGCT GGGCAAGGGT
GAGTGGGCGC CGTTGCCTGC CGAGATCTAT CCGCTGACCG AGGCCAGGGC CGCGTTCCGG
CGCATGCAGC AGGCACGCCA CATCGGCAAG ATCGTGGTCC AGATGCCAAG CCCGCTGCAG
CCGCGTCCCG ACCGCAGCTA CCTGATCACC GGTGGCCTGG GTGCGATCGG CCTGCACACC
GCGTCGTACC TGGCCCAACT CGGCGCAGGG GACATCGTGT TGACCAGCCG TCGGGAACCC
GATGCGGACA CTCAGCAGGT GATCGACGAG ATCACCGACC GCCACCGCTG CCGCATCCAC
ACCTTCGCCG CCGACGTCGG CGACGAGTCC CAGGTCGAGG AACTGCTCGA GCGGATCCGC
GCGGAGCTGC CGCCGCTCGC CGGTGTCGCA CATCTGGCCG GTGTGCTCGA CGACGCGCTG
CTGTCCCAAC AGAGCGTGGA GCGGTTCCGA ACCACGCTGG CGCCCAAGGC TTTCGGCGCC
TACCACCTGG ACCACCTGAC CAGAGACGAC GATCTGGACT TCTTCATCGT GTCCTCGTCG
GTGTCGAGCC TGTTCGGATC CCCCGGCCAG GCCAACTACG CCACGGCCAA CGCACTGCTC
GACGGACTGG TCGCGCGGAG AAGGGCGCAC GGCCTGCCGG CCACCGGTGT CAACTTCGGT
CCGTGGGCAC AGGGCGGCAT GGCATCCTCG GAGGCCGCCA CCGCCAACAT CAGTGCCCAG
GGCCTGGTTC CGCTGGAGCC GTCGGCGGCG CTGAGCGCAC TCGCCGAGGT CGTCGCGAAC
GGCACCGCAC AGGCCACCGT GATCAAGGCC AACTGGCAGC GTGCGGCCAA GGTGCTGGGC
GCATCGCGGC CACCGCTCCT CGATCTGGTC CTGCCGAGTG CGGCCGGGGA GGTGACCGGT
GACAGCGAAC TGCTCCGGCA GCTGCAGGAG ATCCCGGTCG CGCAGCGGGC CGGGTTCGTC
ACCGAGTTCC TCCAGCGCGA GGTGCAGAAC TTCCTGCGAC TCGCGCAGCC GCCCGCCGCG
TCGAGCCGGT TCCTGGATCT CGGCACGGAT TCCCTGATGG CGATCGAACT CCGCAACCGG
TTGCACAGTC AGTTCGGCGG CGCGTTCACG ATCAACGCGA CCGCGGTGTT CGACTACCCG
ACCATCGGGG GACTCGCCGA GTATCTGGTG GGTCAGCTGC CCGACGCCGA ATCACCGGCG
GCGGAGACGG CGCCCGCTGC GGAGGTTCCG GCGGCCGACT GA
 
Protein sequence
MSHRGRALTA DAREDTNGME SAAYPHVPAN RFAIVGYAAR FPGAQDADEF WDLLRDGREA 
ISEVPQDRWN VDEFFDPEPG APGKVVTRRA GFVDDVTGFD APFFGMSTRE VRLMDPQHRL
LLETAWRAVE HSGTAPTDLA NSNTGVFVGL ATHDYLGMAS DELSYPEIEA YMAIGTSNAA
AAGRISYRLG LQGPAVAVDT ACSSSLVAIH QACQALQLGE CDLALAGGAN VLLTPATMIT
FSNAHMLAPD GRCKTFDAAA DGYVRGEGCG VIVIKRLEDA LRDGDRIRAV IRGSAINQDG
ASGGLTVPNG VAQQRVITDA LKRAGVRAGD VGYLEAHGTG TSLGDPIEAQ AAGEVLGAGR
EPSRPLLIGS AKTNIGHLEA AAGIAGVIKV ILSLEHETLP KHLNFTTPSP HIPWDRLAVK
VVTESTPWER NGRPRIAGVS SFGFAGTNAH VILEEAPEST VTPAATPEPG ERFSLLPLSA
RTPAALVRIA DRYRSWMTTH PEATLADVCF TASTARAHLE QRAALVVDSR ESAVELLGAL
ADDRPAPGLV RGESHDTPKT AWLFTGQGSQ YPGMARELFD TEPVFAETMK QCAAAVADVL
EKPLLDVIFD AHGPEAAETL RQTSYAQPAL FAVEMGLARL WQSWGFEPDV VLGHSVGQYA
AACVAGVLGL EDGARLMAER GRLFGSLPAG GRMVAVFTAA ERVENITDEY PRLSVAAYNG
ANTVLSGPAG DLEKAVATLA ADGIRCDWLD TSHAFHSALL DPILDEFESY AQQIEYAAPQ
RVLIDNRTGA ALGWSAKLDG TYWRRHARQP VEFAKSVRTL AELNCKVLLE IGPRPVLTAA
ALSAWPDPAT APRAIASLRR NTADHRQITE ALADAYVLGH RPDFAAIQRP DAHKLDLPTY
PFEHRQYSFR DNQAERPEQA GPPIHQAART EAVGLLEDGR IEELAALLGD TDGDRQTFDV
LSKLAAQHNQ QRTSQSITDD RYEIRWEELT TASPAATGEP STWIIVGDDT DAARPLIDAV
TARGDRHRLV GSPVSDADEA SLADALRASV DDASAPGGAV RILHIAALDA TTAPSMRTLL
SMQHRILGRT RRLFHAVTTT GLRTPIWLIT RGAQRVTATD TVAPDQSALW GFGRAASLEL
PHLWGGLADL PTGPGASEDE WSRLLDRIAA PRHSDVTEDQ VALRDGAVHV PRLVRRTGQP
SGAPLRLRSD ATYLVTGGLG AIGLEIAGYL AAHGAGNIVL TSRRAPGDAA QQRIDALRDK
FGCAIRVATA DVADAHDVAR LLAGVQAELP PLAGIVHAAG EIGTTALSAM DDESQQAEVD
RVFAGKVWGA WHLSEAAVDL QLDFFISTSS IASVWGGFGQ TAYGAANAFL DGLAWRLREQ
GIAATSVNFG PWSAGMADAE SRARLEQRGV RTLDPADALA GLADVVAGPA TQGVIARIDW
ARFLPLYQQA GRRAFLTELE REAPVASTDA APAVTASGKT PLVERLSGAP VQQRKKLLTD
YLRDAVAEVT RVDSAEIRED AGFFDLGMDS LMAVELRRRL EQGVGKEIPV TLVMDHPRLS
DAADYLLGEV LGLAEQAPAE SRPALASLAS ERTDDPIAIV AVSCRFPGAP DPEAFWDLLS
GGVDAIREVP EDRFDIDEFY DPDPDAAGKT YTRFGGFLDG IDGFDPEFFG ISPREAVWIE
PQQRLMLETV WEGIERAGLS PADLRGSRTG VFVGVAANEY AHLLSSESIE KIEPHFITGN
ALNAISGRVA FALGLEGPAV AVDTACSSAL VAVHQAVQAL HSGDCDLAVA GGVNVLLSPV
TVVAASRARM LSPVGRCKTF DASADGYVRS EGCGILVLKR LSDAVRDGDR VCAVIPGTAV
NQDGASSGLT VPNGGAQQRL IKTALTRAGL TGGDVDYLEA HGTGTPLGDP IEVQAAAAAY
GASRDADRPL LMGSVKSNIG HLESASGAAG LIKVVLSLQH GVLPQSLHFD NPSPHIPWDS
LPVRVVDEAV PWQPNGRPRR AGVSSFGFTG TNAHVLVEEA PQAPVSEDQE SDTDDQPVHV
LPLSARSPEA LVALARRYDS WLGAHPDADL AGVCFTAGTG RSHFEHRAAM VVDSVASARQ
GLADLADNRT RPGVVRGEHT NRPTTAWLFT GQGSQYPRMA RELFDAEPVF AETVTRCADA
VDGMLPLPLL EVLFAADRET AERLRHTSYA QPALFAVEMG LARLWQSWGV TPDVVLGHSV
GQYAAACVAG VFSLEDGARL MAERGRMFGS LPEGGRMVAI FADAKQVEQI AGEFARVSVG
AYNGPNTVLS GPGEDLEQIV ARFADEGIRC TWLETSHAFH SELLDPVLDE FESYAAQFEF
AAPTMPVVCN RTGAVLTGQT PLDAQYWRRH SRQPVQFAES VRTVATLGCS VLMEIGPQPV
LTGAAVQIWP EHLAAPRAIA SLRKGVGDRR QIADALAAAY VGGLRPDFAA LQGQPHHRLE
LPTYPFQRRR FWPKTSSITV DGPATSGILG SAKDLASGDT VYTSRLSVKS QPWLSDHVIY
GTVVVPGATY AAMALAAVGT PARVKDVFFY EPIILPEKSS REVQLTLHIL GDGEQKFQVH
SRSYGVRDAE WSLNAEGTVV RGVDDAPVSQ DDPVDEAIER CNRMRPQELF ETFADMELAW
GPTWSGSLKS LWLGDGEAIG DVLVGAELAE QLGSEPIHPV LMDLCTGVAF PAFPALLAAE
QGVSDLFLPL RYGQVTVQEK MPRRFYCRAK WHHSELDSET QVFDLDFISR DGRPLGGIRE
FTVKRAPREA LLRGLGGDAT RLLYTLGWHE VPLPAVDPAA PNGNWLIAGF DELAASVPGC
IPFDRTTDPE PLGQLLTQAH ERGMAFSGVV WRAAAPKPDE SSADVAARIE TEIANLLSAV
HAVQRGEVKL PGGLWIVTER AVACESGEPV DPVQAALWGF GRTTINEEPA LRCKLVDCDG
SPEAVEALSA LLTTPVDEPE LALRQGKLLA SRLLHWARSG HLTVPRSTDY VLAPTERGAI
DNLRLTETEV PPPAEGYVQV EVEAAGLNFR DVLNVLGLYP GDPGPIGGDF AGVVTQLGDG
VGSGRAERRD GIKLEVGQRV YGFMQGAFSS RFNVPAQLLA PIPDGVGAVE AATIPAAALT
ARLAFDWAQL EPGDRVLIHA ASGGVGLAAI QLAQQHGAVV FATASTYKRA TLRKMGVEYV
YDSRSTDFAD QILADTDGAG VDVVLNSLTN EGFVEATVRA TAQNGRFAEI AKRDIWTHEQ
MAAARPDISY EIVALDTVTI QEPERIRGLL GEVSDGLGKG EWAPLPAEIY PLTEARAAFR
RMQQARHIGK IVVQMPSPLQ PRPDRSYLIT GGLGAIGLHT ASYLAQLGAG DIVLTSRREP
DADTQQVIDE ITDRHRCRIH TFAADVGDES QVEELLERIR AELPPLAGVA HLAGVLDDAL
LSQQSVERFR TTLAPKAFGA YHLDHLTRDD DLDFFIVSSS VSSLFGSPGQ ANYATANALL
DGLVARRRAH GLPATGVNFG PWAQGGMASS EAATANISAQ GLVPLEPSAA LSALAEVVAN
GTAQATVIKA NWQRAAKVLG ASRPPLLDLV LPSAAGEVTG DSELLRQLQE IPVAQRAGFV
TEFLQREVQN FLRLAQPPAA SSRFLDLGTD SLMAIELRNR LHSQFGGAFT INATAVFDYP
TIGGLAEYLV GQLPDAESPA AETAPAAEVP AAD
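
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, not the tool the database used. It assumes that the amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, which does not matter here because the gene starts with ATG). `translate` and the constant names are hypothetical helpers, and only the first and last few codons of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    cleaned = "".join(dna.split()).upper()
    residues = []
    # Ignore any incomplete trailing codon.
    for i in range(0, len(cleaned) - len(cleaned) % 3, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence in this record.
first_codons = "ATGTCCCACA GGGGTCGTGC ACTGACCGCA"
last_codons = "GCGGCCGACT GA"
print(translate(first_codons))
print(translate(last_codons))
```

The two fragments translate to the first ten residues and the last three residues of the protein sequence, and the final codon TGA is read as a stop.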