Gene Mjls_0234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_0234 
Symbol 
ID4875980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp256908 
End bp268016 
Gene Length11109 bp 
Protein Length3702 aa 
Translation table11 
GC content70% 
IMG OID640137548 
Productbeta-ketoacyl synthase 
Protein accessionYP_001068538 
Protein GI126432847 
COG category[C] Energy production and conversion
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases
[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR00128] malonyl CoA-acyl carrier protein transacylase
[TIGR00517] acyl carrier protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCACA GGGGTCGTGC ACTGACCGCA GACGCAAGGG AGGACACGAA CGGCATGGAG 
TCTGCCGCAT ATCCGCACGT GCCGGCCAAC CGATTTGCCA TCGTCGGCTA CGCGGCTCGG
TTTCCAGGGG CACAGGACGC AGACGAGTTC TGGGACCTTT TGCGGGACGG TCGCGAGGCG
ATTTCGGAAG TCCCCCAAGA CCGCTGGAAT GTCGACGAAT TCTTCGACCC GGAACCGGGG
GCGCCCGGTA AGGTCGTGAC CCGCCGTGCG GGGTTCGTCG ACGACGTGAC CGGGTTCGAC
GCGCCGTTCT TCGGCATGTC GACGCGCGAG GTCAGGTTGA TGGACCCGCA GCACCGGCTG
CTGCTCGAGA CGGCGTGGCG TGCGGTCGAA CATTCGGGCA CCGCGCCAAC GGATCTGGCC
AACAGCAACA CCGGAGTCTT CGTCGGTCTG GCCACCCACG ACTACCTGGG CATGGCTTCG
GACGAGCTCA GCTACCCCGA GATCGAGGCC TACATGGCCA TCGGTACGTC GAATGCGGCG
GCGGCAGGCC GGATCAGCTA TCGGCTGGGC CTGCAGGGGC CCGCGGTCGC CGTCGACACC
GCGTGCAGCT CGTCGCTGGT GGCCATCCAC CAGGCGTGCC AGGCGCTGCA ACTGGGCGAA
TGCGACCTCG CGCTGGCCGG CGGTGCGAAC GTCCTGCTCA CCCCCGCCAC CATGATCACG
TTCTCCAACG CGCACATGCT CGCACCCGAC GGCCGGTGCA AGACCTTCGA CGCCGCCGCG
GACGGTTACG TACGCGGTGA GGGATGCGGC GTCATCGTCA TCAAACGCCT CGAGGACGCG
CTGCGCGACG GCGACCGGAT CCGGGCGGTG ATCCGCGGCA GCGCAATCAA CCAGGACGGT
GCTTCCGGCG GTCTGACCGT GCCGAACGGT GTTGCCCAGC AAAGGGTTAT CACCGATGCG
CTCAAACGAG CGGGTGTTCG GGCCGGCGAT GTCGGATATC TGGAGGCGCA CGGCACCGGA
ACCTCGCTGG GCGACCCGAT CGAGGCGCAG GCCGCAGGTG AGGTGCTCGG CGCCGGACGC
GAACCGAGCC GGCCGCTGCT GATCGGCTCG GCGAAGACGA ACATCGGGCA CCTGGAAGCG
GCGGCGGGGA TCGCCGGCGT CATCAAGGTG ATCCTGTCGC TCGAACACGA GACCCTGCCG
AAGCACCTCA ACTTCACCAC CCCGTCGCCG CACATCCCGT GGGACCGGCT CGCGGTGAAG
GTCGTGACGG AGTCCACGCC CTGGGAGCGC AACGGCCGGC CGCGCATCGC CGGGGTGAGC
TCGTTCGGGT TCGCCGGCAC CAACGCCCAC GTCATCCTCG AAGAGGCACC CGAGTCGACC
GTCACGCCCG CGGCGACCCC CGAACCCGGC GAGCGGTTCA GCCTCCTTCC CCTCTCGGCG
CGGACACCGT CCGCCCTGGT GCGGATCGCC GATCGGTACC GCAGCTGGAT GACCACCCAT
CCGGAGGCCA CGCTGGCCGA CGTCTGCTTC ACCGCGAGCA CGGCGCGCGC ACACCTGGAG
CAGCGAGCCG CACTGGTGGT CGATTCGCGG GAGTCGGCGG TCGAACTGCT CGGTGCGCTG
GCCGAGGACC GACCGGCACC CGGCCTGGTG CGGGGCGAAT CCCATGACAC ACCGAAGACG
GCGTGGCTGT TCACCGGGCA GGGCAGCCAG TACCCGGGCA TGGCGCGGGA GCTGTTCGAC
ACCGAACCGG TGTTCGCCGA GACGATGAAG CAGTGTGCGG CCGCGGTCGC CGATGTGCTC
GAAAAGCCGT TGCTCGACGT GATATTCGAT GCCCACGGTC CAGAGGCTGC CGAGACGTTG
CGGCAGACCT CGTACGCGCA GCCCGCCCTG TTCGCCGTCG AGATGGGCCT GGCCCGGCTC
TGGCAATCGT GGGGCTTCGA ACCCGATGTG GTGCTGGGCC ACAGCGTCGG CCAGTACGCG
GCGGCCTGCG TCGCGGGTGT GCTCGGCCTC GAGGACGGTG CGCGGCTGAT GGCAGAACGC
GGCCGCCTGT TCGGCAGCCT GCCCGCGGGT GGCCGCATGG TCGCGGTGTT CACCGCCGCC
GAACGGGTCG AGAACATCAC CGACGAGTAC CCGCGCCTGT CGGTCGCGGC CTACAACGGC
GCCAACACCG TATTGTCCGG CCCGGCAGGC GATCTGGAAA AGGCCGTCGC CACGCTGGCG
GCCGACGGTG TCCGGTGCGA CTGGTTGGAC ACCAGCCACG CTTTCCACTC GGCGCTGCTC
GACCCCATCC TCGACGAGTT CGAGTCGTAC GCGCAGCAGA TCGAATACGC CGCCCCGCAA
CGGGTTCTGA TCGACAACCG CACCGGCGCC GCACTCGGCT GGAGCGCGAA ACTCGACGGC
ACCTATTGGC GCAGGCACGC GCGCCAACCC GTGGAGTTCG CCAAGAGCGT GCGCACCCTT
GCCGAACTGA ACTGCAAGGT CCTGCTCGAG ATCGGTCCGC GACCGGTGCT CACCGCCGCA
GCCCTTTCGG CCTGGCCCGA CCCCGCCACC GCGCCGCGGG CGATCGCCTC GCTGCGGCGC
AACACCGCCG ACCACCGGCA GATCACCGAA GCCCTCGCCG ACGCCTACGT CCTGGGCCAC
CGGCCCGATT TCGCGGCGAT TCAGCGGCCG GACGCGCACA AGCTCGACCT GCCGACGTAT
CCGTTCGAGC ACCGCCAGTA CTCGTTCCGG GACAACCAGG CGGAACGTCC CGAACAGGCA
GGCCCGCCGA TCCATCAGGC CGCGCGCACC GAGGCGGTCG GCCTCCTCGA GGACGGCCGG
ATCGAGGAAC TCGCGGCCCT GCTCGGCGAC ACCGACGGTG ATCGGCAGAC CTTCGACGTG
CTGAGCAAGC TTGCGGCACA ACACAACCAA CAGCGTACGA GTCAGTCGAT CACCGATGAC
CGCTACGAGA TCCGCTGGGA GGAGCTCACC ACCGCATCGC CGGCGGCGAC CGGCGAGCCG
TCCACGTGGA TCATCGTCGG CGACGACACC GACGCCGCCC GTCCGCTGAT CGACGCGGTG
ACCGCCCGCG GCGACCGCCA CCGGCTCGTC GGGTCGCCGG TGTCCGACGC CGACGAGGCG
TCCCTCGCGG ATGCGTTGCG CGCCTCAGTG GACGACGCAT CGGCGCCAGG GGGCGCTGTC
CGCATCCTGC ACATCGCGGC CCTCGACGCC ACCACCGCAC CGTCGATGCG GACCCTGCTG
AGGATGCAGC ACCGGATCCT CAACAGAACC CGACGGCTCT TCCATGCCGT GACCACCACC
GGACTGCGCA CCCCGATCTG GCTGATCACC CGTGGCGCAC AACGAGTCAC GGCCACCGAC
ACCGTCGCGC CCGATCAGAG CGCGTTGTGG GGATTCGGAC GGGCCGCGTC GCTGGAGCTT
CCGCACCTGT GGGGCGGGTT GGCCGACCTG CCGACCGGTC CCGGTGCGAG CGAAGACGAA
TGGTCCCGGC TCCTCGACCG GATCGGCGCG CCGCGACAGT CGGATGTCAC CGAAGACCAG
GTCGCGCTGC GCGACGGCGC CGTCCACGTG CCCCGGCTGG TCCGGCGGAC CGGGCAGCCC
AGCGGTGCTC CGCTGCGGCT GCGGAGCGAC GCAACGTATC TCGTGACCGG CGGGCTGGGT
GCGATCGGCC TGGAGATCGC GGGATACCTG GCCGCGCACG GTGCCGGGAA CGTCGTGCTG
ACCAGCAGGC GCGCACCCGG CGATGCCGCG CAACAGCGCA TCGACGCGCT GCGCGACAAG
TTCGGCTGCG CGATCCGGGT GGCCACCGCC GACGTCGCCG ACGCGCACGA CGTGGCACGC
CTGTTGGCGG GTGTGCAGGC CGAGCTACCG CCGTTGGCCG GCATCGTCCA CGCCGCCGGT
GAGATCGGCA CCACCGCACT GAGCGCGATG GACGACGAAT CCCAGCAAGC CGAGGTCGAT
CGCGTATTCG CCGGAAAGGT CTGGGGCGCC TGGCATCTCA GCGAGGCGGC GGTCGACCTG
CAGCTCGACT TCTTCATCAG CACCTCGTCG ATCGCCTCGG TCTGGGGTGG GTTCGGTCAG
ACCGCCTACG GGGCGGCGAA CGCCTTCCTC GACGGACTGG CCTGGCGCCT GCGCGAACAG
GGTATCGCCG CGACCAGCGT CAACTTCGGT CCGTGGTCGG CGGGGATGGC CGACGCGGAG
TCCCGCGCGC GCCTCGAGCA GCGCGGAGTC CGGACCCTCG ACCCGGCCGA TGCACTGGCC
GGCCTGGCCG ACGTCGTGGC GGGTCCTGCG ACTCAGGGCG TGATCGCGCG GATCGACTGG
GCCCGTTTCC TGCCGCTGTA CCAGCAGGCG GGTAGGCGCG CGTTCCTGAC CGAGCTGGAG
CGCGAGGTGC CGGTCGCGTC GACGGATGCG GCGCCGGCGG TGACGGCATC CGGGAAGACC
CCACTGGTCG AGCGGCTCTC GGGTGCCCCG GTGCAGCAGC GCAAGAAGCT GCTCACCGAC
TATCTGCGGG ACGCGGTGGC CGAGGTGACA CGCGTCGATT CCGCCGAGAT CCGCGAGGAC
GCAGGGTTCT TCGACCTCGG GATGGATTCG CTGATGGCCG TCGAACTGCG GCGCCGCCTC
GAACAGGGTG TCGGCAAGGA GATCCCGGTC ACCCTGGTGA TGGACCATCC CCGACTGTCC
GACGCGGCCG ACTACCTGCT CGGCGAGGTG CTCGGCCTGT CCGAACAGGC GCCCGCCGAA
TCCCGGCCCG CGCTGGCGTC TCTGGCCTCC GAGCGCACGG ACGACCCGAT CGCGATCGTC
GCGGTCTCGT GCCGGTTCCC CGGCGCTCCC GACCCGGAAG CCTTCTGGGA TCTGCTCTCC
GGTGGTGTCG ACGCGATCCG GGAGGTCCCG GAGGATCGGT TCGACATCGA CGAGTTCTAC
GACCCGGATC CGGACGCCGC AGGCAAGACC TACACGCGCT TCGGCGGATT CCTCGACGGG
ATCGACGGAT TCGACCCCGA GTTCTTCGGC ATCTCCCCCC GTGAGGCCGT CTGGATCGAA
CCGCAGCAGC GACTGATGCT CGAAACGGTC TGGGAGGGCA TCGAAAGAGC CGGCCTCTCC
CCGGCGGACC TGCGGGGCAG CCGGACCGGG GTCTTCGTGG GCGTGGCCGC CAACGAGTAC
GCCCACCTGC TGTCGTCGGA GTCGATCGAG AAGATCGAGC CCCACTTCAT CACCGGTAAC
GCGCTCAACG CCATCTCCGG CAGGGTCGCC TTCGCGCTGG GCCTCGAGGG TCCGGCGGTC
GCGGTGGACA CCGCGTGCAG TTCGGCGTTG GTGGCCGTCC ACCAGGCCGT TCAGGCACTG
CACTCCGGGG ACTGCGACCT GGCAGTGGCC GGCGGGGTGA ACGTCCTGCT CAGCCCGGTG
ACGGTGGTCG CCGCCTCGCG CGCGCGGATG CTCTCCCCCG TCGGTCGGTG CAAGACCTTC
GACGCCTCCG CCGACGGCTA CGTGCGCAGC GAAGGCTGCG GCATCCTGTT GCTCAAGCGA
CTCAGCGACG CGGTGCGCGA CGGAGACCGG GTCTGCGCGG TCATCCCCGG CACCGCTGTG
AACCAGGACG GCGCCTCCAG CGGTCTGACC GTGCCCAACG GTGGTGCGCA GCAACGCCTC
ATCAAGACCG CACTGACCCG CGCCGGCCTG ACGGGCGGTG ACGTCGACTA CCTCGAGGCA
CACGGGACGG GCACCCCGCT GGGTGATCCG ATCGAGGTGC AGGCGGCCGC CGCCGCCTAC
GGCGCCTCCC GTGACGCGGA CCGGCCACTG CTGATGGGAT CGGTCAAGAG CAACATCGGT
CACCTCGAAT CCGCCTCGGG CGCAGCAGGT CTGATCAAGG TCGTGTTGTC GCTTCAGCAC
GGCGTGCTGC CGCAGAGCCT GCACTTCGAC AACCCGTCGC CGCACATCCC GTGGGATTCG
CTGCCGGTGC GGGTCGTCGA CGAGGCGGTG CCGTGGCAGC CCAACGGCAG GCCGCGGCGC
GCCGGGGTGA GTTCCTTCGG GTTCACCGGC ACGAACGCGC ACGTGTTGGT AGAAGAGGCG
CCGCAGGCAC CGGTGTCCGA GGACCAGGAG TACGACACCG ACGATCAGCC CGTCCACGTC
CTCCCGCTGT CTGCCCGGTC ACCGGAGGCA CTGGTGGCGT TGGCCCGGCG TTACGACTCG
TGGCTGGGCG CGCACCCGGA CGCCGACCTC GCCGGCGTGT GCTTCACGGC CGGTACGGGT
CGTTCGCATT TCGAACATCG CGCGGCGATG GTCGTGGATT CGGTCGCCAG CGCCCGCCAG
GGTCTGGCCG ACCTGGCCGA CAACCGCACC CGCCCCGGTG TCGTGCGGGG TGAGCACACG
AACCGCCCGA CGACCGCGTG GTTGTTCACC GGACAGGGCA GCCAGTACCC CCGGATGGCG
CGCGAGTTGT TCGACGCCGA ACCGGTTTTC GCGGAAACCG TGACGCGATG TGCGGACGCG
GTCGACGGTA TGCTGCCGCT TCCGCTGCTC GAGGTGCTGT TCGCCGCTGA CCGCGAGACC
GCCGAACGGT TGCGGCACAC CTCGTACGCG CAGCCCGCGC TGTTCGCGGT CGAGATGGGC
CTGGCCCGGC TGTGGCAGTC GTGGGGTGTC ACACCCGACG TGGTGCTGGG GCACAGCGTG
GGCCAGTACG CCGCGGCGTG CGTGGCCGGG GTGTTCAGCC TCGAGGACGG CGCCCGGCTG
ATGGCCGAAC GCGGCCGGAT GTTCGGCAGC CTGCCCGAGG GCGGGCGGAT GGTGGCGATC
TTCGCCGACG CCAAACAGGT CGAGCAGATC GCCGGTGAGT TCGCGCGCGT GTCCGTCGGC
GCCTACAACG GACCCAACAC CGTGCTCTCC GGTCCCGGTG AGGATCTGGA ACAGATCGTC
GCCCGGTTCG CCGACGAGGG GATCCGCTGC ACCTGGTTGG AGACCAGTCA CGCCTTCCAC
TCCGAACTGC TCGATCCGGT GCTCGACGAG TTCGAGTCCT ACGCGGCGCA GTTCGAGTTC
GCCGCCCCGA CGATGCCGTT GGTGTGCAAC CGGACCGGCG CGGTACTGAC CGGGCAGACC
CCGCTCGATG CGCAGTACTG GCGGCGGCAT TCGCGCCAGC CGGTGCAGTT CGCCGAGAGT
GTGCGCACCG TCGCGGCGCT CGGCTGTTCG GTGCTGATGG AGATCGGTCC GCAACCCGTG
CTGACCGGGG CCGCGGTGCA GATCTGGCCG GAGCACCTGG CCGCTCCGCG GGCGATCGCC
TCGCTGCGCA AGGGCGTCGG CGATCGGCGT CAGATCGCCG ATGCGCTGGC CGCAGCCTAT
GTCGGCGGCC TCCGGCCCGA TTTCGCTGCG CTGCAAGGTC AGCCGCATCA CCGGCTCGAA
CTGCCCACGT ATCCTTTCCA ACGTCGACGG TTCTGGCCGA AGACGTCGAG CATCACGGTG
GACGGTCCGG CGACGTCCGG AATCCTGGGC AGTGCCAAGG ATCTCGCGTC CGGCGACACC
GTCTACGCCA GCAGGCTGTC CGTCAAGTCG CAGCCGTGGC TGTCCGACCA CGTCATCTAC
GGCACGGTCG TCGTCCCCGG AGCCACCTAT GCGGCGATGG CGTTGGCCGC GGTCGGTACC
CCGGCGCGGG TGAAGGACGT GTTCTTCTAC GAACCGATCA TCCTGCCCGA GAAGAGTTCT
CGCGAAGTGC AGCTGACCCT GCACATGCTC GGCGACGGTG AGCAGAAGTT CCAGGTGCAC
AGCCGTTCGT ACGGTGTGCG GGACGCCGAG TGGTCGTTGA ACGCCGAAGG CACTGTGGTG
CGGGGTGTCG ACGACGCGCC GGTGTCGCAG GATGATCCGG TCGACGAGGC GATCGAGCGG
TGCAACCGCA TGCGTCCGCA GGAACTGTTC GAGACCTTCG CCGACATGGA ACTGGCGTGG
GGTCCGACCT GGTCCGGATC CCTGAAGTCG CTGTGGCTCG GTGACGGTGA GGCGATCGGT
GACGTCCTCG TCGGCGCGGA ACTCGCCGAA CAACTCGGCA GCGAGCCGAT CCACCCGGTG
CTGATGGACC TGTGCACCGG CGTCGCGTTC CCGGCGTTCC CGGCGCTTCT CGCCGCCGAG
CAGGGGGTGA GCGATCTGTT CCTGCCGCTG CGGTACGGGC AGGTGACGGT GCAGGAGAAG
ATGCCTCGGC GGTTCTACTG CCGCGCCAAG TGGCACCACA GCGAACTGGA CAGTGAGACC
CAGGTCTTCG ACCTCGACTT CATCAGTCGG GACGGCCGCC CCCTCGGCGG TATCCGCGAG
TTCACGGTCA AACGCGCACC TCGCGAGGCG CTCCTACGCG GCCTGGGCGG CGACGCCACC
CGGCTGCTCT ACACCCTGGG CTGGCACGAG GTGCCGTTGC CTGCGGTGGA TCCGGCTGCC
CCCAACGGCA ACTGGCTGAT CGCCGGGTTC GACGAACTGG CCGACGCGGT CCCCGGATGC
ATCCCCTTCG ACCGGACGAC GGATCCGGAG CCGCTCGGCC AGCTGCTCAC CCAGGCACAC
GAGCGCGGTA TGGCGTTCTC CGGTGTCGTA TGGCGTGCCG CCGCACCGAA GCCGGACGAG
TCGAGCGCCG ATGTCGCCGC GCGGATCGAG ACCGAGATCG CCAACCTGCT CAGCGCGGTG
CACGCCGTGC AACGCGGTGA GGTGAAGCTG CCGGGGGGTC TGTGGATCGT CACCGAGCGG
GCGGTGGCCT GCGAATCCGG TGAACCGGTC GACCCGGTGC AGGCGGCGCT GTGGGGCTTC
GGGCGTACCA CGATCAACGA GGAGCCGGCG CTGCGCTGCA AACTCGTCGA CTGCGACGGA
TCCCCGGAAG CGGTCGAGGC GCTGAGTGCC CTGCTCACCA CGCCGGTCGA CGAGCCGGAA
CTGGCACTGC GCCAAGGGAA GTTGCTCGCG TCGAGGTTGT TGCACTGGGC GCGCAGCGGT
CATCTCACGG TGCCGCGATC GACCGACTAC GTCCTGGCGC CCACCGAACG CGGCGCGATC
GACAACCTGC GGCTCACCGA GACGGAGGTG CCGCCGCCGG CCGAGGGCTA CGTGCAGGTG
AAGGTGGAGG CCGCAGGCCT GAACTTCCGC GACGTGCTCA ACGTCCTCGG GCTCTACCCC
GGTGACCCGG GACCGATCGG CGGCGACTTC GCCGGTGTCG TCACGCAATT GGGTGACGGG
GTCGGTTCGG GGCGAGCGGA GCGACGGGAT GGAATCAAGC TCGAGGTGGG TCAGCGCGTC
TACGGCTTCA TGCAGGGCGC GTTCTCGAGC CGGTTCAACG TGCCGGCCCA GTTGCTCGCG
CCGATCCCCG ACGGGGTGGG CGCGGTCGAG GCTGCCACGA TTCCCGCTGC GGCGCTCACG
GCCCGCCTCG CGTTCGACTG GGCGCAACTC GAGCCCGGCG ACCGGGTGCT CATCCACGCT
GCCAGCGGTG GCGTCGGACT GGCCGCCATC CAGCTGGCCC AGCAGCACGG CGCCGTCGTG
TTCGCCACCG CGAGCACCTA CAAGCGCGCG ACGCTGCGCA AGATGGGTGT GGAGTACGTC
TACGACTCGC GCAGTACGGA TTTCGCCGAC CAGATCCTGG CCGACACCGA CGGCGCAGGC
GTCGACGTGG TGCTCAACAG CCTGACCAAC GAGGGGTTCG TCGAGGCGAC CGTGCGCGCC
ACCGCGCAGA ACGGCCGGTT CGCCGAGATC GCCAAGCGCG ACATCTGGAC GCACGAGCAG
ATGGCGGCGG CCCGTCCCGA CATCTCCTAC GAGATCGTGG CTCTCGATAC GGTGACCATT
CAGGAGCCCG AGCGCATCCG CGGGCTGCTC GGCGAGGTGT CGGACGGGCT GGGCAAGGGT
GAGTGGGCGC CGTTGCCTGC CGAGATCTAT CCGCTGACCG AGGCCAGGGC CGCGTTCCGG
CGCATGCAGC AGGCTCGCCA CATCGGCAAG ATCGTGGTGC AGATGCCAAG CCCGCTGCAG
CCGCGTCCCG ACCGCAGCTA CCTGATCACC GGTGGCCTGG GTGCGATCGG CCTGCACACC
GCGTCGTACC TGGCCCAACT CGGCGCAGGC GACATCGTGT TGACCAGCCG TCGGGAACCC
GATGCGGACA CTCAGCAGGT GATCGACGAG ATCACCGAGC GCCACCGCTG CCGCATCCAC
ACCTTCGCCG CCGACGTCGG CGACGAGTCC CAGGTCGAGG AACTGCTCGA GCGGATCCGC
GCGGAGCTGC CGCCGCTCGC CGGTGTCGCA CATCTGGCCG GTGTGCTCGA CGACGCGCTG
CTGTCCCAAC AGAGCGTGGA GCGGTTCCGA ACCACGCTGG CGCCCAAGGC TTTCGGCGCC
TACCACCTGG ACCACCTGAC CAGAGACGAC GATCTGGACT TCTTCATCGT GTCCTCGTCG
GTGTCGAGCC TGTTCGGATC CCCCGGCCAG GCCAACTACG CCACGGCCAA CGCACTGCTC
GACGGACTGG TCGCGCGGAG AAGGGCGCAC GGCCTGCCGG CCACCGGTGT CAACTTCGGT
CCGTGGGCAC AGGGCGGCAT GGCATCCTCG GAGGCCGCCA CCGCCAACAT CAGTGCCCAG
GGCCTGGTTC CGCTGGAGCC GTCGGCGGCG CTGAGCGCAC TCGCCGAGGT CGTCGCGAAC
GGCACCGCAC AGGCCACCGT GATCAAGGCC AACTGGCAGC GTGCGGCCAA GGTGCTGGGC
GCATCGCGGC CACCGCTCCT CGATCTGGTC CTGCCGAGTG CGGCCGGGGA GGTGACCGGT
GACAGCGAAC TGCTCCGGCA GCTGCAGGAG ATCCCGGTCG CGCAGCGGGC CGGGTTCGTC
ACCGAGTTCC TCCAGCGCGA GGTGCAGAAC TTCCTGCGAC TCGCGCAGCC GCCCGCCGCG
TCGAGCCGGT TCCTGGATCT CGGTACGGAT TCCCTGATGG CGATCGAACT CCGCAACCGG
TTGCACAGTC AGTTCGGCGG CGCGTTCACG ATCAACGCGA CCGCGGTGTT CGACTACCCG
ACCATCGGGG GGCTCGCCGA GTATCTGGTG GGTCAGCTGC CCGACGCCGA ATCACCGCCG
GCGGCAACCG CCGATCCGGT GGCCGCCGGA GACGACAGCG GCGGCGCGCC GGAGGTGGAA
GGTGGTTAG
 
Protein sequence
MSHRGRALTA DAREDTNGME SAAYPHVPAN RFAIVGYAAR FPGAQDADEF WDLLRDGREA 
ISEVPQDRWN VDEFFDPEPG APGKVVTRRA GFVDDVTGFD APFFGMSTRE VRLMDPQHRL
LLETAWRAVE HSGTAPTDLA NSNTGVFVGL ATHDYLGMAS DELSYPEIEA YMAIGTSNAA
AAGRISYRLG LQGPAVAVDT ACSSSLVAIH QACQALQLGE CDLALAGGAN VLLTPATMIT
FSNAHMLAPD GRCKTFDAAA DGYVRGEGCG VIVIKRLEDA LRDGDRIRAV IRGSAINQDG
ASGGLTVPNG VAQQRVITDA LKRAGVRAGD VGYLEAHGTG TSLGDPIEAQ AAGEVLGAGR
EPSRPLLIGS AKTNIGHLEA AAGIAGVIKV ILSLEHETLP KHLNFTTPSP HIPWDRLAVK
VVTESTPWER NGRPRIAGVS SFGFAGTNAH VILEEAPEST VTPAATPEPG ERFSLLPLSA
RTPSALVRIA DRYRSWMTTH PEATLADVCF TASTARAHLE QRAALVVDSR ESAVELLGAL
AEDRPAPGLV RGESHDTPKT AWLFTGQGSQ YPGMARELFD TEPVFAETMK QCAAAVADVL
EKPLLDVIFD AHGPEAAETL RQTSYAQPAL FAVEMGLARL WQSWGFEPDV VLGHSVGQYA
AACVAGVLGL EDGARLMAER GRLFGSLPAG GRMVAVFTAA ERVENITDEY PRLSVAAYNG
ANTVLSGPAG DLEKAVATLA ADGVRCDWLD TSHAFHSALL DPILDEFESY AQQIEYAAPQ
RVLIDNRTGA ALGWSAKLDG TYWRRHARQP VEFAKSVRTL AELNCKVLLE IGPRPVLTAA
ALSAWPDPAT APRAIASLRR NTADHRQITE ALADAYVLGH RPDFAAIQRP DAHKLDLPTY
PFEHRQYSFR DNQAERPEQA GPPIHQAART EAVGLLEDGR IEELAALLGD TDGDRQTFDV
LSKLAAQHNQ QRTSQSITDD RYEIRWEELT TASPAATGEP STWIIVGDDT DAARPLIDAV
TARGDRHRLV GSPVSDADEA SLADALRASV DDASAPGGAV RILHIAALDA TTAPSMRTLL
RMQHRILNRT RRLFHAVTTT GLRTPIWLIT RGAQRVTATD TVAPDQSALW GFGRAASLEL
PHLWGGLADL PTGPGASEDE WSRLLDRIGA PRQSDVTEDQ VALRDGAVHV PRLVRRTGQP
SGAPLRLRSD ATYLVTGGLG AIGLEIAGYL AAHGAGNVVL TSRRAPGDAA QQRIDALRDK
FGCAIRVATA DVADAHDVAR LLAGVQAELP PLAGIVHAAG EIGTTALSAM DDESQQAEVD
RVFAGKVWGA WHLSEAAVDL QLDFFISTSS IASVWGGFGQ TAYGAANAFL DGLAWRLREQ
GIAATSVNFG PWSAGMADAE SRARLEQRGV RTLDPADALA GLADVVAGPA TQGVIARIDW
ARFLPLYQQA GRRAFLTELE REVPVASTDA APAVTASGKT PLVERLSGAP VQQRKKLLTD
YLRDAVAEVT RVDSAEIRED AGFFDLGMDS LMAVELRRRL EQGVGKEIPV TLVMDHPRLS
DAADYLLGEV LGLSEQAPAE SRPALASLAS ERTDDPIAIV AVSCRFPGAP DPEAFWDLLS
GGVDAIREVP EDRFDIDEFY DPDPDAAGKT YTRFGGFLDG IDGFDPEFFG ISPREAVWIE
PQQRLMLETV WEGIERAGLS PADLRGSRTG VFVGVAANEY AHLLSSESIE KIEPHFITGN
ALNAISGRVA FALGLEGPAV AVDTACSSAL VAVHQAVQAL HSGDCDLAVA GGVNVLLSPV
TVVAASRARM LSPVGRCKTF DASADGYVRS EGCGILLLKR LSDAVRDGDR VCAVIPGTAV
NQDGASSGLT VPNGGAQQRL IKTALTRAGL TGGDVDYLEA HGTGTPLGDP IEVQAAAAAY
GASRDADRPL LMGSVKSNIG HLESASGAAG LIKVVLSLQH GVLPQSLHFD NPSPHIPWDS
LPVRVVDEAV PWQPNGRPRR AGVSSFGFTG TNAHVLVEEA PQAPVSEDQE YDTDDQPVHV
LPLSARSPEA LVALARRYDS WLGAHPDADL AGVCFTAGTG RSHFEHRAAM VVDSVASARQ
GLADLADNRT RPGVVRGEHT NRPTTAWLFT GQGSQYPRMA RELFDAEPVF AETVTRCADA
VDGMLPLPLL EVLFAADRET AERLRHTSYA QPALFAVEMG LARLWQSWGV TPDVVLGHSV
GQYAAACVAG VFSLEDGARL MAERGRMFGS LPEGGRMVAI FADAKQVEQI AGEFARVSVG
AYNGPNTVLS GPGEDLEQIV ARFADEGIRC TWLETSHAFH SELLDPVLDE FESYAAQFEF
AAPTMPLVCN RTGAVLTGQT PLDAQYWRRH SRQPVQFAES VRTVAALGCS VLMEIGPQPV
LTGAAVQIWP EHLAAPRAIA SLRKGVGDRR QIADALAAAY VGGLRPDFAA LQGQPHHRLE
LPTYPFQRRR FWPKTSSITV DGPATSGILG SAKDLASGDT VYASRLSVKS QPWLSDHVIY
GTVVVPGATY AAMALAAVGT PARVKDVFFY EPIILPEKSS REVQLTLHML GDGEQKFQVH
SRSYGVRDAE WSLNAEGTVV RGVDDAPVSQ DDPVDEAIER CNRMRPQELF ETFADMELAW
GPTWSGSLKS LWLGDGEAIG DVLVGAELAE QLGSEPIHPV LMDLCTGVAF PAFPALLAAE
QGVSDLFLPL RYGQVTVQEK MPRRFYCRAK WHHSELDSET QVFDLDFISR DGRPLGGIRE
FTVKRAPREA LLRGLGGDAT RLLYTLGWHE VPLPAVDPAA PNGNWLIAGF DELADAVPGC
IPFDRTTDPE PLGQLLTQAH ERGMAFSGVV WRAAAPKPDE SSADVAARIE TEIANLLSAV
HAVQRGEVKL PGGLWIVTER AVACESGEPV DPVQAALWGF GRTTINEEPA LRCKLVDCDG
SPEAVEALSA LLTTPVDEPE LALRQGKLLA SRLLHWARSG HLTVPRSTDY VLAPTERGAI
DNLRLTETEV PPPAEGYVQV KVEAAGLNFR DVLNVLGLYP GDPGPIGGDF AGVVTQLGDG
VGSGRAERRD GIKLEVGQRV YGFMQGAFSS RFNVPAQLLA PIPDGVGAVE AATIPAAALT
ARLAFDWAQL EPGDRVLIHA ASGGVGLAAI QLAQQHGAVV FATASTYKRA TLRKMGVEYV
YDSRSTDFAD QILADTDGAG VDVVLNSLTN EGFVEATVRA TAQNGRFAEI AKRDIWTHEQ
MAAARPDISY EIVALDTVTI QEPERIRGLL GEVSDGLGKG EWAPLPAEIY PLTEARAAFR
RMQQARHIGK IVVQMPSPLQ PRPDRSYLIT GGLGAIGLHT ASYLAQLGAG DIVLTSRREP
DADTQQVIDE ITERHRCRIH TFAADVGDES QVEELLERIR AELPPLAGVA HLAGVLDDAL
LSQQSVERFR TTLAPKAFGA YHLDHLTRDD DLDFFIVSSS VSSLFGSPGQ ANYATANALL
DGLVARRRAH GLPATGVNFG PWAQGGMASS EAATANISAQ GLVPLEPSAA LSALAEVVAN
GTAQATVIKA NWQRAAKVLG ASRPPLLDLV LPSAAGEVTG DSELLRQLQE IPVAQRAGFV
TEFLQREVQN FLRLAQPPAA SSRFLDLGTD SLMAIELRNR LHSQFGGAFT INATAVFDYP
TIGGLAEYLV GQLPDAESPP AATADPVAAG DDSGGAPEVE GG