Gene Amir_4699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_4699 
Symbol 
ID8328897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp5594239 
End bp5606481 
Gene Length12243 bp 
Protein Length4080 aa 
Translation table11 
GC content77% 
IMG OID644945143 
ProductMycocerosate synthase., Erythronolide synthase 
Protein accessionYP_003102375 
Protein GI256378715 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR00128] malonyl CoA-acyl carrier protein transacylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCACC CCACAGCCGG GCAGCACGAG GCCCCCGGCG GGCCGCAGGA CGCCGTCGCC 
GTCATCGGGA TGGCCGTGCG CCTGCCGTCG GCGGACGGCC CCGACGCGTT CTGGGCGCTG
CTCCGCGACG GCGTGGACGC CATCACCGAC GTTCCCGCCG GGCGCTGGGA GCAGGCGTCA
CCCGGTGTGA CCCGAGGAGG GTTCCTCGAC TCGGTCGCGG AGTTCGACGC GGACTTCTTC
GGCGTCTCGC CGCGCGAGGC GGCGGCGATG GACCCGCAGC AGCGCCTGGT GCTCGAACTG
GCCTGGGAGG CCGTCGAGGA CGCCAGGATC GTCCCCGCCG ACCTGCGCGG CAGCCGCACC
TCGGTGTTCG TCGGCAGCAT CCGCGACGAC TACGCCGCCC TGCTCTACCA GCACGGCACG
TCGGCGATCA CGCCGCACAC CGTGACCGGC ACGCACCGGG GCATCATCGC GAACCGGGTG
TCCTACTCGC TGGACCTGCG CGGCCCGAGC GTCACCGTGG ACGCCGCGCA GGCGTCGTCG
CTGGTGGCCG TGCACCTGGC GGCGGAGGGG CTGCGCGACG GCGCGGTCGA CCTGGCCATC
GCGGGCGGCG TGAACCTGAA CCTGCTCGGC GAGCAGACCG TCGGCCTGGA GCGCTTCGGC
GCCCTGTCCC CGGACGGCCG CAGCCGCCCC TTCGACGCCG CGGCCAACGG CTACGTCCGG
GGCGAGGGCG GCGCGGTGGT GCTGCTCAAG CGGCTGGACC GGGCGCTGGC CGACGGCGAC
CCGGTGCACG CGGTGCTGCT GGGCAGCGCC GTCAACAGCG ACGGCGCCAC CGAGGGCCTG
ACCGTGCCGG GGGAGCGGGC GCAGACCGAG GTCGTCCGCG CCGCGTTCGC CGCCGCGGGC
GTGGCGCCCG CGAGCGCCCA GTACGTGGAG CTGCACGGCA CCGGGACCCC GGTGGGCGAC
CCGATCGAGG CGCGCGCGCT GGGCGCCGCG ATGGCGGGCC GGGAGGACGA CCCGCTGCGG
GTCGGCTCCG CCAAGTCGAA CGTCGGCCAC CTGGAGGGCG CGGCGGGCAT CGTCGGCCTG
GTGAAGACCG CGCTCGCCAT CCGGCACCGC AGGCTGCCCG CGACGCTGAA CTTCGCGACG
CCGAACCCGG AGATCCCCCT GGGGGAGCTG GGGCTGCGCG TGCACGCGGA ACTGTCGGAC
TGGCCCCACC CCGACCGGCC GCTCGTGGCG GGCGTCAGCT CGTGGGGGAT GGGCGGCACG
AACGCGCACG TCGTGCTCAC CGAGGCCCCG GCCCCGGTCC GCCGCCCCGC GCACCCGGAC
GGGGGCGGGG CCGCTGCCGG AACCGCGCCC GCCGCGGAGA CCGCGCCCGC CGCTGCGGGC
GAGCCTGCCG CCGGGAGCGG ACGCCCGAGG GCCGCCGAGG GCTTGCCGGC CGCCGGGAAC
CCGCCCGCCG CCGAGACCGG GCTCATCGCC GGGAGCGCGC CCGCCGCCGA GGCCGGGCGC
GGGCTGGCCG CCGACGGGCT GGCCGCCGAC GGGCTGGCCG CCGACGGGCA GGCCGCCGAG
GCCGCGGTCG TGGGCGACCC CGCCGCCGCG CCCGAGCACG CGCCCGCCGC CGCGCCCCCG
CCCGTTTCGG CGCCGCCCAC CGCCCACGTC CCGTGGCTGC TCTCCGGGCG CAGCCGTGCC
GCCGTGCAGG CCCAGGCCGC CGCGCTGGCC GCGCACCTCA CCGACGAGCG GCCCGTCGAC
GTGGCCCTCT CGCTGGCCAC CACCAGGACC GCCTTCGAGC ACCGCGCCGT CGTCGTCGGT
TCCGGGCGTG CCGAGCTGGA CGCCGGGCTC GCCGCCCTCG CCTCGGGCGA GGCCGCCGCC
AACCTGGTGC GCGGCGAGGT CGGGGCCACC GGCGGGGTCG TGCTGGTCTT CCCCGGCCAG
GGCTCGCAGT GGGCGGGCAT GGCCGTCGAG CTCGCCGACT CCTCCCCGGT CTTCGCCGGG
CACCTCGCCG CGTGCGGCCG GGCGCTCTCC GCGTTCACCG ACTGGTCACT GGACGAGGTC
CTGCGCCAGG TCGACGGCGC GCCGGGGCTG GACCGGGTGG ACGTCGTGCA GCCCGCCCTG
TTCGCGGTCA TGGTGTCGCT GGCCCGCCTG TGGCAGCACC ACGGCCTGCG CGTGGACGCG
GTCATCGGCC ACTCCCAGGG CGAGATCGCC GCCGCGCACG TCGCGGGCGC CCTCTCGCTC
GACGACGCCG CCCGCGTCGT CTGCCTGCGC AGCAAGGCGA TCCGCGCGCT CGCGGGCACC
GGCGGCATGG CCGCGGTGAC CCTGCCCGCC GCCGAGGTCG CCGAGCGGAT CGCCGAGCAC
GGCGGCCGGG TCTCGATCGC CGCCGTCAAC GGCCCCGCGA ACACCGTCGT CGCGGGCGAC
CCGGAGGCGC TGCGGGAGAT CGTCGCCGAC TACCAGGCGC GGGACGTGCG AGCCTCGGTC
ATCGCCGTCG ACTACGCCTC GCACTCCGCC GACGTGGAGA GCCTGCGCGC GGAGCTGCTC
GAACTCCTCG AACCGGTCCG CCCGCGCCAC GCCGACGTCC CGTTCTACTC CACCCTGACC
GGCGAGGTGC TCGACACCAC CGGCATGGAC GCCTCCTACT GGTTCCGGAA CCTGCGCCAC
ACCGTGAACT TCCACGACAC CGTCCGCCTG CTCCTGTCCG ACGGCCACCG CGCGTTCGTC
GAGACCAGCC CGCACCCGGT GCTCACCCAC GCCGTCAAGT CGACGATCGA GGCCGAGGGC
GTCGCGGGCG CCACCGCGCT CGGCTCGCTG CGCCGCGACG ACGGCGGCCC CCGCCGCTTC
CTGCTCGCGC TGGGCGCGGC GCACGCGCAC GGCCTCGCCC TCGACTGGAC CGGACCGCTC
ACCGACGCCC CGCAGCTCGT CGACCTGCCC ACCTACGCCT TCCAGCGCAA GCACTTCTGG
TTCTCCGGCG ACGGCGTGAT CACCCGCGCC GCCACCCCCG CGAGCGCGAA CGCCCCCGCG
CCGCAGGACG AGCGCACCGG CGCGCTGCGC GCCCGCCTCA CCGGCCTGTC CGACCGCGAG
CGGCTGCGCG TGCTGGTCGA CCTGGTGCGG GCCAACGCCG CCCCGGTGCT CGGCTTCGAC
GGACCGGCCG ACGTCGCCGA GGGCCGCACG TTCAAGGACC TCGGCATGGA CTCGGTCACC
GCCGTCGAGT TCCGCGACCA GCTCGGCGCG GCCACCGGCC TGGCCCTGCC CGCCGCGCTG
ACCTTCAACC AGCCCACCCC GCGCGCGCTC GCCGAGCACC TCGACCGCGA GCTGGGCGGC
GCCACCGCCG AGCCGGCCGT GGACGGCGCG ACGCCCACCT CCGACGACCC GATCGCCGTG
GTCGGCATGG CCTGCCGCTT CCCCGGCGGG GTGCGCTCCC CGGAGCAGCT GTGGGAGCTG
GTGCGCGACG GCGTGGACGC CATCTCCGAG TTCCCGGACA ACCGGGGCTG GGACACCGAG
GCGCTGCACG ACCCCGACCC GGACAAGCGC GGCTCCACCA ACGCCCGCTA CGGCGGTTTC
CTGCACGACG CCGACGAGTT CGACGCCGCG TTCTTCGGCA TCAGCCCCCG CGAGGCCACC
GCCATGGACC CGCAGCAGCG GCTGCTGCTG GAGGTCGCGT GGGAGGCGTT CGAGCGGGCG
GGCATCGACC CGGCCACGCT GCGCGGCGCC GACGCGGGCG TGTTCGTCGG CGCGATGGCC
CAGGACTACG GCCCGCGCCT GCACCAGGGC GCCGAGGGCC ACGACGGCTA CCTGCTCACC
GGCAACCAGC TCAGCGTCGC GTCCGGCCGC CTCGCCTACA CCTTCGGCCT CGAGGGTCCG
GCGCTGACGA TCGACACGGC GTGCTCGTCG TCGCTGGTGG CGATCCACCT GGCCGGCCAG
GCGCTCGCCA GGGGAGAGAC CTCCCTCGCC CTCGCGGGCG GCGTCACGGT CATGTCCACG
CCCGGCATCT TCGTCGAGTT CTCCCGTCAG CGCGGGTTGT CGGCCGATGG CCGGTGCAAG
GCGTTCTCGG CCGACGCCGA TGGCACGGGC TGGTCGGAGG GCGCTGGTCT GCTGGTGCTG
GAGCGGTTGT CGGACGCGCG TCGCAACGGG CACCCGGTGC TGGCGGTGGT GCGGGGCACG
GCGGTGAACC AGGACGGGGC GTCGAACGGC CTGACCGCGC CCAACGGTCC CTCGCAGGAG
CGGGTCATCC GGCGGGCGCT GGGAGCGGCG GGTCTGCGTC CGTCCGATGT GGACGTGGTG
GAGGCGCACG GCACGGGCAC CTCCCTCGGT GACCCGATCG AGGCGCAAGC GCTGCTCGCC
ACCTACGGGC AGGACCGGGA GACCCCGCTG TACCTGGGGT CCCTCAAGTC CAACATCGGC
CACACCCAGG CCGCCGCGGG CGTCGCGGGC GTCATCAAGT CCATCGAGGC CATCCGCAAC
GGGATCGTGC CCAGGACCCT GCACGCCGAC GAGCCCACCC CGCACGTCGA CTGGGAAGCG
GGCGCGGTGC GCCTGGCCAC CGACAGCGCC CCGTGGCCCG ACACCGACCG GCCGCGCCGC
GCGGGCGTCT CGTCGTTCGG CATCTCCGGC ACCAACGCGC ACGTCATCGT CGAGCAGGCC
GAACCGGCCG ACACCCCCGC CGCGCCAGCG GAATCCGACA CCACGACCGG CCCGCTGCCG
TGGGTGCTCT CCGGCCGGGG CGCAGGCTCG CTCGCCGCGC AGGCCGAACG CCTCCTGGGG
TTCGCCACCT CCACCGAGCA CACCACCGCC GACGTCGGCT GGTCCCTGGC CACCACCAGG
ACCGCGCTCG ACCACCGGGC CGTCGTGTGG GGCGAGGACC GCCAGGACCT CCTCGCGGCG
CTCGCCGCCG TCGCGCGTGG CGAGAGCGGC CCCACCGCCG TCACCGGCCA GGCCGCCCCG
ACCCGCACCG CGTTCCTGTT CACCGGCCAG GGCGCGCAGC GCGCCCGCAT GGGCCTCGCC
CTCGCCGAGG CGTCCCCGGT CTACGCGGCA GCGTTCGCCG AGGTGTGCGC CGCGCTCGAC
CCGCACCTGG ACCGCCCGCT GCGCGAGGTC GTCGACAGCG GCGACGGCCT GGACGAGACC
GGCTTCACCC AGCCCGCCCT GTTCGCCGTC GAGGTGGCCC TGTTCCGGCT CCTCGCCGAC
CACGGCGTGC GCCCGGACTT CCTGGCGGGC CACTCCATCG GCGAGCTCGC CGCCGCGCAC
GTCGCGGGCG TGTTCTCCCT GGACGACGCG GCCCGCCTGG TCGCCGCGCG CGGCAGGCTC
ATGCAGCGAC TTCCGCGCGA CGGCGTCATG ATCGCCGTCG AGGCCACCGA GGACGAGGTG
CTCGCCGAAC TGGCCGGGCA CGGCGACCGG GTGGGCGTCG CGGGCGTCAA CGGCCCCACC
GCGCTGGTCG TCGCGGGCGA GACCGAGGCC GCCACCGCGG TCGCCGAGGC GCTCGCCGCG
CGCGGCAGGC GCACCAAGCG CCTCGAGGTC AGCCACGCCT TCCACTCGCC GCTCATCGAG
CCGATGCTCG CCGAGTTCGC CGAGGTCGCG GCCACCGTCG CCTACGCGGC CCCGCGCGTG
CCCCTGGTCT CCACCGTCAC CGGCCGCCTC GCCGACCCCG CCGAGCTGGC TGCCCCGGAC
TACTGGGTCG GCCAGGCCCG CGCCGCCGTG CGCTTCTCCG ACGCGGTCGG CGCCCTGCTC
GACGAGGGCG TCACCGCGCT CGTCGAACTG GGCCCCACGG CCGTGCTGTC CGCGCTGGTC
CCCGCCATCG CCGCGGGGGC GGGCGTCGAC GTGGTCGCCG CGCCGCTGCT GCGCGCCGAC
CGCGACGAGC CCCGCTCGGT GCTGTCCGCG CTGGCCGCCC TGTTCGTCGC GGGCGTCGAC
GTCGACTGGG CCGCCCCGTA CCGGGGCGGC CGGGCGCGCC GCGTCGACCT GCCCACCTAC
GCGTTCCGCC CGCAGCGGTT CTGGCTCGAC GCCCCCACCG GCGGCGACGT CACCGGGGTG
GGCCTCGCCC CCGCCGGGCA CCCGCTGCTG GGCGCGGCCG TCGACCTGGC GGGCGGCAGG
CTCGTGCTCA CCGGCAGGCT CGCCGCCACC AGCCACCCGT GGCTGGCCGA CCACGCCATC
GCCGGGGCCG CCGTGCTGCC CGGCACCGCC TACCTGGAGC TGGCCCTGTG GGCGGGCGGC
CGGGTCGGCG CGGACCACGT CGACGAGCTG ACGCTGGCCG CGCCGCTGGT GCTGGCCGAG
CGCGGCGGCA CCCTCGTCCA GGTCGTCGTG GACGCCCCCG ACGCCGACGG GTCCCGCGCG
CTGCGCGTGC ACGCCCGCCG CGACCGCGAC GGCGCCGAGT GGGTCGAGCA CGCCACCGGA
ACCCTCACCC CCACCCCACC CGCGGCCGAC ACCGACCTCG GCGCCTGGCC GCCGAACGCC
ACCGAGCTCG ACCTGACCGA CGCCTACGCC CGGCTCGCCG AGCGCGGCTA CGGGTACGGC
CCGGTCTTCC GCGGCCTGGA ACGCGCCTGG CGCGCGGGCG ACGACCTGTT CGCCGAGGTC
GCCCTCCCCG CCGACCAGTG GCCCGCCGCG GCCTCGTTCA CGCTGCACCC GGCGCTGCTC
GACGCCGCGC TGCACCCGCT GCTGCCCGGC GTCGCCGACG ACGGCGACAC CACCGTCCTG
CCGTTCGCCT GGTCCGGCGT CACCGCGCAC GCCGAGGGCG CCAGGTCGCT GCGCCTGCGC
CACCGGGTCA CCCGGCCTGA CCAGGACACC CTCGTGGTGT CCGTGACCGC CGTCGACACC
TCGGGCGCGC CGGTCCTGAC GGCCCGCTCG CTGGTGCTGC GCCCGGTGTC CCGCGCGGCG
CTGCGCGACA GCGGCGGCGC CACCCTGCTG CGCCCGGAGT GGCGACCCGC CCAGACCGGC
GGGCCCGTGC CGTGGTCGGC GGTCGGCGCG GAGGGCTTCG TGGACGTGTT CGCCCTGCCC
GGCGACGCCA CCCCCACCCG CGCCTACTAC GACCTGCCCG ACCTGGCCCA GGCGCTCGGC
GGCGACGACG TGCCGCGCCA CGCGGTCGTC CTGCTGCCCG AACCGGACGG CGACCCGGCG
CGCGCCGCCC ACGCCACCAC CCGCGCCGCG CTGGAGCTGC TCCAGGTGTG GCTGGCCGAC
CCGAGGCTCC AGGACACCGC GCTGGCCGTG GTGACCACCG GCGGCGGCGG GGCGCGCCCC
GACGAGTCCC CGCACCACGC GGGCGTCTGG GGCCTGCTGC GCAGCGCCCA GTCCGAGCAG
CCCGGCCGGT TCGCCCTCGT CGACCACGAC GGGACCGCCG AGTCGTGGGC CGCGCTGCCC
TCGGCGCTGG GCACGGGGGA GCCGCAGCTG GCGCTGCGCG CCGGAGAGGT CCTCGTGCCC
CGCCTGGTCG CGGCCGAGCC CGAGCCTGCC ACCGCCGTCG CGTGGGACGC GGGCACGGTC
CTGATCACCG GCGGCACCGG GGCGCTCGGC GCGCTCGCCG CCAGGCACCT GGTGCGCGAG
CACGGCGTCC GCGACCTGCT GCTGCTCAGC CGCCGGGGCC CCGACGCGCC CGGCGCGGCC
GACCTGGTCG CCGAGCTGAC CGCCGAGGGC GCGCGCGTCA CCGTCACCGC CGTCGCCGCC
CAGGACCGCG CGGCGCTCGC CGACGCCTTG CGCGGCCACG ACGTGCGCGT CGCGCTGCAC
ACGGCGGGCG TCGTCGACGA CGGCGTCGTC ACCTCGCTCG ACGCGCGCGG CCTGACCACC
GCGCTCACCC CGAAGGTCGA CGCGGCCTGG AACCTGCACG AGCTGCTCGG CGACCGCGCG
ACGCTCGTCC TGTACTCGTC GGTCGCGGGC GTCCTGGGCA ACCCCGGCCA GGGCAACTAC
GCGGCGGGCA ACGCGTTCCT GGACGCGCTG GCCCGCCACC GCGCCCACCT CGGCCGGCCC
GCCACCTCGA TCGCCTGGGG CCTGTGGGCC GACAGCAGCG GCATCACCGG CGCGCTCACC
GACACCGACC GCACCCGCCT GGCCCGCAAC GGCGTCCTGC CGCTGACCGG CGAGGCCGGG
CTCGCCCTGC TCGACGCGGC CACCGCCTCC GGCCTGCCCG AGGTGACCGC CGCCGCGCTC
GACCACGCGG CGCTGCGCGC GCTCGGCGAG CGGCTGCCCG CCGTGCTGCG CGGCCTGATC
ACCCCGGCCG CCACCCGCCG CGCCGCGGCC GGGGAGGCCG AGCCGTCCGC GGACGGCGGG
TCCGCGCTGG AGCGCCGCCT GGCCGGGCTG TCCGAGCGGG AGCGCGACAC GGCCGTCGCC
GAGCTGGTCA GGGCCACCGT CGCGCAGGTG CTCGGCCACG CCGACGGCTC CCGCGTCGAG
ATGGCCGCCG CGTTCAAGGA GCTGGGCTTC GACTCGCTGA CCGCCGTCGA GCTGCGCAAC
CACCTGCACA CCGCCACCGG GCTGCGGCTG CCGAGCACGC TCGTGTTCGA CTACCCGTCG
CCCTCGGCGG TCGCGGGCTA CCTGTCCGCG CAGGTCTCCG GCCCCGCCGC GCCCGACGAG
ACCCGGCAGG ACCGGGTCGC CGCCACCGAG GACGACCCGA TCGCGATCGT CGCCATGGCC
TGCCGCTTCC CCGGCGGGGT GCGCTCCCCG GAGCAGCTGT GGGAGCTGGT GCGCGACGAG
GTCGACGCGG TCGGCGACTT CCCCACCGAC CGCGGCTGGG ACACCGACGC CCTCTACGAC
CCGGACCCGG ACCGGGCGGG CCGCACCTAC ACGCGCAGCG GCGGTTTCCT GCACGACGCC
TACGACTTCG ACCCGGAGTT CTTCGGCATG TCCCCGCGCG AGGCGCTGGC CACCGACCCG
CAGCAGCGGC TGCTGCTGGA GGTGGCGTGG GAGGCGTTCG AGCGGGCGGG CATCGACCCG
GCCACGCTGC GCGGCAGCAG CACCGGGGTG TTCGCGGGCG TCATGTACAC CGACTACACC
GAGCAGTCCG GTCAGCTCCC GGCCGAGCTG GAGGGCTACC TGGCCAGCGG CACGGCGGGC
AGCGTCGCGT CCGGCCGCCT CGCCTACACC TTCGGCCTGG AAGGTCCGGC GCTGACGATC
GACACGGCGT GCTCGTCGTC GCTGGTGGCG ATCCACCTGG CCGGCCAGGC GCTGCGCTCC
GGCGAGGCCG ACCTGGCGCT CGCGGGCGGC GTCACCGTCA TGTCCACGCC CAACCCGTTC
ATCGAGTTCT CCCGTCAGCG CGGGTTGTCG GCCGATGGCC GGTGCAAGGC GTTCTCGGCC
GACGCCGATG GCACGGGCTG GTCGGAGGGC GCTGGTCTGC TGGTGCTGGA GCGGTTGTCG
GACGCGCGTC GCAACGGGCA CCCGGTGCTC GCGGTGGTGC GGGGCACGGC GGTGAACCAG
GACGGGGCGT CGAACGGCCT GACCGCGCCC AACGGTCCCT CGCAGGAGCG GGTCATCCGG
CGGGCGCTGG GAGCGGCGGG CTTGCGTCCG TCCGATGTGG ACCTGGTGGA GGCGCACGGC
ACGGGCACCT CCCTCGGTGA CCCGATCGAG GCGCAAGCGC TGCTCGCCAC CTACGGGCAG
GACCGGGAGA CCCCGCTGTA CCTGGGTTCC CTCAAGTCCA ACATCGGCCA CACCCAGGCC
GCCGCGGGCG TCGCGGGCGT CATCAAGTCC GTCCAGGCCA TCCACAACGG CCTGATGCCC
AAGACCCTGC ACGTCACCGA GCCGTCCCCG CACGTCTACT GGGACGCGGG CGCGGTGACC
CTGCTGACCG AGGCCACCCC GTGGCCCGAC GCGCACCGGC CGCGCCGCGC GGGCGTCTCG
TCGTTCGGCA TCTCCGGCAC CAACGCGCAC GTCATCATCG AGCAGGCCCC CGAGCACGGG
CAGCCGGGGG AGGGCGGCGC GGACCCAGTC GTGCCGTGGC TGCTGTCCGC CAAGACCCCC
GCCGCGCTCA AGGCCCAGGC CGACGCGCTC GCGGCCTTCC TCGCCGAGCA CCCCGACACC
CGCCCCGCCG ACGTCGCGCT CACCCTGGCC ACCCGGCGCG CCACCCACGA GCGGCACGCC
GTCCTCGTCG GCTCCGACGC CGACGACCTG CGCGCCCGCC TCACCGCGCT CGACCCGGCC
GCGGCCGGCC GCACCACCGG CGCGGGCAGG CTCGCCGCGC TGTTCACCGG CCAGGGCGCG
CAGCGCGTCG GCATGGGCAT CGACCTCGTC CACGCCCACC CCGCCTACGA GGCCGCGTTC
GACGAGGTGT GCGCCGCGCT CGACCCGCAC CTGGGCCGCT CGCTGCGCGA GGCCGTCACC
ACCGGCGAGG CGCTCGACGA GACCTGGCTG ACCCAGCCCG CGCTGTTCGC CGTCGAGGTC
GCCCTGTTCC GGCTGTGGCA GTCCTGGGGC GTGCGCTTCG ACTTCCTGGC GGGGCACTCC
ATCGGCGAGC TCGCCGCCGC GCACGTCGCG GGCGTGTTCT CCCTGGACGA CGCGGCCCGC
CTGGTCGCCG CGCGCGGCAG GCTCATGCAG CAGCTGCCCG CGACCGGCGT CATGATCGCC
GTGGAGGCCA CCGAGGAGGA GGTCCGCGCG GGGCTGGCCG GTCAGCCGGG GCTGGTCGAC
GTCGCCGCCG TCAACGGGCC GCGCGCCGTC GTCCTCGCGG GCGAGGAGCA GGCCACCCTG
GCAGCCGCCG AGCCGTGGCG CGCCCTCGGC AGGCGCACCC GCAGGCTCAA GGTCAGCCAC
GCCTTCCACT CCCCGCTCGT GGAGCCGATC CTCGACGCGT TCGCCGAGGT CGCCGCCACC
GTCACCTACC ACCGGCCGAC CATCCCGATC GTCTCCACCC TCACCGGCGC CGCCGACGCG
CCGGTGGACA CCCCCGACCA CTGGGTGCGG CACGTGCGCG GCGCCGTCCG GTTCGCCCCC
GCCACCACCG CGCTGGTCGC GCTCGGCGCC ACCACGTTCC TGGAGGTCGG GCCGGACACC
GTGCTCGCCA CCATGGCCGA GCAGGTGCTC GACGCGCTGC CCGAGCGGCG CGACCGGGCC
GCCCTCGCCT CCACCCGCCG CGACCGCCCC GAGGTCGACA CCACCGCCGA GACCCTGGCG
CTGCTGCACA CCAGGGGAGT GGCCGTCGAC TGGGCCGCGT TCCTCGACGG CGCCGGGGCC
AGGCACGTCG ACCTGCCCAC CTACGGCTTC CAGCGCGACC GGTACGCGCT CGTCGTCGCC
CCCGGCGCGG GCCGGACCGC CGCCGCCCCC GACTGGGAGG AGATCCACCC GCCCCGGCCC
GCCGGGCCGA GGTCCCTCGT CGTGCTCGAC CTGGACCACG GCGCGGCCGA CCTGCCCGGC
CTGCCCGTCG TCACCACCGC CGCCGACGTG CCCGCCGACG CCGACGCGGT GCTGCTGCCG
CTGGCCGTGC ACCTCACCGC CGACCCGGCG GACACCCCCG AGGCCCGCTA CGCCCGCACC
AGGGGCACCC TGGACCTGGT CCAGGGCTGG GACGGCCCCC GGCTCGTCGT CACCACCACC
GGGATCGCCC CGGCGCTGGG CGGCGACCGG GCGGGCGAGG ACGCCAGGCT CGCCTGGAAC
CTGCTGCGCG CCGCCGCCGA GCACAACGGG CGGGTCCTGC TGGTCGACCT GGACACCGAC
CGGCCCGACC CCGACGTGCT CGCGGCCCTG CTCGCGGCGG GCGTCCCGCT GGCCGCCGCG
CGCGGCGACC GGCTCCTGGT CCCGCCCCGC GAGGGCGCGC CCACCGCCGA GCCGGTCACC
TCGCGGCTGC TCGCCGACCT CGCCGCCGCG CCCCCCGCCA AGCACCGGAC GATCCTGCTG
GCCGCCGTGC TCGCCGAGGT CGCCTCGGTG CTGCACCGCG ACGACGCCGA CGGCATCGAG
GAGGACCGGG CCCTCCAGGA GCTCGGCTTC GACTCGCTCA CCTCGGTCGA CCTGCGCAAC
CGGCTCAACG CCGCCACCGG CGCGGCGCTG CCCGCCACCG CCGTGTTCGA CCACCCGACC
CCGGCCGCCC TCGCCGACCA CCTGCTCGGG CTGCTGGCGC CCGGCGGGCC GGAACCCGCC
GCGCCGCTGC ACGCCGAGCT GGACCGGCTG GAGTCGCTGC TCGCCACCGC CCCGCGCGGC
GGCGACGGCG CGGAGTCGAC CGCCGAGGTC GCCGACCGGC TGCGCGCCAT CCTCGCGCGG
CTCACCGAAC CCACCGCCGC CGCGAACGCC CTGGGCGAGG ACCCGGTGGG GCAGCTGGCG
GAAGCCACCG CGGACGACCT GTTCGCGTTC ATCGACAACG AGCTGGGCCG CACCGCGGGC
TGA
 
Protein sequence
MDHPTAGQHE APGGPQDAVA VIGMAVRLPS ADGPDAFWAL LRDGVDAITD VPAGRWEQAS 
PGVTRGGFLD SVAEFDADFF GVSPREAAAM DPQQRLVLEL AWEAVEDARI VPADLRGSRT
SVFVGSIRDD YAALLYQHGT SAITPHTVTG THRGIIANRV SYSLDLRGPS VTVDAAQASS
LVAVHLAAEG LRDGAVDLAI AGGVNLNLLG EQTVGLERFG ALSPDGRSRP FDAAANGYVR
GEGGAVVLLK RLDRALADGD PVHAVLLGSA VNSDGATEGL TVPGERAQTE VVRAAFAAAG
VAPASAQYVE LHGTGTPVGD PIEARALGAA MAGREDDPLR VGSAKSNVGH LEGAAGIVGL
VKTALAIRHR RLPATLNFAT PNPEIPLGEL GLRVHAELSD WPHPDRPLVA GVSSWGMGGT
NAHVVLTEAP APVRRPAHPD GGGAAAGTAP AAETAPAAAG EPAAGSGRPR AAEGLPAAGN
PPAAETGLIA GSAPAAEAGR GLAADGLAAD GLAADGQAAE AAVVGDPAAA PEHAPAAAPP
PVSAPPTAHV PWLLSGRSRA AVQAQAAALA AHLTDERPVD VALSLATTRT AFEHRAVVVG
SGRAELDAGL AALASGEAAA NLVRGEVGAT GGVVLVFPGQ GSQWAGMAVE LADSSPVFAG
HLAACGRALS AFTDWSLDEV LRQVDGAPGL DRVDVVQPAL FAVMVSLARL WQHHGLRVDA
VIGHSQGEIA AAHVAGALSL DDAARVVCLR SKAIRALAGT GGMAAVTLPA AEVAERIAEH
GGRVSIAAVN GPANTVVAGD PEALREIVAD YQARDVRASV IAVDYASHSA DVESLRAELL
ELLEPVRPRH ADVPFYSTLT GEVLDTTGMD ASYWFRNLRH TVNFHDTVRL LLSDGHRAFV
ETSPHPVLTH AVKSTIEAEG VAGATALGSL RRDDGGPRRF LLALGAAHAH GLALDWTGPL
TDAPQLVDLP TYAFQRKHFW FSGDGVITRA ATPASANAPA PQDERTGALR ARLTGLSDRE
RLRVLVDLVR ANAAPVLGFD GPADVAEGRT FKDLGMDSVT AVEFRDQLGA ATGLALPAAL
TFNQPTPRAL AEHLDRELGG ATAEPAVDGA TPTSDDPIAV VGMACRFPGG VRSPEQLWEL
VRDGVDAISE FPDNRGWDTE ALHDPDPDKR GSTNARYGGF LHDADEFDAA FFGISPREAT
AMDPQQRLLL EVAWEAFERA GIDPATLRGA DAGVFVGAMA QDYGPRLHQG AEGHDGYLLT
GNQLSVASGR LAYTFGLEGP ALTIDTACSS SLVAIHLAGQ ALARGETSLA LAGGVTVMST
PGIFVEFSRQ RGLSADGRCK AFSADADGTG WSEGAGLLVL ERLSDARRNG HPVLAVVRGT
AVNQDGASNG LTAPNGPSQE RVIRRALGAA GLRPSDVDVV EAHGTGTSLG DPIEAQALLA
TYGQDRETPL YLGSLKSNIG HTQAAAGVAG VIKSIEAIRN GIVPRTLHAD EPTPHVDWEA
GAVRLATDSA PWPDTDRPRR AGVSSFGISG TNAHVIVEQA EPADTPAAPA ESDTTTGPLP
WVLSGRGAGS LAAQAERLLG FATSTEHTTA DVGWSLATTR TALDHRAVVW GEDRQDLLAA
LAAVARGESG PTAVTGQAAP TRTAFLFTGQ GAQRARMGLA LAEASPVYAA AFAEVCAALD
PHLDRPLREV VDSGDGLDET GFTQPALFAV EVALFRLLAD HGVRPDFLAG HSIGELAAAH
VAGVFSLDDA ARLVAARGRL MQRLPRDGVM IAVEATEDEV LAELAGHGDR VGVAGVNGPT
ALVVAGETEA ATAVAEALAA RGRRTKRLEV SHAFHSPLIE PMLAEFAEVA ATVAYAAPRV
PLVSTVTGRL ADPAELAAPD YWVGQARAAV RFSDAVGALL DEGVTALVEL GPTAVLSALV
PAIAAGAGVD VVAAPLLRAD RDEPRSVLSA LAALFVAGVD VDWAAPYRGG RARRVDLPTY
AFRPQRFWLD APTGGDVTGV GLAPAGHPLL GAAVDLAGGR LVLTGRLAAT SHPWLADHAI
AGAAVLPGTA YLELALWAGG RVGADHVDEL TLAAPLVLAE RGGTLVQVVV DAPDADGSRA
LRVHARRDRD GAEWVEHATG TLTPTPPAAD TDLGAWPPNA TELDLTDAYA RLAERGYGYG
PVFRGLERAW RAGDDLFAEV ALPADQWPAA ASFTLHPALL DAALHPLLPG VADDGDTTVL
PFAWSGVTAH AEGARSLRLR HRVTRPDQDT LVVSVTAVDT SGAPVLTARS LVLRPVSRAA
LRDSGGATLL RPEWRPAQTG GPVPWSAVGA EGFVDVFALP GDATPTRAYY DLPDLAQALG
GDDVPRHAVV LLPEPDGDPA RAAHATTRAA LELLQVWLAD PRLQDTALAV VTTGGGGARP
DESPHHAGVW GLLRSAQSEQ PGRFALVDHD GTAESWAALP SALGTGEPQL ALRAGEVLVP
RLVAAEPEPA TAVAWDAGTV LITGGTGALG ALAARHLVRE HGVRDLLLLS RRGPDAPGAA
DLVAELTAEG ARVTVTAVAA QDRAALADAL RGHDVRVALH TAGVVDDGVV TSLDARGLTT
ALTPKVDAAW NLHELLGDRA TLVLYSSVAG VLGNPGQGNY AAGNAFLDAL ARHRAHLGRP
ATSIAWGLWA DSSGITGALT DTDRTRLARN GVLPLTGEAG LALLDAATAS GLPEVTAAAL
DHAALRALGE RLPAVLRGLI TPAATRRAAA GEAEPSADGG SALERRLAGL SERERDTAVA
ELVRATVAQV LGHADGSRVE MAAAFKELGF DSLTAVELRN HLHTATGLRL PSTLVFDYPS
PSAVAGYLSA QVSGPAAPDE TRQDRVAATE DDPIAIVAMA CRFPGGVRSP EQLWELVRDE
VDAVGDFPTD RGWDTDALYD PDPDRAGRTY TRSGGFLHDA YDFDPEFFGM SPREALATDP
QQRLLLEVAW EAFERAGIDP ATLRGSSTGV FAGVMYTDYT EQSGQLPAEL EGYLASGTAG
SVASGRLAYT FGLEGPALTI DTACSSSLVA IHLAGQALRS GEADLALAGG VTVMSTPNPF
IEFSRQRGLS ADGRCKAFSA DADGTGWSEG AGLLVLERLS DARRNGHPVL AVVRGTAVNQ
DGASNGLTAP NGPSQERVIR RALGAAGLRP SDVDLVEAHG TGTSLGDPIE AQALLATYGQ
DRETPLYLGS LKSNIGHTQA AAGVAGVIKS VQAIHNGLMP KTLHVTEPSP HVYWDAGAVT
LLTEATPWPD AHRPRRAGVS SFGISGTNAH VIIEQAPEHG QPGEGGADPV VPWLLSAKTP
AALKAQADAL AAFLAEHPDT RPADVALTLA TRRATHERHA VLVGSDADDL RARLTALDPA
AAGRTTGAGR LAALFTGQGA QRVGMGIDLV HAHPAYEAAF DEVCAALDPH LGRSLREAVT
TGEALDETWL TQPALFAVEV ALFRLWQSWG VRFDFLAGHS IGELAAAHVA GVFSLDDAAR
LVAARGRLMQ QLPATGVMIA VEATEEEVRA GLAGQPGLVD VAAVNGPRAV VLAGEEQATL
AAAEPWRALG RRTRRLKVSH AFHSPLVEPI LDAFAEVAAT VTYHRPTIPI VSTLTGAADA
PVDTPDHWVR HVRGAVRFAP ATTALVALGA TTFLEVGPDT VLATMAEQVL DALPERRDRA
ALASTRRDRP EVDTTAETLA LLHTRGVAVD WAAFLDGAGA RHVDLPTYGF QRDRYALVVA
PGAGRTAAAP DWEEIHPPRP AGPRSLVVLD LDHGAADLPG LPVVTTAADV PADADAVLLP
LAVHLTADPA DTPEARYART RGTLDLVQGW DGPRLVVTTT GIAPALGGDR AGEDARLAWN
LLRAAAEHNG RVLLVDLDTD RPDPDVLAAL LAAGVPLAAA RGDRLLVPPR EGAPTAEPVT
SRLLADLAAA PPAKHRTILL AAVLAEVASV LHRDDADGIE EDRALQELGF DSLTSVDLRN
RLNAATGAAL PATAVFDHPT PAALADHLLG LLAPGGPEPA APLHAELDRL ESLLATAPRG
GDGAESTAEV ADRLRAILAR LTEPTAAANA LGEDPVGQLA EATADDLFAF IDNELGRTAG