Gene Mmcs_3468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_3468 
Symbol 
ID4112300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp3687327 
End bp3688658 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content70% 
IMG OID638032603 
Productbeta-ketoacyl synthase 
Protein accessionYP_640631 
Protein GI108800434 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGGCG CGAACCGACT CGCCGCCGAA CCGGATCCGG TCGTCATCGT CGGAATGGCG 
GTGGAGGCGC CCGGCGGCAT CGACACTCCA GAGGACTACT GGACGCTGCT GTCCGAACAG
CGCGAGGCGC TGTGTCCGTT TCCCACCGAC CGCGGCTGGT CGCTTCGCGA ACTGTTCACC
GGGTCCAGGC GCGAGGGTTT CAAACCGATC CACGATCTCG GCGGATTCCT TACCAGCGCA
TCGACATTCG ATCCCGAGTT CTTCGGCCTC TCGCCGCGCG AGGCCGTCGC GATGGACCCG
CAGCAGCGGG TGGCGCTGCG CCTGGCGTGG CGTGCGCTGG AGAACAGCGG CATCAACCCC
GACGACCTGG CCGGCCACGA CGTCGGCTGC TATCTCGGGG CCTCGGCACT GGAGTACGGG
CCCGAACTCA GCGAGTTCTC CCACCACAGC GGCCACCTGA TCACCGGCAC GTCGCTGGGT
GTCGTCTCCG GGCGGATCGC CTACACCCTC GATCTGGCCG GCCCCGCCGT CACCGTCGAC
ACCTCCTGCT CCTCGGCGCT GTCCGCCTTC CACCTCGCCG TGCAGGCGCT GCGATCCGGT
GACTGCGACC TCGCGCTGAC CGGCGGGGTG TGCGTGATGG GGTCCCCCGG ATACTTCGTC
GAATTCTCCA CACAGCACGC GCTCTCCGAC GACGGCCACT GCCGGCCCTA CAGTGCACAC
GCCAGCGGCA CCGTCTGGGC CGAGGGCGCG GGACTGTTCG TGCTGCAACG CAAGTCAGGC
GCGGTGCGCG ACGGCCGTGA CATCCTCGCC GAGGTCCGCG CCAGCAGCAT CAACTCCGAC
GGCCGCACCG TCGGGCTGAC CGCGCCCAGC GAACGTGCCC AGAGCCGCCT GTTCGGCAAG
GCGATCCGAG AGGCCGGGAT CTCATCCGAG GACGTCGGCA TGATCGAAGG TCACGGCACC
GGAACGCGGC TGGGTGACAA GACCGAACTG CGCTCGCTGG CGGCCACCTA CGGTGACACC
GCCCCCGCCC GTGGACCGCT GCTCGGCTCG GTGAAGTCCA ACATCGGGCA CACCCAGTCG
GCGGCGGGCG CCCTCGGGCT GGCCAAGGTG CTCGTTTCCG CCGAGCGCGC CACCATCCCG
CCCACCCTGC ACGCCGACGA GCCCAGTCGC GAGATCGACT GGGACACCAG CGGCCTTCGG
TTGGCGAACA AGCTCACCCC CTGGCCGGCC CGCAACGGCG AACGCGTGGG CGCGGTCTCG
GCGTTCGGTA TGAGCGGCAC CAACACCCAC GTGGTCGTGG CCGTGCCGGA CCGGCCCCGG
GAAGCGAAGT GA
 
Protein sequence
MTGANRLAAE PDPVVIVGMA VEAPGGIDTP EDYWTLLSEQ REALCPFPTD RGWSLRELFT 
GSRREGFKPI HDLGGFLTSA STFDPEFFGL SPREAVAMDP QQRVALRLAW RALENSGINP
DDLAGHDVGC YLGASALEYG PELSEFSHHS GHLITGTSLG VVSGRIAYTL DLAGPAVTVD
TSCSSALSAF HLAVQALRSG DCDLALTGGV CVMGSPGYFV EFSTQHALSD DGHCRPYSAH
ASGTVWAEGA GLFVLQRKSG AVRDGRDILA EVRASSINSD GRTVGLTAPS ERAQSRLFGK
AIREAGISSE DVGMIEGHGT GTRLGDKTEL RSLAATYGDT APARGPLLGS VKSNIGHTQS
AAGALGLAKV LVSAERATIP PTLHADEPSR EIDWDTSGLR LANKLTPWPA RNGERVGAVS
AFGMSGTNTH VVVAVPDRPR EAK