Gene Mmcs_2044 details


Gene Information

Locus tag: Mmcs_2044
Symbol: ispG
ID: 4110877
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 2193119
End bp: 2194300
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 68%
IMG OID: 638031165
Product: 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase
Protein accession: YP_639208
Protein GI: 108799011
COG category: [I] Lipid transport and metabolism
COG ID: [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis
TIGRFAM ID: [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase
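
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2193119, 2194300)
print(length_bp)                  # 1182
print(protein_length(length_bp))  # 393
```

Both results agree with the Gene Length and Protein Length fields of this record.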


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.119477
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGACTTCCG GCCCCGCCAT CGGGCTTGGT ATGCCGCCCG CACCCCCGCC GGTGCTGGCA 
CCGCGGCGCA AGACCCGTCA GCTGATGGTG CGCGACGTCG GCGTGGGCAG CGATCATCCG
ATCTCGGTCC AGTCCATGTG CACCACCAAG ACCCACGACA TCAACTCGAC CCTGCAGCAG
ATCGCCGAAC TCACCGCGTC GGGCTGCGAC ATCGTCCGGG TGGCGTGCCC GCGGCAGGAG
GACGCCGACG CGCTGCCGAT CATCGCCAAG AAGTCGAAGA TCCCGGTGAT CGCCGACATC
CACTTCCAGC CGAAGTACAT CTTCGCCGCG ATCGACGCCG GATGTGCGGC GGTGCGCGTC
AACCCCGGCA ACATCAAGGA GTTCGACGGT CGGGTCAAGG AGGTGGCCAA GGCCGCCGGT
GACGCCGGCA TCCCGATCCG CATCGGCGTC AACGCCGGAT CGCTGGACAA GCGATTCCTG
CAGAAGTACG GCAAGGCCAC GCCCGAGGCG CTCGTCGAGT CGGCGCTGTG GGAGGCCTCG
CTGTTCGAGG AGCACGGCTT CGGCGACATC AAGATCAGCG TCAAGCACAA CGACCCCGTC
GTGATGGTCG CGGCCTACGA GTTGCTGGCC GCCCGCAGCG ACTACCCGCT TCACCTCGGT
GTCACCGAGG CCGGCCCGGC GTTCCAGGGG ACGATCAAGT CCGCGGTCGC CTTCGGCGCG
TTGCTCTCCA AGGGCATCGG CGACACCATC CGGGTCTCGC TGTCCGCGCC GCCGGCCGAG
GAGGTCAAGG TCGGCAACCA GATCCTCGAA TCGCTCAACC TGCGCCCGCG CGGTCTGGAG
ATCGTGTCCT GCCCGTCGTG CGGACGCGCC CAGGTCGACG TGTACACCCT CGCCAACGAG
GTCACCGCCG GCCTCGAGGG CATGGACGTC CCGTTGCGCG TCGCCGTCAT GGGCTGTGTC
GTCAACGGTC CCGGCGAAGC CCGCGAAGCC GATCTCGGGG TGGCCTCCGG CAACGGCAAG
GGTCAGATCT TCGTCAAGGG TGAGGTCATC AAGACCGTGC CCGAGGCGCA GATCGTCGAG
ACGCTGATCG AGGAGGCCAT GCGCATCGCG GAGGAGATCG GCGCCGCCGG TGACAGCCCC
GAGGGAAGTC CCAGCGGTTC GCCGGTTGTG ACCGTAAGCT GA
 
Protein sequence
MTSGPAIGLG MPPAPPPVLA PRRKTRQLMV RDVGVGSDHP ISVQSMCTTK THDINSTLQQ 
IAELTASGCD IVRVACPRQE DADALPIIAK KSKIPVIADI HFQPKYIFAA IDAGCAAVRV
NPGNIKEFDG RVKEVAKAAG DAGIPIRIGV NAGSLDKRFL QKYGKATPEA LVESALWEAS
LFEEHGFGDI KISVKHNDPV VMVAAYELLA ARSDYPLHLG VTEAGPAFQG TIKSAVAFGA
LLSKGIGDTI RVSLSAPPAE EVKVGNQILE SLNLRPRGLE IVSCPSCGRA QVDVYTLANE
VTAGLEGMDV PLRVAVMGCV VNGPGEAREA DLGVASGNGK GQIFVKGEVI KTVPEAQIVE
TLIEEAMRIA EEIGAAGDSP EGSPSGSPVV TVS
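
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is a minimal example, not the method used to generate this record: it builds the codon-to-amino-acid mapping shared by NCBI translation tables 1 and 11, ignores the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here), and stops at the first stop codon. The helper name `translate` is hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 42 bases of the gene sequence above.
print(translate("ATGACTTCCG GCCCCGCCAT CGGGCTTGGT"))
print(translate("GAGGGAAGTC CCAGCGGTTC GCCGGTTGTG ACCGTAAGCT GA"))
```

The two fragments translate to the first ten and last thirteen residues of the protein sequence listed above; the final fragment is in frame because the 1140 bases preceding it are a multiple of three.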