Gene Mkms_2090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_2090 
SymbolispG 
ID4613644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp2212103 
End bp2213284 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content68% 
IMG OID639791755 
Product4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase 
Protein accessionYP_938078 
Protein GI119868126 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.378407 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.448978 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTCCG GCCCCGCCAT CGGGCTTGGT ATGCCGCCCG CACCCCCGCC GGTGCTGGCA 
CCGCGGCGCA AGACCCGTCA GCTGATGGTG CGCGACGTCG GCGTGGGCAG CGATCATCCG
ATCTCGGTCC AGTCCATGTG CACCACCAAG ACCCACGACA TCAACTCGAC CCTGCAGCAG
ATCGCCGAAC TCACCGCGTC GGGCTGCGAC ATCGTCCGGG TGGCGTGCCC GCGGCAGGAG
GACGCCGACG CGCTGCCGAT CATCGCCAAG AAGTCGAAGA TCCCGGTGAT CGCCGACATC
CACTTCCAGC CGAAGTACAT CTTCGCCGCG ATCGACGCCG GATGTGCGGC GGTGCGCGTC
AACCCCGGCA ACATCAAGGA GTTCGACGGT CGGGTCAAGG AGGTGGCCAA GGCCGCCGGT
GACGCCGGCA TCCCGATCCG CATCGGCGTC AACGCCGGAT CGCTGGACAA GCGATTCCTG
CAGAAGTACG GCAAGGCCAC GCCCGAGGCG CTCGTCGAGT CGGCGCTGTG GGAGGCCTCG
CTGTTCGAGG AGCACGGCTT CGGCGACATC AAGATCAGCG TCAAGCACAA CGACCCCGTC
GTGATGGTCG CGGCCTACGA GTTGCTGGCC GCCCGCAGCG ACTACCCGCT TCACCTCGGT
GTCACCGAGG CCGGCCCGGC GTTCCAGGGG ACGATCAAGT CCGCGGTCGC CTTCGGCGCG
TTGCTCTCCA AGGGCATCGG CGACACCATC CGGGTCTCGC TGTCCGCGCC GCCGGCCGAG
GAGGTCAAGG TCGGCAACCA GATCCTCGAA TCGCTCAACC TGCGCCCGCG CGGTCTGGAG
ATCGTGTCCT GCCCGTCGTG CGGACGCGCC CAGGTCGACG TGTACACCCT CGCCAACGAG
GTCACCGCCG GCCTCGAGGG CATGGACGTC CCGTTGCGCG TCGCCGTCAT GGGCTGTGTC
GTCAACGGTC CCGGCGAAGC CCGCGAAGCC GATCTCGGGG TGGCCTCCGG CAACGGCAAG
GGTCAGATCT TCGTCAAGGG TGAGGTCATC AAGACCGTGC CCGAGGCGCA GATCGTCGAG
ACGCTGATCG AGGAGGCCAT GCGCATCGCG GAGGAGATCG GCGCCGCCGG TGACAGCCCC
GAGGGAAGTC CCAGCGGTTC GCCGGTTGTG ACCGTAAGCT GA
 
Protein sequence
MTSGPAIGLG MPPAPPPVLA PRRKTRQLMV RDVGVGSDHP ISVQSMCTTK THDINSTLQQ 
IAELTASGCD IVRVACPRQE DADALPIIAK KSKIPVIADI HFQPKYIFAA IDAGCAAVRV
NPGNIKEFDG RVKEVAKAAG DAGIPIRIGV NAGSLDKRFL QKYGKATPEA LVESALWEAS
LFEEHGFGDI KISVKHNDPV VMVAAYELLA ARSDYPLHLG VTEAGPAFQG TIKSAVAFGA
LLSKGIGDTI RVSLSAPPAE EVKVGNQILE SLNLRPRGLE IVSCPSCGRA QVDVYTLANE
VTAGLEGMDV PLRVAVMGCV VNGPGEAREA DLGVASGNGK GQIFVKGEVI KTVPEAQIVE
TLIEEAMRIA EEIGAAGDSP EGSPSGSPVV TVS