Gene Mlg_1461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1461 
SymbolispG 
ID4270242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1667469 
End bp1668716 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content68% 
IMG OID638126217 
Product4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase 
Protein accessionYP_742300 
Protein GI114320617 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.0743791 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGAGA CCAGCAGCCA GCACCCGAGA CGCCGCAGCG TGCCCGTGCC CGTAGGCCCG 
GTCACCATTG GCGGCAACCA CCCCATCGTG GTCCAGTCGA TGACCAACAC CGATACCGCC
GATGACATCC GCACGGTGGT GCAGGTGGCC GAGCTGGCAC GGGCGGGCTC GGAAATTGTC
CGGCTCACCG TCAACAACGA CGAAGCGGCG GCGGCGGTTC CGCACATCCG CGAGCGGCTG
GATGCCATGG GCCTGGAGGT GCCCCTGGTC GGTGACTTCC ACTTTAACGG CCACAAGCTG
CTGGCGAAAC ACCCCGCCTG TGCCGAGGCA TTGGCCAAGT TCCGCATCAA CCCCGGCAAT
GTCGGCAAGG GCCGTCGCCG GGACCCGCAG TTTGCCGAGA TGATCGAGTT CGCCTGTCGC
TACGACAAGC CGGTACGCAT CGGCGTCAAC TGGGGCAGCC TGGACCAGGA TCTGCTGGCC
GCGATGATGG ATGAGAATGC GGCTCTCCCC CGCCCGCTCC CACCGGAGCA GGTGATGAAG
CAGGCGGTGA TCGCCTCCGC ACTGCAGAGC GCGGAGAAGG CCGAGGCCCT CGGCCTGCCA
CGGGAGCGGA TCGTGCTCTC CTGCAAGATG TCCGGCGTCC AGGACTTGAT CGAGGTCTAC
CGCGATCTCG CGGCCCGCTG CGACTACGCT CTGCACCTGG GCCTCACCGA GGCCGGTATG
GGCTCCAAGG GCATTGTCGC CTCCACTGCC GCGCTGGCGG TCCTGCTGCA GGAGGGGATC
GGCGACACCA TCCGGGTCTC CCTCACCCCG GAGCCGGACC AGCCCCGCAC CGACGAGGTG
GTGGTTGCCC AGCAGATCCT GCAGACCATG GGCCTGCGCG CCTTCACCCC CATGGTCACC
GCCTGCCCCG GCTGTGGCCG GACCACCAGC ACCTACTTCC AGGCGCTGGC CCGCGACATC
CAGGCTCACG TCCAGCGGCG TATGCCGGAG TGGCGTCGTA CCTATCCGGG GGTCGAGAAC
CTGACCCTGG CGGTGATGGG TTGCGTGGTC AACGGTCCGG GCGAGAGCCG GAACGCCGAT
ATCGGCATCA GCCTGCCCGG TACCGGAGAA CGGCCCGTGG CCCCGGTCTA CGTGGATGGC
GAGAAAACAG TGACCCTGAA GGGCGAGCGT ATCGCCGAAG AGTTCCAGGC CATCGTCGAA
GACTATATCG AGGACCGCTT CGGGCAGCAG CGGGCCGGCG ACCGCTGA
 
Protein sequence
MPETSSQHPR RRSVPVPVGP VTIGGNHPIV VQSMTNTDTA DDIRTVVQVA ELARAGSEIV 
RLTVNNDEAA AAVPHIRERL DAMGLEVPLV GDFHFNGHKL LAKHPACAEA LAKFRINPGN
VGKGRRRDPQ FAEMIEFACR YDKPVRIGVN WGSLDQDLLA AMMDENAALP RPLPPEQVMK
QAVIASALQS AEKAEALGLP RERIVLSCKM SGVQDLIEVY RDLAARCDYA LHLGLTEAGM
GSKGIVASTA ALAVLLQEGI GDTIRVSLTP EPDQPRTDEV VVAQQILQTM GLRAFTPMVT
ACPGCGRTTS TYFQALARDI QAHVQRRMPE WRRTYPGVEN LTLAVMGCVV NGPGESRNAD
IGISLPGTGE RPVAPVYVDG EKTVTLKGER IAEEFQAIVE DYIEDRFGQQ RAGDR