Gene Tmz1t_3390 details


Gene Information

Locus tag: Tmz1t_3390
Symbol: ispG
ID: 7873881
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3709080
End bp: 3710339
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 70%
IMG OID: 643700329
Product: 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase
Protein accession: YP_002890361
Protein GI: 237654047
COG category: [I] Lipid transport and metabolism
COG ID: [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis
TIGRFAM ID: [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase
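
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TGA). The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3709080
end_bp = 3710339

# Assumption: coordinates are inclusive, so the span includes both endpoints.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints "1260 419"
```

Both values agree with the Gene Length (1260 bp) and Protein Length (419 aa) fields.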


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCCCC GACACGAACT CCAGCCGATC GAAGCCCGCC CGCTCGCCCG TCATCGCACC 
CATCAGGTGC GCGTCGGCAA GGTGAGGATC GGCGGCGAAG CCCCGGTCGT CGTGCAGTCG
ATGACCAATA CCGACACGGC CGACGTGCTC GCCACCGCGA TGCAGGTCGC CGAGCTCGCC
CGCGCCGGCT CCGAGATCGT GCGCATCACG GTCAACAACG AGGCCGCGGC GGCGGCGGTG
CCGAAGATCC GCGACCGCCT GCTCGCGCTC AACATGGACG TGCCGCTGGT CGGCGACTTC
CACTACAACG GCCACAAGCT GCTCACCGAC TTCCCGGCGT GCGCCGAGGC GCTCGCCAAG
CTGCGCATCA ACCCGGGCAA CGTCGGCGCC GGCCGCAAGC GCGACCCGCA GTTCGCCGCG
ATCGTCGAGC TTGCCTGCCG CTACGACAAG CCGGTGCGCA TCGGCGTGAA CTGGGGCAGC
CTCGACCAGT CGGTCCTTGC CCGCATCATG GACGCCAACG CCAAGCGTGC CGAGCCGCGC
GACGCCGGCG CGGTGATGCG CGAGGCGCTC GTCGTCTCGG CGCTCGAATC CGCGGCCAAG
GCCGAGGAAT ACGGCCTCGG CCGCGAGCGC ATCATCCTGT CGGCCAAGGT TTCCAGCGTG
CAGGACCTGA TCGCGGTGTA CCGCGATCTC GCCCGGCGCA GCGACTACGC GCTGCATCTG
GGCCTCACCG AGGCCGGCAT GGGCAGCAAG GGCATCGTCG GCTCCACCGC CGCGCTCGCC
GTGCTGCTGC AGGAAGGCAT CGGCGACACC ATCCGCATCT CGCTCACCCC CGAGCCGGGC
GGCAGCCGCA CCCAGGAGGT CGTGGTCGCG CAGGAGATCC TGCAGACCAT GGGCCTGCGC
GCCTTCACCC CCATGGTGAC TGCCTGCCCG GGCTGCGGCC GCACCACCAG CACCTTCTTC
CAGGAGCTCG CCTCCGGTAT CCAGGACTAC GTGCGCGCGC AGATGCCGGT GTGGCGCGAA
CAGTACGACG GCGTCGAGAA CATGACGCTG GCAGTGATGG GCTGCGTGGT CAACGGCCCG
GGCGAGAGCA AGCACGCCAA CATCGGCATC TCGCTGCCGG GCACCGGCGA AACCCCGGCG
GCGCCGGTGT TTGTCGACGG CGAGAAGGTC GTCACCCTGC GCGGCGACAA CATCGCTGCA
GAGTTCAAGG CGCTCGTCGA CGATTACGTC GCCACCCGCT ACGTGAAGAA GGGCGCCTGA
 
Protein sequence
MNPRHELQPI EARPLARHRT HQVRVGKVRI GGEAPVVVQS MTNTDTADVL ATAMQVAELA 
RAGSEIVRIT VNNEAAAAAV PKIRDRLLAL NMDVPLVGDF HYNGHKLLTD FPACAEALAK
LRINPGNVGA GRKRDPQFAA IVELACRYDK PVRIGVNWGS LDQSVLARIM DANAKRAEPR
DAGAVMREAL VVSALESAAK AEEYGLGRER IILSAKVSSV QDLIAVYRDL ARRSDYALHL
GLTEAGMGSK GIVGSTAALA VLLQEGIGDT IRISLTPEPG GSRTQEVVVA QEILQTMGLR
AFTPMVTACP GCGRTTSTFF QELASGIQDY VRAQMPVWRE QYDGVENMTL AVMGCVVNGP
GESKHANIGI SLPGTGETPA APVFVDGEKV VTLRGDNIAA EFKALVDDYV ATRYVKKGA