Gene Namu_4387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4387 
SymbolispH 
ID8450013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4869937 
End bp4870917 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content70% 
IMG OID645043434 
Product4-hydroxy-3-methylbut-2-enyl diphosphate reductase 
Protein accessionYP_003203663 
Protein GI258654507 
COG category[I] Lipid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0761] Penicillin tolerance protein 
TIGRFAM ID[TIGR00216] (E)-4-hydroxy-3-methyl-but-2-enyl pyrophosphate reductase (IPP and DMAPP forming) 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCC CGACCTCTTC GCCCGCCAAG CGCGTCCTGC TGGCCAGCCC GCGCGGCTAC 
TGCGCGGGGG TCGACCGCGC GGTCGTCACC GTCGAGAAGG CCCTGGAGCA GTACGGGCCG
CCGGTGTACG TGCGCAAGCA GATCGTGCAC AACAAGCACG TGGTGGCCAC CCTGGAGTCC
CGCGGGGCGA TCTTCGTGGA GGAGACCGAC GAGGTCCCGG AGGGTGAGAT CGTCGTCTTC
TCGGCGCACG GGGTGTCCCC GGCCGTGCAC GAGCAGGCCG CGCGCCGGCA GCTGCAGGTG
ATCGACGCGA CCTGCCCGCT GGTCACCAAG GTGCACAAGG AGGCCCGGCG GTTCGCCGCC
GAGGACTACG ACATCCTGCT CATCGGCCAT CGCGGGCACG AGGAGGTCGA GGGCACCCAC
GGGGAGGCGC CCGAGGCGAT CCAGCTGATC AACGATGCCT CCGACGTCGA TGCGGTGACC
GTGCGCGACC CGGAAAAGGT GATCTGGCTC TCGCAGACCA CGCTGTCGGT GGATGAGACA
TTGGGCACCG TCGATCTGCT TCGCAAGCGC TTCCCGCTGA TGACCTCGCC GCCCAGCGAC
GACATCTGTT ACGCCACCCA GAACCGGCAG GAAGTGGTCA AGCAGATCGC CGCGGACTGC
GACCTGGTGA TCGTGGTCGG GTCGACGAAC TCGTCCAACT CGGTGCGGCT GGTCGAGGTG
GCGCTGGGAG CCGGGGCCGA TACCGCCTAC CTGGTCGACG ACGCCGGCGA GATCGACGAG
GCCTGGCTGG ACGGGGTGCA CACCGTCGGG GTGACCAGTG GGGCCTCGGT GCCCGACGAC
CTGGTTGAAG GAGTGCTGGC CCACCTGCAG GAACGGGGCT TTCCGCCGGC CGAGGAGTTC
ACCGCGGCGA CCGAGACGCT GACGTTCTCG CTGCCCAAGG AACTGCGCCG GCCGGCGGCC
TCCACTCGCG GGGGCAGCTG A
 
Protein sequence
MTAPTSSPAK RVLLASPRGY CAGVDRAVVT VEKALEQYGP PVYVRKQIVH NKHVVATLES 
RGAIFVEETD EVPEGEIVVF SAHGVSPAVH EQAARRQLQV IDATCPLVTK VHKEARRFAA
EDYDILLIGH RGHEEVEGTH GEAPEAIQLI NDASDVDAVT VRDPEKVIWL SQTTLSVDET
LGTVDLLRKR FPLMTSPPSD DICYATQNRQ EVVKQIAADC DLVIVVGSTN SSNSVRLVEV
ALGAGADTAY LVDDAGEIDE AWLDGVHTVG VTSGASVPDD LVEGVLAHLQ ERGFPPAEEF
TAATETLTFS LPKELRRPAA STRGGS