Gene Emin_0501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0501 
Symbol 
ID6262659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp549751 
End bp550656 
Gene Length906 bp 
Protein Length301 aa 
Translation table11 
GC content39% 
IMG OID642610971 
Product4-diphosphocytidyl-2C-methyl-D-erythritol kinase 
Protein accessionYP_001875394 
Protein GI187250912 
COG category[I] Lipid transport and metabolism 
COG ID[COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase 
TIGRFAM ID[TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000076215 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAAG CTTCCTTAAA GATATTCTGT CCCGCTAAAA TAAATTTATT TTTAGAGATA 
GTTTCCAAAC TGCCTAACGG TTACCATGAA CTGCAAACAA TTTTTGCAAA GTTAGATTTT
GGGGATAATA TTTTACTAAC CCTTTCTCCG TCAGATAAGA CGGAAATCAA CCTTAAAATA
ACAGGCCCCT ACGGGCATGC CATAACGGCC GACGCGGATA ACCTTGTTTA TAAGGCGGCT
CAGCGTTTTT TTGAATTTAC AGGCATAAGC GCCAAATGCG ATATATCGCT TGAAAAAAAT
ATTCCGACAG GCGCGGGGTT GGGGGGCGGC TCTTCGGATG CAGGATGCCT CCTTCGCACT
TTTTGCAACC ATTATAAGAC AGATTTTACA ATGCTTGTTC CTTTGGCTGC TAAACTCGGC
GCGGACGTAG CGTTGTTTCT ATATGACGAA CCTGTTTTAA AAGGAGAAGG CATAGGTGAA
AAACTTACGC CTTTAAAAAT TAAAGACGCA CTGCCTTATG TGGTGTTGTC TTACCCAGAT
ACGCACATAT CTACTAAAGA TGTTTTTGAT AGGCTGAAGG TTGGAAGTAA AGAAGAAATA
TTGACAAACT TGGCTAAGCT TGATAAAATT ATAGCTGGTC TTACAGAAGG AAGTGCGTGG
GAAAAATACA TATATAACAG ATTAGAAGAT TATGTATTAC CTTTCAGTAA GCCTGTTTTG
GAGTTAAAGA AGTTAATGCA AACCCTAGGA GCCAAAAATA TTATGATGTC CGGTTCCGGT
TCAACAGTTT TTAGTTTATT TGATAGTTCC AGTGATGCCT GTGCATTTGC TGAAAAATTA
ATAAATCGGG GTTGTGTTGC AGTAAAAACG CAACTTTGGA GGGGTTTGTA TAATGAAAAT
TACTGA
 
Protein sequence
MNEASLKIFC PAKINLFLEI VSKLPNGYHE LQTIFAKLDF GDNILLTLSP SDKTEINLKI 
TGPYGHAITA DADNLVYKAA QRFFEFTGIS AKCDISLEKN IPTGAGLGGG SSDAGCLLRT
FCNHYKTDFT MLVPLAAKLG ADVALFLYDE PVLKGEGIGE KLTPLKIKDA LPYVVLSYPD
THISTKDVFD RLKVGSKEEI LTNLAKLDKI IAGLTEGSAW EKYIYNRLED YVLPFSKPVL
ELKKLMQTLG AKNIMMSGSG STVFSLFDSS SDACAFAEKL INRGCVAVKT QLWRGLYNEN
Y