Gene Emin_1165 details

Gene Information

Locus tag: Emin_1165
Symbol:
ID: 6263694
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1262736
End bp: 1263884
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 44%
IMG OID: 642611645
Product: 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase
Protein accession: YP_001876054
Protein GI: 187251572
COG category: [I] Lipid transport and metabolism
COG ID: [COG0245] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
        [COG1211] 4-diphosphocytidyl-2-methyl-D-erithritol synthase
TIGRFAM ID: [TIGR00151] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
            [TIGR00453] 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase
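
The start and end coordinates, gene length, and protein length in this record are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the record itself does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1262736, 1263884)
print(length_bp, protein_length(length_bp))  # prints "1149 382"
```

Under these assumptions the computed values agree with the listed gene length (1149 bp) and protein length (382 aa).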


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 101
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGATT TTAAGGTTTC GGCTATTATA GTTGCGGGGG GCAGCGGCAC CAGAATGGGG 
CGTCCTAAAC AGATGCTTGA TATTGCCGGC AAACCCGCGC TTGCAAGAAC GGTTGAGGCT
TTTAAAAAAG TAAAAAATAT TACGGAAATT ATTGTTGTTT CCGCGCCGGA AACGGCGGCG
GAAATTAAAA AAATATTTCC CGAAATTAAA ACCGTTGCGC CCGGTGCGAC AAGATTAGGT
TCCGTAATAA GCGGTGTTGA AGCGGTTGAT AAAAACGCGG ATGTCATTTC CGTGCATGAC
GGCGCCAGGC CGCTTGTAAA CCCGGAAAAA GTTGATTTAG CGCTTAAAAC AGCCTATGAT
AAAGGCGCCA GCGTTTTAGC TGTGCCCGTT AAAGACACTA TTAAAGAATG TTCAAACGGC
GTTGTGTGTA AAACCTTGGA CCGCGGAGTT TTATACGCCG CCCAAACCCC GCAAAGCTAC
AGGGCGGATG TTTTAAAAAA CGCTTTAGAA AAATACGGCA AAGAGTTAAA CGCTACGGAT
GAATCCCAGC TTGTTGAAAA AACGGGCGTT AAAGTAAATA TTGTCGAGTC CGACTATAAA
AATATTAAAA TAACAACTCC GGAAGATTTA ATTATGGCAG AAGCTTTAGT TAAAGAAGAT
AAAGAAACAA TTTACAGAAC AGGCATTGGC TTTGATCTGC ACAGGCTTGT TGAGGGGCGT
AAACTTTTTA TAGGCGGTTT GGAAATACCG CATACAAAAG GCTTTTTGGG CCACAGCGAC
GGCGACGTTG TTTTGCATGC TGTGTGCGAC GCGGCTTTGG GCGCGGTTTG CGCCGGCGAA
ATTGGTGTTT ATTACCCGCC TACCGATGCT AAAATAGAAG GCATTTCCAG TGTTGATATA
GCTAAAAAAG TTATAGAAAT ATTAAAAGAA AAAAACGCCA GAATCGTTCA CATAGACGCG
GTTATAATAA CGGAAGAACC GAAAATGAAA CCGCATTATC AGGCTATACG TGAAAGCCTT
GGCAAAGTTT TTAATATGGG GGTTGACAGT ATCAGCTTTA AATCCAAAAG CCATGAAAAA
CTTGGCGACA TCGGCGCGGG TAACGCTGCT ATGTGCCAGT GTGTTGTAAC AGTAAAAACA
GAGAGGTAG
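
The gene sequence above is printed in blocks of ten bases, so it must be cleaned before any computation. The sketch below shows one way to strip the spacing and compute a GC percentage like the one in the record. The helper names are hypothetical, and how the record's 44% figure was rounded is not stated, so a value computed this way may differ slightly from it.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases among all bases in the sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAACGATT TTAAGGTTTC GGCTATTATA GTTGCGGGGG GCAGCGGCAC CAGAATGGGG"
print(len(clean_sequence(first_line)))  # prints 60
print(round(gc_percent(first_line)))    # GC percentage of this line only
```

Passing the full multi-line gene sequence as a single string gives the gene-wide GC percentage.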
 
Protein sequence
MNDFKVSAII VAGGSGTRMG RPKQMLDIAG KPALARTVEA FKKVKNITEI IVVSAPETAA 
EIKKIFPEIK TVAPGATRLG SVISGVEAVD KNADVISVHD GARPLVNPEK VDLALKTAYD
KGASVLAVPV KDTIKECSNG VVCKTLDRGV LYAAQTPQSY RADVLKNALE KYGKELNATD
ESQLVEKTGV KVNIVESDYK NIKITTPEDL IMAEALVKED KETIYRTGIG FDLHRLVEGR
KLFIGGLEIP HTKGFLGHSD GDVVLHAVCD AALGAVCAGE IGVYYPPTDA KIEGISSVDI
AKKVIEILKE KNARIVHIDA VIITEEPKMK PHYQAIRESL GKVFNMGVDS ISFKSKSHEK
LGDIGAGNAA MCQCVVTVKT ER
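
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is a minimal pure-Python translation that uses the codon assignments table 11 shares with the standard code. It does not implement table 11's alternative start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative and not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; NCBI translation tables 1 and 11
# share these assignments (they differ only in permitted start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence from the record.
print(translate("ATGAACGATT TTAAGGTTTC GGCTATTATA"))  # prints MNDFKVSAII
print(translate("GAGAGGTAG"))                         # prints ER
```

The two outputs match the first ten and last two residues of the listed protein sequence.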