Gene Namu_0867 details


Gene Information

Locus tag: Namu_0867
Symbol:
ID: 8446459
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 957451
End bp: 958596
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 74%
IMG OID: 645040004
Product: 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase
Protein accession: YP_003200267
Protein GI: 258651111
COG category: [I] Lipid transport and metabolism
COG ID: [COG0245] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
        [COG1211] 4-diphosphocytidyl-2-methyl-D-erithritol synthase
TIGRFAM ID: [TIGR00151] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
            [TIGR00453] 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase
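
The coordinate and length fields above are related by simple arithmetic, which can serve as a quick consistency check on the record. The following is a minimal sketch, not part of the database record: the function name `cds_lengths` is hypothetical, coordinates are assumed to be 1-based and inclusive, and the CDS is assumed to end in a single stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, so it does not contribute an amino acid.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(957451, 958596))  # (1146, 381)
```

The result agrees with the Gene Length (1146 bp) and Protein Length (381 aa) fields.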


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.861098
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTCG CCGCAATAGT GCCTGCCGCC GGGTCCGGCG AACGGCTCGG TGCCGGCATC 
CCCAAGGCCT TTGTCCGGGT GCAGGGACGC GAACTCGTGG CGCACGCGGT GGACCGCCTG
CGCCAGGCCG GCGTCGATCA TGTCGTGGTC GCGGTGTCCG CCGATCAGGT CGAGCAGGCC
CGGTCGTTGG TCGGCGACGT GGCTTCGGTG GTGATCGGCG GCGCCGACCG GACGGCCTCG
GTCGCGGCCG GCCTGGCTGC CGTGCCGGCC GACGCCGGCA TCGTGCTCGT GCACGATGCG
GCCCGCGCGT TCGCGCCGGC CACGCTCATC GAGCAGGTGA TCGCGCGGGT CGGCGAGGGT
GCCGACGCCG TCGTTCCGGT GCTGCCGGTG GTCGACACCA TCCGCACCAT CACCGACGGC
GGCGCGCTCA CCGGCACCGT CGACCGCAGC TGCCTGCGGA TCGTGCAGAC CCCGCAGGGC
TTCCGCGCCG CGGTGCTCCG GCGCGCGCAT GCCGCGGCCG CCGGTCGCGG CGAGTCGGCC
ACCGACGACG CCGCGCTGTG CGAATCGATC GGCGTCCGGG TCACTGCCGT CGACGGCGAT
CGGTCAGCCT TCAAGATCAC CACCGCGGAC GATTTGGAGA CGGCGCAGCG CATGACGACC
GATCCGGCCC CGGTCACCGA CCTGCGAGTC GGTTCCGGCA TCGACGTGCA CCCGATCGAG
CCCGGCCGGG ACTGCTGGGT GGCCGGATTG CTCTTCGAAG ATGCCGACGG TTGCGCCGGA
CATTCCGACG GTGACGTGGC CGCCCACGCG CTGTGCGACG CCCTGCTGTC CGCGGCCGGG
TTGGGCGACC TGGGTGCAGT CTTCGGCACG TCCGATCCCC GCTGGTCGGG CGCCTCCGGG
GCCACCTTGC TGGCCGAGGT GGTGGCCCGG GTCAAGGCCG CCGGATACCA GGTGATCAAC
GCCTCGGTGC AGGTGATCGC CAACACGCCC AAGCTCTCGC CCCGCCGGGT CGAGGCCCAG
CAGGCACTTT CCGCCGTCGT CGGCGGTCCG GTCTCGGTGG CCGGGACGAC CACCGACGGC
CTTGGCCTGA CCGGGCGCGG GGAAGGCCGC GCGGCCACCG CGACCGCCCT GCTCGGGCCG
GCCTGA
 
Protein sequence
MTVAAIVPAA GSGERLGAGI PKAFVRVQGR ELVAHAVDRL RQAGVDHVVV AVSADQVEQA 
RSLVGDVASV VIGGADRTAS VAAGLAAVPA DAGIVLVHDA ARAFAPATLI EQVIARVGEG
ADAVVPVLPV VDTIRTITDG GALTGTVDRS CLRIVQTPQG FRAAVLRRAH AAAAGRGESA
TDDAALCESI GVRVTAVDGD RSAFKITTAD DLETAQRMTT DPAPVTDLRV GSGIDVHPIE
PGRDCWVAGL LFEDADGCAG HSDGDVAAHA LCDALLSAAG LGDLGAVFGT SDPRWSGASG
ATLLAEVVAR VKAAGYQVIN ASVQVIANTP KLSPRRVEAQ QALSAVVGGP VSVAGTTTDG
LGLTGRGEGR AATATALLGP A
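
Both sequences are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any downstream use. The sketch below shows one way to do that and to compute a GC fraction that can be compared with the GC content field; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence as it appears in this record.
first_row = "ATGACCGTCG CCGCAATAGT GCCTGCCGCC GGGTCCGGCG AACGGCTCGG TGCCGGCATC"

print(len(clean_sequence(first_row)))        # number of bases in the row
print(f"{gc_fraction(first_row):.0%}")       # GC fraction of this row only
```

Passing the full gene sequence block to `gc_fraction` gives the value for the whole gene rather than for one row.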