Gene Namu_4555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4555 
Symbol 
ID8450183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5069740 
End bp5070693 
Gene Length954 bp 
Protein Length317 aa 
Translation table11 
GC content77% 
IMG OID645043596 
Product4-diphosphocytidyl-2C-methyl-D-erythritolkinase 
Protein accessionYP_003203823 
Protein GI258654667 
COG category[I] Lipid transport and metabolism 
COG ID[COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase 
TIGRFAM ID[TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGCCT CGAACTCCGG CCCGACGGCG CTTGCGCCCG CGGGTCCGTC GGTGCGGGTC 
CGGGCGCCGG CCAAGATCAA CCTGTACCTG GCCGTCGGTG ACGTGCGCGA GGACGGCTAC
CACGATCTGG TGACCGTGTT CCAGGCCGTT GACCTGGCCG ACGAGCTCAC CGTGCGACCG
GCCCGCCGGT CCGCGGTGCG CACCACGCCG CCGGACGGCG TGCCGGTGGG GGCCAAGAAC
CTGGCCGGGG TCGCGGCCCG GCTGCTGGCC ACCCGCACCA AGTCCGGTGG CCCGGTGGCC
ATCGACATCG CCAAGCAGAT CCCGGTGGCC GGCGGGATGG CCGGCGGCAG CGCCGATGCC
GCGGCCGCGC TGGTTGGCTG CGCCGCCCTG TGGGAGCTGC GGGTGCAGCG GCAGGAATTG
ATCGAGATCG GCCGGGAGAT CGGGGCCGAC GTGCCGTTCG CGCTCGCCGG CGGCACCGCG
CTGGGCACCG GGCGGGGCGA TCTGCTGTCC CCGGTGATGA CCCGGGCCCG GCTGCACTGG
GTGCTGGCCA TCGCCGATCA CGGCCTGTCC ACCCCGGCCG TGTTCGCCGA GCTGGACCGG
CTGCGGGCCC AGGGCGGCGG GCCCCCACCG GTGCGCCCGG TGGACACCAT GCTGGCCGCC
CTGACCAGCG GCGAGCCGGC CAAGATCGCC GCCGCGTTGG GCAACGACCT GCAGGCCGCC
GCGATCTCAC TGGCCCCCGG GTTGCGCCGC ACCCTGCGGG CGGGGGAGCA GGCGGGCGCG
CTGGGCGGGC TGGTCTCCGG ATCGGGGCCG ACCGTCGCCC TGCTGTGCGC CGACGCGGAA
TCGGCCGGCG CGGTGGCCGC CGAACTGGCC GGGTCCGGAA CCTGCCGATC CGTGCGGGTG
GCCGCCGGCC CAGCCCCGGG GGCCCGGGTC CTGCCCAACG GAGGGACCGG CTGA
 
Protein sequence
MMASNSGPTA LAPAGPSVRV RAPAKINLYL AVGDVREDGY HDLVTVFQAV DLADELTVRP 
ARRSAVRTTP PDGVPVGAKN LAGVAARLLA TRTKSGGPVA IDIAKQIPVA GGMAGGSADA
AAALVGCAAL WELRVQRQEL IEIGREIGAD VPFALAGGTA LGTGRGDLLS PVMTRARLHW
VLAIADHGLS TPAVFAELDR LRAQGGGPPP VRPVDTMLAA LTSGEPAKIA AALGNDLQAA
AISLAPGLRR TLRAGEQAGA LGGLVSGSGP TVALLCADAE SAGAVAAELA GSGTCRSVRV
AAGPAPGARV LPNGGTG