Gene Nmul_A0588 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0588 
Symbol 
ID3783986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp672751 
End bp673635 
Gene Length885 bp 
Protein Length294 aa 
Translation table11 
GC content57% 
IMG OID637810670 
Product4-diphosphocytidyl-2C-methyl-D-erythritol kinase 
Protein accessionYP_411288 
Protein GI82701722 
COG category[I] Lipid transport and metabolism 
COG ID[COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase 
TIGRFAM ID[TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGGAA TACTGTCAAC GACTGAAGTG GATTCGCAGG CGGAACTGAG CTGCCCTGCG 
CCTGCCAAAC TCAATCTGTT CCTGCATGTG GTGGGACGGA GGGAGGATGG GTACCATCTT
CTGCAAACCG TTTTCCGCCT GGTGGATTTC GCCGACCAGC TCCATTTCGG GCTGCGGGCG
GACGGTGTGA TCAAGCTGCA TACGCCCACT CCGGGGGTGC CGGAAGAGCA GGATTTGTGC
GTGCGCGCAG CAAAACTGCT GCAACGGGAA AGCGGTACTC CCTGGGGGGC CAATATCTTT
CTGGAAAAGC GCATCCCGAT GGGTGGTGGC CTGGGAGGCG GCAGTTCGGA TGCAGCCACG
ACATTGCTTG CGCTCAACCG CTTGTGGAAG CTGGGCTGGC GCCGGAATCA ACTTTTGAAA
CTGGCCCCGG AACTGGGTGC GGATGTTCCC GTATTCGTTT TCAGTGAAAA TGCCTTTGCT
GAGGGCATCG GCGAAAAACT CCTGCCGATT GCGTTACCCC CGGCATGGTA TCTGATACTC
ACACCGCCCG TGCATGTCTC AACGGCAAAG GTTTTTTCAA GTAAGGAATT GACACGAAAC
ACGATTCCGA TCAAAATACC GCCCTTTTCC ACCGAGCAAG GGCATAATGA TCTCGAGCCG
GTGGTGTGTG CTTCATACCC CGAGGTAGCA CGCCACCTCG AGTGGCTGCG GCAGCTCGAA
GGTGCAAGGA TGGCGGCCAT GACGGGTTCC GGCGCGTGCG TTTTTGCCGA GTTTGCGACC
GAATCCGGGG CCAGAAGCGC ACTGGGGAAG ATTCCATACG GTATGAAGGG TTTTGTGGCG
CAGGGACTTG ATCGCCATCC CTTGCATGAT TTTGCAGAAC AATAA
 
Protein sequence
MNGILSTTEV DSQAELSCPA PAKLNLFLHV VGRREDGYHL LQTVFRLVDF ADQLHFGLRA 
DGVIKLHTPT PGVPEEQDLC VRAAKLLQRE SGTPWGANIF LEKRIPMGGG LGGGSSDAAT
TLLALNRLWK LGWRRNQLLK LAPELGADVP VFVFSENAFA EGIGEKLLPI ALPPAWYLIL
TPPVHVSTAK VFSSKELTRN TIPIKIPPFS TEQGHNDLEP VVCASYPEVA RHLEWLRQLE
GARMAAMTGS GACVFAEFAT ESGARSALGK IPYGMKGFVA QGLDRHPLHD FAEQ