Gene Nther_0056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_0056 
Symbol 
ID6316825 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp71216 
End bp72142 
Gene Length927 bp 
Protein Length308 aa 
Translation table11 
GC content37% 
IMG OID642642428 
Product4-diphosphocytidyl-2C-methyl-D-erythritol kinase 
Protein accessionYP_001916243 
Protein GI188584698 
COG category[I] Lipid transport and metabolism 
COG ID[COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase 
TIGRFAM ID[TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCACCAG ATATTTTGAG AGTGGAATCT CCTGCTAAGA TTAATCTCTA CTTAGATATT 
TTAGGGAAGA GGCAGGATGG TTTCCATGAA GTTGAAATGG TAATGCAATC GATTTCTTTA
TGTGATCGGT TAACTTTTAT AAGACAATCC CAGGGTAACA ATAACAATAT TGAATTGCTA
ATGAAATCAT CGGACTGTAT TGACCAATTA CCTGTCAGGG GAGATAACTT AATAATTAAG
GCTGCCAGGT TATTGATGAA TGATTTTAGA TTACCGCCAA TCAAGGTGAT TTTAGAGAAG
AACATTCCAA CTGAAGCGGG GTTAGCCGGT GGCAGTAGTA ATGCTGCTGC AACTTTATGG
GCAATAAATC ACATGTTTCA ACTAGGACTT ACCGAGCAAG AACTATCTGA CATAGGTGCT
CGCCTTGGTT CAGATATTCC CTTTTGTTTG TTTGGCGGAA CCAAGCTTGC CAAGGGACGA
GGAGAAATTT TACATCCACT ACCTTCTTTG CCCAACTGTT ATTTTGTACT AGTTAAACCC
AATTTTGGAG TTAGTACGGG AAAAGTGTAT CAAGAGCTAG GGTTTAAAAC GGATACCGAA
TATGGAGAAA ATCATCAAAC AAAATCAAGG GTCAATGGAA TTATTTCAGG CTTGGAAAAA
GGAACTTTAA CTGGAATTGT AGAAAATATG TATAATAAGA TGGAAGAGAT TGTGTTTAAA
TGGCATAGGG ATATGCAAAT AATATCACAG CAAATTGAGC AATTAGGGGC TTTAAAAGTA
TTAATGAGTG GAAGTGGTTC AACTATTTTT GGCGTTTTTG ATAACTATGA CACTGCTAAA
TATGCAAAAA AACAATTAGA AAGAGAATTT AAGTATGTGT TTCTCTCGAT CCCCCGTGAT
ATGGGAGTGG GAAAGGAGAA TAATTAA
 
Protein sequence
MSPDILRVES PAKINLYLDI LGKRQDGFHE VEMVMQSISL CDRLTFIRQS QGNNNNIELL 
MKSSDCIDQL PVRGDNLIIK AARLLMNDFR LPPIKVILEK NIPTEAGLAG GSSNAAATLW
AINHMFQLGL TEQELSDIGA RLGSDIPFCL FGGTKLAKGR GEILHPLPSL PNCYFVLVKP
NFGVSTGKVY QELGFKTDTE YGENHQTKSR VNGIISGLEK GTLTGIVENM YNKMEEIVFK
WHRDMQIISQ QIEQLGALKV LMSGSGSTIF GVFDNYDTAK YAKKQLEREF KYVFLSIPRD
MGVGKENN