Gene Dgeo_0192 details


Gene Information

Locus tag: Dgeo_0192
Symbol:
ID: 4058438
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008025
Strand:
Start bp: 178980
End bp: 180077
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 69%
IMG OID: 641229192
Product: nucleotidyl transferase
Protein accession: YP_603664
Protein GI: 94984300
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1209] dTDP-glucose pyrophosphorylase
TIGRFAM ID: [TIGR01208] glucose-1-phosphate thymidylyltransferase, long form
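
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. Both are common conventions but are not stated in this record. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 178980
end_bp = 180077
gene_length_bp = 1098
protein_length_aa = 365

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a whole number of codons:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions the three fields agree: 1098 bp is 366 codons, that is 365 residues plus one stop codon.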


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.415123
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCTCTG CCCCGCCTCT CCCTTCTCTC ACAGACGTGC CGCTATACTC GGTTCACCCT 
ATGAAAGGTG TGATCCTCGC TGCCGGGCGT GGCAGCCGTC TTTTTCCCGT CAGCGCGGGC
AGGCCCAAGC ATGCGGTGCC GATCGCCGGA GTGCCGATCA TCGCGTGGGC AGTGCGGGCC
GTACGGGAAG CGGGGGTGGA GGAGGTGGCA GTGGTTACCA GCTCCAACAA CGAGGCAGCG
CTGCGCGAAG CCACCCGGGA CGAAGGGCCA CTGACCTTTT TGCGGCAGGA GGAACCGCGC
GGCACCGGTG ACGCCGTGCT GGCGGCCCGC GCCTTTCTGG AGGGCAGCCC GGCCCTGCTG
TATCTGGGTG ACAACCTGTT CGCGGATCCC CTGACGCCCC TGACCGAGGC CCTGCAGGAC
GCGGACGCGG CTCTGGGCGT CAAACAGGTC CCCGATCCCA GCGCCTATGG CGTGGCAGCC
GTTCGGGACA ACCTCCTCAC CAACTTGGAT GAGAAACCTG CCGCTCCAGC CAGCGATCTA
GCGGCCTGCG GCGTCTTCGC CTTTCACCCA CATGTGCTGG AGGAGGTCGC GCGGCTCGAA
CCGAGCGTAC GCGGTGAGAT CGAGTTTCCG CAGGCGCTGC TGCGGGTGAT CGCGGCGGGC
GGGCGGGTGC GGGCGGTGAC ATTCCCAGGG TTCTGGAGCG ACGCGGGGAC ACCCGCCGAC
CTGCTCAGTG CCAGCGCCCA TTTCCTGAGC AAGCTGGCGC CGCGGGTGGA CGGCGAGGTG
CGCCGCAGCA GCCTGAGCGG ACCGGTCGTG ATCGAGGCGG GCGCCACGGT GGAGGACTCT
CTCCTGGTTG GCCCGGTGCT GATTGGAGCG GGAGCCAGCG TCCGCGGCAG CACCGTGGGG
CCAAATGTCA GTGTTGGCCC GCAAGCCCGG CTCGAGGGCG CGACCCTATC CGATACCCTG
ATCGACGAGG CCGCGACTGT CCGCTCTCCC ACCCGTCCCC TCGTGCGGAC GGTGGTTGGG
CGCCGGGCCA CCATCACCGC GCCGAGCGCC AGCGGTCTGC AGATCGTGGT GGGCGACTAC
AGCGTGGTGC GCGTGTGA
 
Protein sequence
MTSAPPLPSL TDVPLYSVHP MKGVILAAGR GSRLFPVSAG RPKHAVPIAG VPIIAWAVRA 
VREAGVEEVA VVTSSNNEAA LREATRDEGP LTFLRQEEPR GTGDAVLAAR AFLEGSPALL
YLGDNLFADP LTPLTEALQD ADAALGVKQV PDPSAYGVAA VRDNLLTNLD EKPAAPASDL
AACGVFAFHP HVLEEVARLE PSVRGEIEFP QALLRVIAAG GRVRAVTFPG FWSDAGTPAD
LLSASAHFLS KLAPRVDGEV RRSSLSGPVV IEAGATVEDS LLVGPVLIGA GASVRGSTVG
PNVSVGPQAR LEGATLSDTL IDEAATVRSP TRPLVRTVVG RRATITAPSA SGLQIVVGDY
SVVRV
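
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced this record. It assumes the amino-acid assignments of NCBI translation table 11 (identical to the standard code) and treats a GTG, TTG, or ATG in the first codon position as methionine, which is a simplification of the full table 11 initiator set. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Simplifying assumption: only these initiator codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS, reading an initiator codon as M and stopping at the first stop."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
first_groups = "GTGACCTCTG CCCCGCCTCT CCCTTCTCTC"
print(translate(first_groups))
```

Applied to the first 30 bases, the sketch reproduces the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the Protein Length and GC content fields.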