Gene Dgeo_0077 details

Gene Information

Locus tag: Dgeo_0077
Symbol:
ID: 4058518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008025
Strand:
Start bp: 72893
End bp: 73894
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 70%
IMG OID: 641229073
Product: dihydrouridine synthase, DuS
Protein accession: YP_603549
Protein GI: 94984185
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0042] tRNA-dihydrouridine synthase
TIGRFAM ID: [TIGR00737] putative TIM-barrel protein, nifR3 family
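
The length fields above are consistent with the coordinates. A minimal Python sketch of that arithmetic follows. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the helper names are hypothetical and not part of any database API.

```python
# Hypothetical helpers; only the numeric values come from the record above.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(72893, 73894)
print(length)                     # 1002
print(protein_length_aa(length))  # 333
```

Both results match the Gene Length and Protein Length fields reported for Dgeo_0077.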


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCTGCG GTCCCGGCTT CTACGCTCGC CGCCTGGCCC GGCCGGGGGC CGTCCTTGCC 
CCGATGGCGG GCTACAGCGA CGCACCCATG CGCCAACTCG CCGCCGAGCA GGGGGCGCTG
TGGACCGTCA GCGAGATGAT CAGCGCGCGC GGTCTGGTGC TGGGTGGCGA CTCGGAAAAG
CTCACGCTGG GGCGGCCCTA TCCCGGCGAG GTCGGGCGGG TCGTTCAGCT CTTTGGTGCT
GAGCCCGACG TGCTGGCACA GGCGGTGGCC CGCGCTGAAA GCTGGTTTGC ACCCGCCGCC
CTGGACCTGA ACATGGGCTG CCCGGTCCCC AAGGTCAAGG GCCGCGGCGG CGCCTGTCTT
CTCCAGACAC CAGAAGTCGC CTACACGCTG GTGCGGGCCA TGCGCTCGGC CACGACACTC
GACGTGAGTG CCAAGATTCG CCTGGGCTGG GACACCGACC GCAGCGTGGA GATTGCGCAG
GGACTGGCGG CCGCGGGGGC AGCGCTGATC ACCGTCCATG GACGGACCAG CGTGCAGCGC
TACAGCGGTG AGGCAGACTG GGACGCCATC GCCCGAGTCG CGGCCAGCGT GAAGGTGCCG
GTGGTCGGCA GCGGCGACGT CAAAAGCGCC GAACAGGCTC GTGCCCGCCT GAACACTGGG
GTGGCCGCCG TGATGATTGG GCGCGGTGCG GTGGGAAATC CCTGGCTGTT TCGCGCGCTC
GCCAGCGGCG ACGACGTGGT TCCCAGCGCG CAGGAGCGGG CCCGCACGGC GCTGCGCCAC
GCCCAGTTGC ATGTCACCTT CTACGGCCCC GACCGGTTCG GGCTGCTCAG CGTGCGCCCG
TTGCGCAAGG TGTTGCCGCA CTACCTGCCT GACCATCCCG AGCTGCGCGC GGCACTGGTG
CAGGTGAACA CGGTGGCCGA TGTGGAGCAG GCACTGGCGC CCCTGCTGGT TGACGCGCTC
CCGCCGCAGA CTCAGAACTT CGTGCGGATG AGTGCCGAAT AA
 
Protein sequence
MICGPGFYAR RLARPGAVLA PMAGYSDAPM RQLAAEQGAL WTVSEMISAR GLVLGGDSEK 
LTLGRPYPGE VGRVVQLFGA EPDVLAQAVA RAESWFAPAA LDLNMGCPVP KVKGRGGACL
LQTPEVAYTL VRAMRSATTL DVSAKIRLGW DTDRSVEIAQ GLAAAGAALI TVHGRTSVQR
YSGEADWDAI ARVAASVKVP VVGSGDVKSA EQARARLNTG VAAVMIGRGA VGNPWLFRAL
ASGDDVVPSA QERARTALRH AQLHVTFYGP DRFGLLSVRP LRKVLPHYLP DHPELRAALV
QVNTVADVEQ ALAPLLVDAL PPQTQNFVRM SAE
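
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following Python sketch shows one way to join such blocks, estimate GC content, and translate the nucleotide sequence. The function names are hypothetical. The codon table is the standard one; as far as amino acid assignments go, translation table 11 is understood to match it and to differ only in the start codons it permits, so treat that as an assumption of this sketch.

```python
# Standard codon table built from the conventional TCAG ordering.
# Assumption: table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First row of the gene sequence shown above.
first_row = "ATGATCTGCG GTCCCGGCTT CTACGCTCGC CGCCTGGCCC GGCCGGGGGC CGTCCTTGCC"
dna = join_blocks(first_row)
print(translate(dna))  # MICGPGFYARRLARPGAVLA
print(round(gc_fraction(dna), 2))
```

The translated row matches the first twenty residues of the protein sequence. Passing the full gene sequence through the same functions would allow the reported GC content and protein sequence to be checked in the same way.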