Gene Dgeo_2058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_2058 
Symbol 
ID4058404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp2165243 
End bp2166292 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content68% 
IMG OID641231097 
Producttype I topoisomerase, putative 
Protein accessionYP_605521 
Protein GI94986157 
COG category[L] Replication, recombination and repair 
COG ID[COG3569] Topoisomerase IB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.419584 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.93712 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGGCC GTACCGACCT GCTGCACGAG GAGTACCTGC GCCGCGAGGG GAACAAACCC 
GGTGAGTTCC GCTACTTCTG GCCGGACGGA GAAGAGTACA CCGACCCAGA GGGCCTCGAC
CGCATTGCCG CGCTTGCGGT GCCGCCCGCC TACACGGAGG TCTACGTCTC GCCCGACCCC
GACGCAGAAC TCCAGGCGTT TGGCCGCGAT GCCGCCGGAC GCCTCCAGTA CCGCTACCAC
CCGGACTTCG TGCAGGCGGG CGCGCTGAAG AAGTGGCAAC GGCTGGCGCG GTTTGCTGGG
GTGCTGCCCA CCCTGCGTGC GGTGACCGCT GCTGACCTGC GCCTCTCCGG TTTGCCGCGC
CGCAAGGTGC TCGCCGTGAT GTCCCGGCTG CTGCACGTCG CACATTTCCG GGTGGGCAGC
GACGCCTATG CCCGCGCGCA TAGAACCTAC GGCCTCTCCA CCCTGCGGCA GCGGCACGTC
AGGGTGAGCG GACAGGACAT CACCTTCCGC TTCAAGGGCA AGCATGCCAT CCTGCAGGAG
AAGACGGTCC GTAACCGCAC GCTGGCGACC AACATCGAGC GGCTGCTGGA GCTGCCCGGC
CCTTGGCTGT TCCAGAGCGT GGACGAAGGC GAGCGGACCC GTGTCCGCGC CCCTGACCTG
AACGCCTCCC TGCGCGAGGT GATCGGCCCC TTTACGGCCA AGGATTTCCG GACCTGGGGC
GGTACGCTGC TCGCTGCCGA ATTTCTGGCG GAGGCGGGAC CGCCCGAAAC GGAGCGCCAG
GCCCGCAAGA CCATCGTGGA ATGCGTGAAG TTTGTCGCCG CTGACCTCGG CAACACGCCC
GCCGTCACGC GCGGCAGCTA CATCTGCCCC GTCATCTTCG ACCGCTATCA GGCGGGCAAG
GTGCTCGACG ACTACGAACC CCGCGCGGGC CGCCCCGAAC CGGAACTGGA GGGCCTCACC
CGCAGCGAGG CCGCGCTGAA GCGGATGCTG GAGAGTGAAC AGGCACTGCG GACGCGCCAA
AGCAGGAAGA AGGCAAAAGA GGCCGCCTGA
 
Protein sequence
MAGRTDLLHE EYLRREGNKP GEFRYFWPDG EEYTDPEGLD RIAALAVPPA YTEVYVSPDP 
DAELQAFGRD AAGRLQYRYH PDFVQAGALK KWQRLARFAG VLPTLRAVTA ADLRLSGLPR
RKVLAVMSRL LHVAHFRVGS DAYARAHRTY GLSTLRQRHV RVSGQDITFR FKGKHAILQE
KTVRNRTLAT NIERLLELPG PWLFQSVDEG ERTRVRAPDL NASLREVIGP FTAKDFRTWG
GTLLAAEFLA EAGPPETERQ ARKTIVECVK FVAADLGNTP AVTRGSYICP VIFDRYQAGK
VLDDYEPRAG RPEPELEGLT RSEAALKRML ESEQALRTRQ SRKKAKEAA