Gene Dret_0083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0083 
Symbol 
ID8417887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp112305 
End bp114560 
Gene Length2256 bp 
Protein Length751 aa 
Translation table11 
GC content60% 
IMG OID645036648 
ProductDNA topoisomerase I 
Protein accessionYP_003196963 
Protein GI258404221 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACAG ACCTGATTAT CGTTGAATCG CCGGCCAAGG TAAAAACCAT CCGCAAGTTC 
CTGGGGTCCG ACTATCTTGT CGAGGCCTCG GTCGGACACG TCCGCGACTT GCCCTCAAGC
AATCTTGGGG TGGATGAAGA CAACAATTTC CAGCCCCAAT ACCAGATTAT TTCCGGAAAA
CAGAAGGTCG TCAGCCGCCT GAAAAGCGCG GCCAAAAAAG CCACGACCGT CTATCTGGCT
CCCGACCCGG ACCGGGAAGG GGAAGCGATT GCTTGGCATG TGGCCGAACT GCTCAAAAAA
ACAAATACCA ATCTCAAACG GATCCAATTC AATGAGATCA CGTCCCGGGC GGTCAAAGAC
GCCCTGGAGC ATCCCCGGGA TCTAGACGAA CGGCTGTTCT ATTCCCAGCA GGCCCGGCGC
ATCCTGGACC GCCTCGTCGG CTATAAAATC TCACCGCTTC TCTGGAAAAA GGTCAAGCGG
GGTCTGTCCG CCGGACGGGT CCAGTCGGTG GCCCTGCGGC TTATAGCCGA ACGCGAACGT
GAACGCCAGC AATTTGATCC TCAGGAATAC TGGGTCCTCA AGGCCCATGT CCAGGCCGCG
GCCCCGCCGC CGGTGGTCGC CGAATTGTGG AAAATCGGCG GCAAAAAACC CCATGTCGCC
AACGAGACCC AGGCCCTGGA AATAGAAAAG AAGGTCTCCG AAGCCGGTTT CCATGTCGAA
TCCGTGGAAG AAAAGGAACG CAAACGGCAC CCCAAGCCGC CGTTTATCAC CTCGACCCTC
CAGCAGGACG CCAGCAACCG GCTCGGGTTC GCCGCCAAAC GGACCATGCG CATCGCCCAG
CAGCTCTACG AAGGGCTGGA TCTGGGCGAC AAGGGCACGA CAGCGCTGAT CACCTACATG
CGCACCGACT CCGTGCGTAT CTCCAACGAG GCCCGCAATG CGGCCCAAAA ATGGATCGTC
TCCACCCTGG GCGAGGCCTA CTATCCCGAA AAGCCGCGCT ATTTCAAGAC CAAGGGGTCA
GCTCAGGACG CCCACGAAGC CATCCGCCCC GTCGATCCGA CCCTGACCCC CGGATCCATA
CAGTCCTACC TCTCCCGGGA GCATTTCCGG CTCTACAAAC TCATCTGGGA ACGGTTCATG
GCCTCGCAGA TGGCCCCGGC CCGGTTCTGG GATACCCAGC TCACCCTCGC CTCGGCCAAC
ACCTTGTGGC GGGCCAAAGG GGAACGGCTC ATCTTTGACG GCTACCTCCG GGTCTATTCG
GCCGACAAAT CCCAGGAAGA GGTCGAACTG CCCAAGGTCC AGGCGAAGGA CGCCCTGACC
CTGGAAAAAA TCGACAAAGA ACAGAAATTT ACCCAACCGC CGGCCCGGTT TTCCGAAGCC
TCGCTGGTGC GCAAACTCGA AGAGCTCGGT ATCGGTCGCC CCTCGACGTA TGCCCAGATC
ATCTCCACGT TGCTCGACCG CAACTACGTC CAGCTGGCCA AAAAGCAATT CGTGCCCACG
GAAATGGGCT TTGTGGTCGC CGACCTGCTC ACGGCGCATT TCCCGCAGCT CTTGGACGTC
GGCTTCACGG CGGAGATGGA AAAGAAACTC GACAGTGTCG CCGAGGGGGA CCAGGACTGG
ACGCAACTCC TGCGCGAATT CACCGAGAGC TTCTATCCCA CCCTGGAAAA GGCCGAACAG
GAGATGCAGC AGGTCAAGAC CGGGGTCGAA ACCGGGGTCA GCTGTCCCAA ATGCGGGAAG
CCGGTGGTCA TCAAATTCGG CCGCAACGGC GAATTTCTGG CCTGTACCGC GTATCCGGAC
TGCGACTTCA CCTCGAACTT CACCCGCGAC GAGGCCGGCA CTATCGTCAT CGTTGAGCCC
GAACCCCAGG AGCGGCAAAA AGTGGGCACC TGCCCCGAGT GCGGCCAGGA CCTCGTGCTC
AAAAAGGCCC GCACCGGCAG CCGCTTTATC GCCTGCACCG GCTACCCCAA ATGCAAATAC
ACCCAGTCCT ACTCCACGGG CGTCAAATGC CCCAAACAAG ACTGCCCGGG CGAACTGGTG
GAAAAAAGCT CCAAACGGGG CAAGGTCTTC TACGCCTGCA ACCAGTATCC GGACTGCAAG
ACCGCCTATT GGAACTGGCC CATCGCCGAA GAGTGCCCTA CCTGCGGCTC ACCGATCCTC
GTGCGCAAGG AGACCAAGGC CCGTGGCGAG CATGTCGCCT GCCCGGAAAA GGGCTGCGGC
TATTGGCGGG AATTGCGCGA CGACGAAAAA CACTAG
 
Protein sequence
MSTDLIIVES PAKVKTIRKF LGSDYLVEAS VGHVRDLPSS NLGVDEDNNF QPQYQIISGK 
QKVVSRLKSA AKKATTVYLA PDPDREGEAI AWHVAELLKK TNTNLKRIQF NEITSRAVKD
ALEHPRDLDE RLFYSQQARR ILDRLVGYKI SPLLWKKVKR GLSAGRVQSV ALRLIAERER
ERQQFDPQEY WVLKAHVQAA APPPVVAELW KIGGKKPHVA NETQALEIEK KVSEAGFHVE
SVEEKERKRH PKPPFITSTL QQDASNRLGF AAKRTMRIAQ QLYEGLDLGD KGTTALITYM
RTDSVRISNE ARNAAQKWIV STLGEAYYPE KPRYFKTKGS AQDAHEAIRP VDPTLTPGSI
QSYLSREHFR LYKLIWERFM ASQMAPARFW DTQLTLASAN TLWRAKGERL IFDGYLRVYS
ADKSQEEVEL PKVQAKDALT LEKIDKEQKF TQPPARFSEA SLVRKLEELG IGRPSTYAQI
ISTLLDRNYV QLAKKQFVPT EMGFVVADLL TAHFPQLLDV GFTAEMEKKL DSVAEGDQDW
TQLLREFTES FYPTLEKAEQ EMQQVKTGVE TGVSCPKCGK PVVIKFGRNG EFLACTAYPD
CDFTSNFTRD EAGTIVIVEP EPQERQKVGT CPECGQDLVL KKARTGSRFI ACTGYPKCKY
TQSYSTGVKC PKQDCPGELV EKSSKRGKVF YACNQYPDCK TAYWNWPIAE ECPTCGSPIL
VRKETKARGE HVACPEKGCG YWRELRDDEK H