Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_0083 |
Symbol | |
ID | 8417887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | - |
Start bp | 112305 |
End bp | 114560 |
Gene Length | 2256 bp |
Protein Length | 751 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645036648 |
Product | DNA topoisomerase I |
Protein accession | YP_003196963 |
Protein GI | 258404221 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACAG ACCTGATTAT CGTTGAATCG CCGGCCAAGG TAAAAACCAT CCGCAAGTTC CTGGGGTCCG ACTATCTTGT CGAGGCCTCG GTCGGACACG TCCGCGACTT GCCCTCAAGC AATCTTGGGG TGGATGAAGA CAACAATTTC CAGCCCCAAT ACCAGATTAT TTCCGGAAAA CAGAAGGTCG TCAGCCGCCT GAAAAGCGCG GCCAAAAAAG CCACGACCGT CTATCTGGCT CCCGACCCGG ACCGGGAAGG GGAAGCGATT GCTTGGCATG TGGCCGAACT GCTCAAAAAA ACAAATACCA ATCTCAAACG GATCCAATTC AATGAGATCA CGTCCCGGGC GGTCAAAGAC GCCCTGGAGC ATCCCCGGGA TCTAGACGAA CGGCTGTTCT ATTCCCAGCA GGCCCGGCGC ATCCTGGACC GCCTCGTCGG CTATAAAATC TCACCGCTTC TCTGGAAAAA GGTCAAGCGG GGTCTGTCCG CCGGACGGGT CCAGTCGGTG GCCCTGCGGC TTATAGCCGA ACGCGAACGT GAACGCCAGC AATTTGATCC TCAGGAATAC TGGGTCCTCA AGGCCCATGT CCAGGCCGCG GCCCCGCCGC CGGTGGTCGC CGAATTGTGG AAAATCGGCG GCAAAAAACC CCATGTCGCC AACGAGACCC AGGCCCTGGA AATAGAAAAG AAGGTCTCCG AAGCCGGTTT CCATGTCGAA TCCGTGGAAG AAAAGGAACG CAAACGGCAC CCCAAGCCGC CGTTTATCAC CTCGACCCTC CAGCAGGACG CCAGCAACCG GCTCGGGTTC GCCGCCAAAC GGACCATGCG CATCGCCCAG CAGCTCTACG AAGGGCTGGA TCTGGGCGAC AAGGGCACGA CAGCGCTGAT CACCTACATG CGCACCGACT CCGTGCGTAT CTCCAACGAG GCCCGCAATG CGGCCCAAAA ATGGATCGTC TCCACCCTGG GCGAGGCCTA CTATCCCGAA AAGCCGCGCT ATTTCAAGAC CAAGGGGTCA GCTCAGGACG CCCACGAAGC CATCCGCCCC GTCGATCCGA CCCTGACCCC CGGATCCATA CAGTCCTACC TCTCCCGGGA GCATTTCCGG CTCTACAAAC TCATCTGGGA ACGGTTCATG GCCTCGCAGA TGGCCCCGGC CCGGTTCTGG GATACCCAGC TCACCCTCGC CTCGGCCAAC ACCTTGTGGC GGGCCAAAGG GGAACGGCTC ATCTTTGACG GCTACCTCCG GGTCTATTCG GCCGACAAAT CCCAGGAAGA GGTCGAACTG CCCAAGGTCC AGGCGAAGGA CGCCCTGACC CTGGAAAAAA TCGACAAAGA ACAGAAATTT ACCCAACCGC CGGCCCGGTT TTCCGAAGCC TCGCTGGTGC GCAAACTCGA AGAGCTCGGT ATCGGTCGCC CCTCGACGTA TGCCCAGATC ATCTCCACGT TGCTCGACCG CAACTACGTC CAGCTGGCCA AAAAGCAATT CGTGCCCACG GAAATGGGCT TTGTGGTCGC CGACCTGCTC ACGGCGCATT TCCCGCAGCT CTTGGACGTC GGCTTCACGG CGGAGATGGA AAAGAAACTC GACAGTGTCG CCGAGGGGGA CCAGGACTGG ACGCAACTCC TGCGCGAATT CACCGAGAGC TTCTATCCCA CCCTGGAAAA GGCCGAACAG GAGATGCAGC AGGTCAAGAC CGGGGTCGAA ACCGGGGTCA GCTGTCCCAA ATGCGGGAAG CCGGTGGTCA TCAAATTCGG CCGCAACGGC GAATTTCTGG CCTGTACCGC GTATCCGGAC TGCGACTTCA CCTCGAACTT CACCCGCGAC GAGGCCGGCA CTATCGTCAT CGTTGAGCCC GAACCCCAGG AGCGGCAAAA AGTGGGCACC TGCCCCGAGT GCGGCCAGGA CCTCGTGCTC AAAAAGGCCC GCACCGGCAG CCGCTTTATC GCCTGCACCG GCTACCCCAA ATGCAAATAC ACCCAGTCCT ACTCCACGGG CGTCAAATGC CCCAAACAAG ACTGCCCGGG CGAACTGGTG GAAAAAAGCT CCAAACGGGG CAAGGTCTTC TACGCCTGCA ACCAGTATCC GGACTGCAAG ACCGCCTATT GGAACTGGCC CATCGCCGAA GAGTGCCCTA CCTGCGGCTC ACCGATCCTC GTGCGCAAGG AGACCAAGGC CCGTGGCGAG CATGTCGCCT GCCCGGAAAA GGGCTGCGGC TATTGGCGGG AATTGCGCGA CGACGAAAAA CACTAG
|
Protein sequence | MSTDLIIVES PAKVKTIRKF LGSDYLVEAS VGHVRDLPSS NLGVDEDNNF QPQYQIISGK QKVVSRLKSA AKKATTVYLA PDPDREGEAI AWHVAELLKK TNTNLKRIQF NEITSRAVKD ALEHPRDLDE RLFYSQQARR ILDRLVGYKI SPLLWKKVKR GLSAGRVQSV ALRLIAERER ERQQFDPQEY WVLKAHVQAA APPPVVAELW KIGGKKPHVA NETQALEIEK KVSEAGFHVE SVEEKERKRH PKPPFITSTL QQDASNRLGF AAKRTMRIAQ QLYEGLDLGD KGTTALITYM RTDSVRISNE ARNAAQKWIV STLGEAYYPE KPRYFKTKGS AQDAHEAIRP VDPTLTPGSI QSYLSREHFR LYKLIWERFM ASQMAPARFW DTQLTLASAN TLWRAKGERL IFDGYLRVYS ADKSQEEVEL PKVQAKDALT LEKIDKEQKF TQPPARFSEA SLVRKLEELG IGRPSTYAQI ISTLLDRNYV QLAKKQFVPT EMGFVVADLL TAHFPQLLDV GFTAEMEKKL DSVAEGDQDW TQLLREFTES FYPTLEKAEQ EMQQVKTGVE TGVSCPKCGK PVVIKFGRNG EFLACTAYPD CDFTSNFTRD EAGTIVIVEP EPQERQKVGT CPECGQDLVL KKARTGSRFI ACTGYPKCKY TQSYSTGVKC PKQDCPGELV EKSSKRGKVF YACNQYPDCK TAYWNWPIAE ECPTCGSPIL VRKETKARGE HVACPEKGCG YWRELRDDEK H
|
| |