Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rxyl_0752 |
Symbol | |
ID | 4116577 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | + |
Start bp | 781813 |
End bp | 783918 |
Gene Length | 2106 bp |
Protein Length | 701 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 638035536 |
Product | DNA topoisomerase III |
Protein accession | YP_643532 |
Protein GI | 108803595 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0611429 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGCTGA TCGTCGCCGA GAAGCCCTCC GTGGGGAGGG ACATAGCCGG CGCCCTGGGC CGCCACCGCC GCGAGGGGGA CGCCCTCGTC GGGAGCGGCT GGGTCGTAAC CTGGGCGCTC GGGCACCTCG CCGAGCTCGC CCCGCCCGAC GCCTACGGCG CCGAGTACAA GCGGTGGAGC CTACAGAACC TCCCCATACT CCCCGAGCGG TTCAGGGTCC GCGTCAACCC GAGGACCCGC GAGCGGTTCG AGGCCGTGAG GCGGTGGATG CGCGACCCCT CGGTGACCGA GGTCGTAAAC GCCTGCGACG CCGGGAGGGA GGGCGAGCTC ATCTTCGCCT ACCTCTACCA GCTCTCCCGC TGCGAAAAGC CGGTGAGGCG GCTCTGGGTC TCCTCGCTCA CCCCAGAGGC GATCCGCGAG GGCTTCGGCT CCTTGCGCGA CGGCCCCTCC ATGAAGCCCC TCGAGGACGC CGCCCGCAGC CGCGCGGAGG CCGACTGGAT CGTGGGGATG AACGCCACCC GCGCCTACTC CGTGCGCTTC TCGCGCCCCG GCGACGTCCT CTCCGTGGGG CGCGTCCAGA CCCCCACCCT CAGGCTGCTC GTCGAGCGCG AGCGGGAGAT AGAGGACTTC AGGCCCGAGA GGTTCTTCAC CGTCCACGCC CGCTTCGCGC GCGACGGGAA GACCTACGAC GGCCTCTGGT TCAGGGAGAA GGAGAGCCGC CTCAAGGAGC GGGAGGCCGC CGAGCAGATA GCGGAGAAGG TCCGCGGCGG CACCGGCGTC GTGAAGAAGG CCGAGAGGCG GCGGACCTCC GAGAGGCCGC CGCTCCTCTA CGACCTCACC GAGCTCCAGC GCAACGCCAA CGCCCGCTAC GGCTTCACCG CCGAGAGGAC GCTCCGGGCC GCCCAGGCGC TCTACGAGGA GCGCAAGCTC ATAACCTACC CCCGCACCTC GAGCCGCTAC CTCTCGGGGG ACATGGTCGG CACGCTCAAG AGGCGCGTCG AGGCGGCGGG GGGCCTGCCG GAGCTCGCGC CCTTCGCCGG GAGGCTGCTC GCGGCGGGAA GCCTCCCGGT CGGCAGGCGC GTCGTGGACG ACTCGAAGGT CACCGACCAC CACGCCATCG TCCCGACGGA CAGGAAGCCC TCCGGCGGCC TGCCGCCCGA CGAGGCGAAG GTCTACGACC TGGTGGCGCG GCGCTTCCTC GCGGTCTTCT TCCCGGCGGC CCGCTTCGAG AACACGACCG TCGTGACGGA GGCGCGCGGG GAGACGTTCC TGAGCAAGGG GCGGGTGGTG CTCGAGGCCG GGTGGCGGGA GCTCTACCCG GACGGCGTCG GCGGCAGGAA GGAGAAGGAG CCGCCCGCGC TGCCCCCCGT GGAGGCCGGC CAGGAGTGGC GGGTGGCGAA GGTGGGGGTC AAGGAGGGCG AGACAAAGCC GCCGCCGCGC TACTCCGAGT CGGCGCTCCT GGGGGCCATG GAGACCGCCG GGAAGCTCGT CGAGGACGAG GAACTGCGGC AGCAGATGAA GGACTCCGGG CTCGGCACCC CCGCGACGCG GGCGGCGATC ATCGAGCGCC TCATAAAGGT CGGCTACGTA GAGCGGGAGA AGAAGGCGCT CGTCCCCACC GCCAAGGGGC GCGCCCTGAT CTCCCTGCTC GCGGATAGCC CGCTCTCCTC GCCCGAGATG ACCGCCCGCT GGGAGCGGCG CCTCGCCCGC ATAGAGCGCG GCGAGGAGCG GCGCCCGGAC TTCATGTCCG ACATAAGCGG CTTCGCCGCC TCCGTCGTCG AGGGCGTGCG CCGCATGGAG GGCGAGAAGA TAGCCGCCCC CGGAGGCGGA GGGGAGGCCC TCGGCGCCTG CCCGAAGTGC GGCTCCCCGG TGGTCGAGAC GAAGAAGGCC TACGGGTGCT CAGCGTGGAG GAAGACCGGC TGCGACTTCG CCATCTGGAA GCGCGTCGCC GGGAAGCGCG TGAGCGAGTC GCAGGCCAGG CAGCTACTCG CGAAGGGCAG GACCGCGCGG CTCAAGGGCT TCAAGAGCAG GGCCGGAAAG CCCTTCTCCG CGGCACTCGT GCTCGACGGG GAGCACCGGG TGCGGCTGGA GCCCTTCCGG GGCTAG
|
Protein sequence | MRLIVAEKPS VGRDIAGALG RHRREGDALV GSGWVVTWAL GHLAELAPPD AYGAEYKRWS LQNLPILPER FRVRVNPRTR ERFEAVRRWM RDPSVTEVVN ACDAGREGEL IFAYLYQLSR CEKPVRRLWV SSLTPEAIRE GFGSLRDGPS MKPLEDAARS RAEADWIVGM NATRAYSVRF SRPGDVLSVG RVQTPTLRLL VEREREIEDF RPERFFTVHA RFARDGKTYD GLWFREKESR LKEREAAEQI AEKVRGGTGV VKKAERRRTS ERPPLLYDLT ELQRNANARY GFTAERTLRA AQALYEERKL ITYPRTSSRY LSGDMVGTLK RRVEAAGGLP ELAPFAGRLL AAGSLPVGRR VVDDSKVTDH HAIVPTDRKP SGGLPPDEAK VYDLVARRFL AVFFPAARFE NTTVVTEARG ETFLSKGRVV LEAGWRELYP DGVGGRKEKE PPALPPVEAG QEWRVAKVGV KEGETKPPPR YSESALLGAM ETAGKLVEDE ELRQQMKDSG LGTPATRAAI IERLIKVGYV EREKKALVPT AKGRALISLL ADSPLSSPEM TARWERRLAR IERGEERRPD FMSDISGFAA SVVEGVRRME GEKIAAPGGG GEALGACPKC GSPVVETKKA YGCSAWRKTG CDFAIWKRVA GKRVSESQAR QLLAKGRTAR LKGFKSRAGK PFSAALVLDG EHRVRLEPFR G
|
| |