Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1102 |
Symbol | |
ID | 7977590 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1153254 |
End bp | 1155329 |
Gene Length | 2076 bp |
Protein Length | 691 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644798055 |
Product | DNA topoisomerase I |
Protein accession | YP_002949228 |
Protein GI | 239826604 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000153126 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGACT ATCTTGTCAT CGTGGAATCG CCAGCGAAAG CGAAGACGAT CGAACGATAT TTAGGAAAAA AATATAAAGT AAAAGCTTCG ATGGGACATG TTCGCGATCT GCCAAAAAGC CAAATGGGCG TTGATATAAA TAACGGCTAT GAGCCAAAGT ATATTACGAT TCGCGGTAAA GGGCCAATCA TTAAAGAATT AAAAACAGCA GCGAAAAAAG CAAAAAAAGT GTTTCTTGCC GCGGACCCGG ACCGCGAAGG GGAAGCGATT GCTTGGCATT TAGCCAATAT GCTCGACCTT GATATTCATT CCGACTGCCG CGTTGTATTT AACGAGATTA CGAAGGATGC GGTTAAAGAG TCATTTCAAC ATCCACGTCC GATCAATATG AATCTTGTTG ACGCGCAGCA AGCGCGCCGA GTGTTGGACC GGCTGGTTGG ATACAACATT AGCCCGCTTC TTTGGAAAAA GGTGAAGAAA GGATTGAGCG CGGGGCGTGT TCAATCTGTA GCGCTGCGTT TGATCATCGA CCGCGAAAAA GAAATTAAAC AATTTCAGCC GGAAGAGTAT TGGACGATTC AAGCCGAATT TGTAAAAGGA AATGAAACGT TTACTGCTTC TTTTTACGGA GTGGATGGGC AAAAGCTTGA ATTAAAAAAG GAAGCAGACG TTGCCGCGAT TTTACAACGC ATAAACGGCA ACCACTTTAC GGTGACATCG GTGGCAAAAA AGGAGCGGAA ACGAAATCCA GTGCCGCCGT TTACAACGTC TTCCTTGCAG CAAGAAGCAG CGCGCAAGCT TAATTTTCGA ACGAAGAAAA CGATGATGAT CGCCCAGCAG CTATATGAAG GAATCGATCT TGGCAGTGAA GGAACGGTCG GCTTAATTAC CTATATGCGT ACAGACTCGA CAAGAGTATC AGAAAGCGCA CGGCAAGAGG CACTATCTTA TATAGAAGCG ACGTTTGGAA AAGAATTTGT CGCACAAGAA AAGCGAAAAG AAAAGAAAAA TGCCAATGCG CAAGATGCGC ATGAAGCGAT TCGTCCGACA TCTGCATTTC GCGAGCCGGA AAAGGTAAAG CCATATTTAA CCCGCGATCA ATTTCGGTTG TATAAGTTAA TTTGGGAACG TTTTATCGCA AGCCAAATGG CAGCCGCACT GTTAGATACG ATGAGCATTG AACTTGAAAA TGAAGGGGTG ATCTTTCGGG CAAGCGGCTC GAAAGTAAAA TTTCCTGGTT TTATGAAAGT ATATGTAGAG GGAACGGATG ACCAAACGGA TGAACAAGAT CGCCTTCTTC CGGATTTGCA GGAAGGGGAA ACTGTTTTCA GCAAAGATAT TGAACCAAAG CAGCATTTTA CTCAGCCGCC TCCTCGCTAT ACGGAAGCGC GGCTTGTGAA AACGCTAGAA GAGCTTGGCA TCGGCCGGCC GTCTACGTAC GCGCCGACGC TTGATACGAT TCAAAAACGA AACTATGTCG TGCTAGAAAA TAAACGTTTT GTTCCAACAG AACTTGGAGA AATCGTGTTA GAACTAATGT TAGAGTTTTT CCCAGAAATC ATTGACGTGG AGTTTACAGC GAAAATGGAG AAAAATTTGG ATGAAATCGA GGAAGGAAAA GTAGAATGGG TGAAAGTGGT CGACGAATTT TACCAGGAAT TTGAAAAGCG GCTGCAAACC GCGGAAAAGG AAATGAAAGA AGTCGAGATT AAAGACGAGC CGGCGGGAGT CGACTGCGAA GTGTGCGGAA GCCCAATGGT ATATAAAATG GGGCGATTCG GCAAATTTGT CGCCTGCTCC AATTTCCCGG AATGCCGCAA TACAAAGCCG ATCGTTAAGG AAATCGGGGT AAAATGTCCG AAATGCCGCG AAGGAAATAT TGTGGAGCGC AGCAGTAAGA AAAAGCGGAT TTTTTATGGC TGCGACCGTT TTCCACAATG CGATTTCGTC TCGTGGGATA AACCGCTTGC CCGCCCTTGC CCGAAATGCG GCGGCTTGCT AGTGGAAAAG AAACTGAAAA AAGGCGTGCA AGTGCAATGT ACGGCATGTG ATTACGAAGA AGCACCACAA TCTTGA
|
Protein sequence | MSDYLVIVES PAKAKTIERY LGKKYKVKAS MGHVRDLPKS QMGVDINNGY EPKYITIRGK GPIIKELKTA AKKAKKVFLA ADPDREGEAI AWHLANMLDL DIHSDCRVVF NEITKDAVKE SFQHPRPINM NLVDAQQARR VLDRLVGYNI SPLLWKKVKK GLSAGRVQSV ALRLIIDREK EIKQFQPEEY WTIQAEFVKG NETFTASFYG VDGQKLELKK EADVAAILQR INGNHFTVTS VAKKERKRNP VPPFTTSSLQ QEAARKLNFR TKKTMMIAQQ LYEGIDLGSE GTVGLITYMR TDSTRVSESA RQEALSYIEA TFGKEFVAQE KRKEKKNANA QDAHEAIRPT SAFREPEKVK PYLTRDQFRL YKLIWERFIA SQMAAALLDT MSIELENEGV IFRASGSKVK FPGFMKVYVE GTDDQTDEQD RLLPDLQEGE TVFSKDIEPK QHFTQPPPRY TEARLVKTLE ELGIGRPSTY APTLDTIQKR NYVVLENKRF VPTELGEIVL ELMLEFFPEI IDVEFTAKME KNLDEIEEGK VEWVKVVDEF YQEFEKRLQT AEKEMKEVEI KDEPAGVDCE VCGSPMVYKM GRFGKFVACS NFPECRNTKP IVKEIGVKCP KCREGNIVER SSKKKRIFYG CDRFPQCDFV SWDKPLARPC PKCGGLLVEK KLKKGVQVQC TACDYEEAPQ S
|
| |