Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GBAA_3971 |
Symbol | topA |
ID | 2818374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. 'Ames Ancestor' |
Kingdom | Bacteria |
Replicon accession | NC_007530 |
Strand | - |
Start bp | 3648430 |
End bp | 3650508 |
Gene Length | 2079 bp |
Protein Length | 692 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637790685 |
Product | DNA topoisomerase I |
Protein accession | YP_020610 |
Protein GI | 47529261 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA [COG0551] Zn-finger domain associated with topoisomerase type I |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000571341 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGATT ACCTCGTAAT CGTGGAGTCG CCTTCTAAGG CGAAGACCAT TGAGAAATAT TTAGGGAAAA AATACAAAGT TGTCGCGTCT ATGGGACACG TTCGCGATTT ACCTAAAAGC CAAATGGGGA TAGAAGTAAA GAACAACTTC ACCCCGAAGT ATATTACCAT TCGTGGTAAA GGTCCCGTCT TAAAAGACTT AAAATCAGCG GCAAAAAAAG CAAAGAAAGT CTATCTCGCG GCCGATCCAG ACCGCGAAGG AGAAGCTATT GCTTGGCATT TAGCCAATAC GTTAAATGTG GACGTTGAAT CAGATTGCCG AGTTGTGTTT AATGAGATTA CAAAAGATGC AATAAAAGAA TCATTCAAAC ATCCTCGTGC GATTAACATG GATTTAGTAG ATGCACAACA AGCAAGACGT ATACTAGATC GTCTTGTTGG TTACAATATT AGTCCTTTAT TATGGAAGAA AGTAAAAAAA GGATTAAGTG CAGGGCGCGT ACAATCGGTA GCAGTTCGTT TAATTATCGA ACGTGAAAGA GAAATTCAAA GTTTCGAGCC TGAAGAATTC TGGACAATTA AAACAGAATT TGTGAAAGGG AAAGACACAT TTGAAGCAAG TTTTTACGGT GTAGATGGTG AAAAAGTTCA ATTAACGAAT GAAACACAAG TGAATGAAAT AATTGAACAG CTGAAAGATA ATGCGTTCTC AGTTGAAAAT GTAACGCGAA AAGAGCGAAA ACGTAATCCT GCATTACCAT TTACAACATC TTCCTTGCAA CAAGAGGCAG CGCGTAAGTT AAACATGCGA GCAAAGAAAA CGATGATGCT TGCGCAGCAA TTGTATGAAG GGATCGATCT TGGAAAACAA GGAACTGTAG GTCTTATTAC GTATATGAGA ACTGATTCAA CACGTATCTC AGAAACAGCT CAAACAGAGG CACGTACTTA TATTACTGAG GCGTATGGTA CGGAATACAT AGGAGCAGAA AAGAAGAAAG AAACGAAGAA GTCGAACGCA CAAGATGCGC ATGAGGCAAT TCGTCCTACT TCGGTAATGA GAAAGCCAGA GGAATTAAAG AGTTTCTTAA GTCGTGATCA ACTTCGATTG TATAAATTGA TTTGGGAGCG ATTTGTTGCA AGTCAAATGG CGTCTGCTAT AATGGATACT GTGACAGCGA GACTCATTAA TAACAATGTT CAGTTCCGTG CAAGTGGATC GGTTGTAAAG TTCCCAGGAT TTATGAAAGT GTATGTAGAG TCGAAAGATG ACGGGGCTGA AGAAAAGGAT AAGATGTTGC CACCTTTAGA AGTAGGGGAA ACCGTATTTT CGAAGGATTT AGAACCGAAG CAGCACTTTA CACAACCTCC GCCGCGCTAT ACTGAGGCTC GTCTAGTAAG AACGCTTGAA GAGCTTGGAA TTGGAAGACC GTCGACTTAC GTACCGACAC TTGAAACGAT TCAAAAACGT GGATATGTCG GTTTGGATAA TAAACGCTTC GTTCCAACTG AACTTGGTGA AATAGTAATT GAACTTATTT TAGAGTTTTT CCCAGAAATT ATTAACATTG AATTTACTGC TAACATGGAG CAAAGCCTCG ATGAAGTAGA AGAAGGAAAT GCAAATTGGG TAAAAATTGT TGATGATTTC TACGTAGGGT TTGAACCGCG TTTAGAAAAA GCGGAAAAAG AAATGCGTGA AGTAGAAATT AAAGATGAAC CAGCTGGGGA AGACTGTGAA TTATGTAATC ACCCAATGGT CTTTAAAATG GGTAAATACG GGAAATTTAT GGCTTGCTCG AATTTCCCAG ATTGTCGTAA TACAAAACCG ATTGTGAAAG AAATCGGTGT TACTTGTCCG AAATGCGATA AAGGTCAAAT TATTGAACGT CGTAGTAATA AAAAGAAACG CCTTTTCTAT GGATGCGGTA CGTATCCAGA GTGTGACTTT GTATCTTGGG ATAAGCCGAT TGGCCGTAAA TGTCCGAAGT GTGAAGGTAT GCTAGTAGAG AAGAAGTTGA AAAAAGGCGT ACAAGTACAA TGTATTTCGT GTGATTATGA AGAAGAACAA CAAATGTGA
|
Protein sequence | MSDYLVIVES PSKAKTIEKY LGKKYKVVAS MGHVRDLPKS QMGIEVKNNF TPKYITIRGK GPVLKDLKSA AKKAKKVYLA ADPDREGEAI AWHLANTLNV DVESDCRVVF NEITKDAIKE SFKHPRAINM DLVDAQQARR ILDRLVGYNI SPLLWKKVKK GLSAGRVQSV AVRLIIERER EIQSFEPEEF WTIKTEFVKG KDTFEASFYG VDGEKVQLTN ETQVNEIIEQ LKDNAFSVEN VTRKERKRNP ALPFTTSSLQ QEAARKLNMR AKKTMMLAQQ LYEGIDLGKQ GTVGLITYMR TDSTRISETA QTEARTYITE AYGTEYIGAE KKKETKKSNA QDAHEAIRPT SVMRKPEELK SFLSRDQLRL YKLIWERFVA SQMASAIMDT VTARLINNNV QFRASGSVVK FPGFMKVYVE SKDDGAEEKD KMLPPLEVGE TVFSKDLEPK QHFTQPPPRY TEARLVRTLE ELGIGRPSTY VPTLETIQKR GYVGLDNKRF VPTELGEIVI ELILEFFPEI INIEFTANME QSLDEVEEGN ANWVKIVDDF YVGFEPRLEK AEKEMREVEI KDEPAGEDCE LCNHPMVFKM GKYGKFMACS NFPDCRNTKP IVKEIGVTCP KCDKGQIIER RSNKKKRLFY GCGTYPECDF VSWDKPIGRK CPKCEGMLVE KKLKKGVQVQ CISCDYEEEQ QM
|
| |