Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU2549 |
Symbol | topA |
ID | 2687248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 2810320 |
End bp | 2812593 |
Gene Length | 2274 bp |
Protein Length | 757 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637127239 |
Product | DNA topoisomerase I |
Protein accession | NP_953595 |
Protein GI | 39997644 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA [COG0551] Zn-finger domain associated with topoisomerase type I |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0550694 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCAAC ATCTCGTCAT AGTAGAATCT CCTGCCAAGG CTAAGACCAT AGAGAAGTTC CTCGGCCCGG ACTACAAGGT GCTCGCATCC TACGGCCATG TGCGCGCCCT GCCGAGCAAG CAGGGCTCCG TGGACGTGGA GCACGACTTC GAGCCCCGCT ACGCCGTCCT GCCCGAGAGC AAACGGCACA TCGACGCCAT CAAGAAGGAG TTGAAGGCGA GCGATTCGCT CCTGCTGGCC ACCGACCCCG ACCGGGAAGG GGAGGCCATC TCCTGGCACC TGCTGGCGGC CCTGGGCGTG AAGCCCGAGA AACCGCCGGT ACCGGTCAGG CGGGTGGTGT TCCACGAGAT CACCAAGGAC GCCATAGTCC ATGCCGTGGA GAATCCCCGC GATATCTCAC AGGATCTGGT GGACGCCCAG CAGGCTCGCT CAATTCTCGA TTATCTCGTG GGTTTCAATC TCTCCCCCTT CCTCTGGAAG AAGATTCGTT ACGGCCTTTC CGCCGGCCGG GTCCAGTCGG TGGCCCTGCG GCTCATCTGC GAGCGGGAGA AGGAGATCCA GGCGTTCCAG TCCCAGGAAT ACTGGACCAT CGGCGCGGAG CTGGCCAAGG AGGGGGGGCA GAAGTGCACC GCCAATCTGG TCGAAGCCGA GGGGAAGAAG CTCGACAAGT TCGACATCCC CGATCAGGCT GCGGCCGACC GGCTCGTGAA GGCCCTGGAG AACGCCACCT TCACTGTGGA CAAGGTGACG AAGAGCGAGC GCAAGCGGAC GCCGGCGCCG CCGTTTACCA CATCGACCCT CCAGCAGGAG GCTGCCCGCA AACTGGGCTT TTCGGCCAAA AAGACCATGG CCACGGCCCA AAAGCTCTAC GAAGGGGTCG CCATCGACGA AGGGCTCGTG GGTCTCATCA CTTACATGCG TACCGACAGC GTGGTGCTGT CGAACCAGGC ACTGCAGGAG GCCCACCAGG TCATCACTTC CCTGTACGGT CCCGAATACG CCCTTGCCAA GCCCCGCTTC TACAAAAACA AGGCTAAGAA CGCCCAGGAG GCCCACGAGG CGGTCCGCCC CACCTCCATC GCCCGCACCC CGGCGGAGCT GAAAAAGTAC CTCTCCTCCG ACCAGTTCAA GCTGTACGAC CTGATCTGGA AGCGGACCGT GGCCTGCCAG ATGGCCGAGG CGCTCCTGGA CCAAACCTCC GTCGATATCG GCGCGGGCAA GGGCTACCGC TTCCGGGCCG CCGGCACCGT GATCCGCTTT CCCGGATTTA TGAAGCTGTA CATCGAAGGG GTGGACGATC AGGCCGAAGA GAAGGAGGGG ACCCTCCCTC CCCTCACCGA AGGGGAACTC CTGAAGCTCC TGAAGCTCCT CCCGGAGCAG CACTTCACCC AGCCGCCCCC CCGGTACACC GAGGCGAGCC TGGTGAAGAC GCTGGAAGAG TACGGCATCG GGCGCCCCTC AACCTATGCC TCCATCATGA ACACGCTCCT GGAGCGCAAG TACGCCCGCC TCGACAGCAA GCGCTTCATC CCCGAGGATG TGGGGATGGT GGTCAACGAT CTTCTGACCA ACCATTTCAC CACGTACGTG GACTACAACT TCACCGCCAC CCTTGAGGAA GAGCTCGACC AGGTCTCCCG GGGGGAAAAG CGGTGGAAGC CGCTGCTGCG CGAGTTCTGG GAGCCCTTCC AGGGACTGCT CAAACAGAAA GAGGGCGAGG TCAGCAAGGC GGACCTCACC ACCGAGGCCA CGGACGAGGC ATGCCCCGAA TGCGGAAAAC CCCTGGTGGT GAAGCTCGGC AAGCGCGGCA AGTTCATTGC CTGCTCCGGT TACAAGGAAG GGTGCACCTA TACCCGCAAC ATCGACCAGG GTGAGGGAAG AGAGCAGGCG GAGCCGGTCC TGTCCGAGGA AAAGTGCGAC AAATGCGGCA GCCCCATGCT CATCAAGGAC GGGCGCTTCG GCAAGTACCT GGCCTGCTCG GCCTATCCCG CCTGCAAGAA CATCCAGCCC CTGGTGAAGC CCAAGGGGAC CGGCCATACC TGCCCCGAAT GCAAGGAAGG GGAGCTGACC GAGAAAAAGT CCCGCTACGG CAAGATGTTC TACTCCTGCA ACCGCTATCC CCAGTGCAAG TTCGCCCTCT GGGACCCGCC CCAGCCGGGG CCGTGCCCCA AGTGCGGCTT CCCGCTGCTG GTGAAGAAGG TCTACAAGCG GGAAGGGGAG TTCCTCAAGT GTCCCAAGGA AGGATGCGAC TACCGGACCG AAGGGAAAAA GTAA
|
Protein sequence | MSQHLVIVES PAKAKTIEKF LGPDYKVLAS YGHVRALPSK QGSVDVEHDF EPRYAVLPES KRHIDAIKKE LKASDSLLLA TDPDREGEAI SWHLLAALGV KPEKPPVPVR RVVFHEITKD AIVHAVENPR DISQDLVDAQ QARSILDYLV GFNLSPFLWK KIRYGLSAGR VQSVALRLIC EREKEIQAFQ SQEYWTIGAE LAKEGGQKCT ANLVEAEGKK LDKFDIPDQA AADRLVKALE NATFTVDKVT KSERKRTPAP PFTTSTLQQE AARKLGFSAK KTMATAQKLY EGVAIDEGLV GLITYMRTDS VVLSNQALQE AHQVITSLYG PEYALAKPRF YKNKAKNAQE AHEAVRPTSI ARTPAELKKY LSSDQFKLYD LIWKRTVACQ MAEALLDQTS VDIGAGKGYR FRAAGTVIRF PGFMKLYIEG VDDQAEEKEG TLPPLTEGEL LKLLKLLPEQ HFTQPPPRYT EASLVKTLEE YGIGRPSTYA SIMNTLLERK YARLDSKRFI PEDVGMVVND LLTNHFTTYV DYNFTATLEE ELDQVSRGEK RWKPLLREFW EPFQGLLKQK EGEVSKADLT TEATDEACPE CGKPLVVKLG KRGKFIACSG YKEGCTYTRN IDQGEGREQA EPVLSEEKCD KCGSPMLIKD GRFGKYLACS AYPACKNIQP LVKPKGTGHT CPECKEGELT EKKSRYGKMF YSCNRYPQCK FALWDPPQPG PCPKCGFPLL VKKVYKREGE FLKCPKEGCD YRTEGKK
|
| |