Gene GSU2549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2549 
SymboltopA 
ID2687248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2810320 
End bp2812593 
Gene Length2274 bp 
Protein Length757 aa 
Translation table11 
GC content63% 
IMG OID637127239 
ProductDNA topoisomerase I 
Protein accessionNP_953595 
Protein GI39997644 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0550694 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCAAC ATCTCGTCAT AGTAGAATCT CCTGCCAAGG CTAAGACCAT AGAGAAGTTC 
CTCGGCCCGG ACTACAAGGT GCTCGCATCC TACGGCCATG TGCGCGCCCT GCCGAGCAAG
CAGGGCTCCG TGGACGTGGA GCACGACTTC GAGCCCCGCT ACGCCGTCCT GCCCGAGAGC
AAACGGCACA TCGACGCCAT CAAGAAGGAG TTGAAGGCGA GCGATTCGCT CCTGCTGGCC
ACCGACCCCG ACCGGGAAGG GGAGGCCATC TCCTGGCACC TGCTGGCGGC CCTGGGCGTG
AAGCCCGAGA AACCGCCGGT ACCGGTCAGG CGGGTGGTGT TCCACGAGAT CACCAAGGAC
GCCATAGTCC ATGCCGTGGA GAATCCCCGC GATATCTCAC AGGATCTGGT GGACGCCCAG
CAGGCTCGCT CAATTCTCGA TTATCTCGTG GGTTTCAATC TCTCCCCCTT CCTCTGGAAG
AAGATTCGTT ACGGCCTTTC CGCCGGCCGG GTCCAGTCGG TGGCCCTGCG GCTCATCTGC
GAGCGGGAGA AGGAGATCCA GGCGTTCCAG TCCCAGGAAT ACTGGACCAT CGGCGCGGAG
CTGGCCAAGG AGGGGGGGCA GAAGTGCACC GCCAATCTGG TCGAAGCCGA GGGGAAGAAG
CTCGACAAGT TCGACATCCC CGATCAGGCT GCGGCCGACC GGCTCGTGAA GGCCCTGGAG
AACGCCACCT TCACTGTGGA CAAGGTGACG AAGAGCGAGC GCAAGCGGAC GCCGGCGCCG
CCGTTTACCA CATCGACCCT CCAGCAGGAG GCTGCCCGCA AACTGGGCTT TTCGGCCAAA
AAGACCATGG CCACGGCCCA AAAGCTCTAC GAAGGGGTCG CCATCGACGA AGGGCTCGTG
GGTCTCATCA CTTACATGCG TACCGACAGC GTGGTGCTGT CGAACCAGGC ACTGCAGGAG
GCCCACCAGG TCATCACTTC CCTGTACGGT CCCGAATACG CCCTTGCCAA GCCCCGCTTC
TACAAAAACA AGGCTAAGAA CGCCCAGGAG GCCCACGAGG CGGTCCGCCC CACCTCCATC
GCCCGCACCC CGGCGGAGCT GAAAAAGTAC CTCTCCTCCG ACCAGTTCAA GCTGTACGAC
CTGATCTGGA AGCGGACCGT GGCCTGCCAG ATGGCCGAGG CGCTCCTGGA CCAAACCTCC
GTCGATATCG GCGCGGGCAA GGGCTACCGC TTCCGGGCCG CCGGCACCGT GATCCGCTTT
CCCGGATTTA TGAAGCTGTA CATCGAAGGG GTGGACGATC AGGCCGAAGA GAAGGAGGGG
ACCCTCCCTC CCCTCACCGA AGGGGAACTC CTGAAGCTCC TGAAGCTCCT CCCGGAGCAG
CACTTCACCC AGCCGCCCCC CCGGTACACC GAGGCGAGCC TGGTGAAGAC GCTGGAAGAG
TACGGCATCG GGCGCCCCTC AACCTATGCC TCCATCATGA ACACGCTCCT GGAGCGCAAG
TACGCCCGCC TCGACAGCAA GCGCTTCATC CCCGAGGATG TGGGGATGGT GGTCAACGAT
CTTCTGACCA ACCATTTCAC CACGTACGTG GACTACAACT TCACCGCCAC CCTTGAGGAA
GAGCTCGACC AGGTCTCCCG GGGGGAAAAG CGGTGGAAGC CGCTGCTGCG CGAGTTCTGG
GAGCCCTTCC AGGGACTGCT CAAACAGAAA GAGGGCGAGG TCAGCAAGGC GGACCTCACC
ACCGAGGCCA CGGACGAGGC ATGCCCCGAA TGCGGAAAAC CCCTGGTGGT GAAGCTCGGC
AAGCGCGGCA AGTTCATTGC CTGCTCCGGT TACAAGGAAG GGTGCACCTA TACCCGCAAC
ATCGACCAGG GTGAGGGAAG AGAGCAGGCG GAGCCGGTCC TGTCCGAGGA AAAGTGCGAC
AAATGCGGCA GCCCCATGCT CATCAAGGAC GGGCGCTTCG GCAAGTACCT GGCCTGCTCG
GCCTATCCCG CCTGCAAGAA CATCCAGCCC CTGGTGAAGC CCAAGGGGAC CGGCCATACC
TGCCCCGAAT GCAAGGAAGG GGAGCTGACC GAGAAAAAGT CCCGCTACGG CAAGATGTTC
TACTCCTGCA ACCGCTATCC CCAGTGCAAG TTCGCCCTCT GGGACCCGCC CCAGCCGGGG
CCGTGCCCCA AGTGCGGCTT CCCGCTGCTG GTGAAGAAGG TCTACAAGCG GGAAGGGGAG
TTCCTCAAGT GTCCCAAGGA AGGATGCGAC TACCGGACCG AAGGGAAAAA GTAA
 
Protein sequence
MSQHLVIVES PAKAKTIEKF LGPDYKVLAS YGHVRALPSK QGSVDVEHDF EPRYAVLPES 
KRHIDAIKKE LKASDSLLLA TDPDREGEAI SWHLLAALGV KPEKPPVPVR RVVFHEITKD
AIVHAVENPR DISQDLVDAQ QARSILDYLV GFNLSPFLWK KIRYGLSAGR VQSVALRLIC
EREKEIQAFQ SQEYWTIGAE LAKEGGQKCT ANLVEAEGKK LDKFDIPDQA AADRLVKALE
NATFTVDKVT KSERKRTPAP PFTTSTLQQE AARKLGFSAK KTMATAQKLY EGVAIDEGLV
GLITYMRTDS VVLSNQALQE AHQVITSLYG PEYALAKPRF YKNKAKNAQE AHEAVRPTSI
ARTPAELKKY LSSDQFKLYD LIWKRTVACQ MAEALLDQTS VDIGAGKGYR FRAAGTVIRF
PGFMKLYIEG VDDQAEEKEG TLPPLTEGEL LKLLKLLPEQ HFTQPPPRYT EASLVKTLEE
YGIGRPSTYA SIMNTLLERK YARLDSKRFI PEDVGMVVND LLTNHFTTYV DYNFTATLEE
ELDQVSRGEK RWKPLLREFW EPFQGLLKQK EGEVSKADLT TEATDEACPE CGKPLVVKLG
KRGKFIACSG YKEGCTYTRN IDQGEGREQA EPVLSEEKCD KCGSPMLIKD GRFGKYLACS
AYPACKNIQP LVKPKGTGHT CPECKEGELT EKKSRYGKMF YSCNRYPQCK FALWDPPQPG
PCPKCGFPLL VKKVYKREGE FLKCPKEGCD YRTEGKK