Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sfum_0213 |
Symbol | |
ID | 4461476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | + |
Start bp | 249092 |
End bp | 251395 |
Gene Length | 2304 bp |
Protein Length | 767 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639700967 |
Product | DNA topoisomerase I |
Protein accession | YP_844349 |
Protein GI | 116747662 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA [COG0551] Zn-finger domain associated with topoisomerase type I |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00304522 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.312261 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTAAGT CATTATTGAT TGTCGAATCC CCGACAAAAG CGAAGACACT TGGGAGATAC CTTGGAAAAG ATTTCATTGT AAAAGCCTCT GTGGGGCATG TGAAGGATCT TCCCAAGAAC AGGCTCGGAA TCAATCTGGA AAAGGATTTC CAGCCGGAAT ACCAGGTAAT ACGCGGCAAG AAGAAGGTCA TCAGCGAACT GCACGAGGCC GCCGCAAAGT CGGGAGCGAT CTTTCTCGGT CCGGACCCCG ACCGCGAGGG TGAGGCCATT GCGTGGCATA TCGCGGAAGA AATCGGTGCC ACGGACAAGC CCGTATACCG GGTGCTTTTC TACGAGCTGA CCCGGAAAGC GATACAGGAA GCTCTTGCCA AGCCCGACAG GCTGAACCGG GAGCTCTACG AAGCCCAGCA GGCCAGGCGC ATTCTGGACC GCCTGGTAGG ATATATGATT TCCCCGATCC TGTGGCAGAA AGTGAAGCGA GGGTTGAGCG CCGGGAGGGT GCAGTCGGTG GCCCTTCGGT TGATCTGCGC GCGGGAAAAG GAGATCCGGG ATTTCGATTC CAGGGAGTAC TGGACCATAA CCGCTTTGCT GGGGACGCAG GCATCGGCCG ATGCGCCCGC GAAGAGCGCG TCCCGGCGGT TCAAGGCGGA GCTGTTCCGT TGCGGGAAGA AGAAATGCAC GATTTCCACG GGAGAGGAAG CCCGCGAGCT GGTAGACCGG CTGCGGCCGC TCGACTATCG GGTGAGCAAG GTCGAACGCA GGAAGAAGAA ACGCCATCCG GCGCCTCCCT TCATCACCAG CACGCTGCAG CAGGAGGCGG CCAGGAAGCT GCATTTCTCT GCCAGGCAGA CCATGAATGT GGCCCAGCGG CTCTACGAGG GGCTCGAACT GGGAAAGGAA GGGGCCGTCG GCCTGATCAC GTACATGCGT ACCGACTCGA CACGGCTGTC CGCGGATGCC GTTCAGGCGG TCCGGGACTA CATCGCCGGA CATTGGGACA AGGCCTATCT GCCGGCCAAG CCCGCCGCGT ACAAGAGCAA AGCGGGCGCC CAGGGAGCGC ACGAGGCGAT CCGGCCCACG GACGTGAATC GGACCCCGGA AACCGTTGCG GGCTTTCTGA CAAAAGAGCA GCTCAAGCTC TATACGCTCA TCTGGAAACG TTTCACGGCA TGCCAGATGG CGCCTGCCGT TCTCGACCAG ACCTCGGTGG ATATCGCGGC CGGGGACTAC GTCTTGCGTG CGTCCGGCTC GATCGTCGAG TTTCCGGGTT TCATGACGCT GTATGTCGAG GGTCGGGAGA ACGGGGATGA GGATTCGGAG ACCGAGGGGC TGCTGCCCGA GCTGAAGGAA GGGGAGGTCC TGAGGCTGGA AGACCTGAAG GCAGATCAGC ATTTCACCCA GCCTCCGCCG CGGTATACCG AAGCCTCCCT GATCAAGGAG CTCGAAGATC TCGGCATCGG GCGGCCCAGC ACTTATGCCA CGATCCTTTC GACGATCCTG GATCGGGAAT ATGCCGTGGT CCGTAAGAAG AGCCTCTTCC CCAGCGAATT GGGATGGCTG ATCGACGGCC TGATGGTGGA AAACTTCCCC AGCGTGGTGG ACGTCGATTT TACCGCCAAA ATGGAAAAAA GCCTGGACGA AATCGAACAG GGGCAGCACC CTTATCGCAA CCTTTTGGCG GAATTTTACG AGCAGTTTTC GAAGACGCTC GAATCCGCGC GGACCAACAT GGTGAACCTC AAGGCGGTCG GACGCCGGAC CGATCTCCAG TGCCCGCAGT GCGGCCTGCC GCTGCACATC CGGTGGAGTC GCAACGGGCC GTTCCTGGCC TGCAGCGGCT ATCCGGACTG CCGGTTCTCG TCGGACTACA GGCGGGATGA AAAGGGAAAC ATCGAGCCGG TGGCCGAGGA ATCCACCGGC GAGACGTGTG AGAAGTGCGG GCGACCGATG ATCCTGAAGA AGGGGCGTTT CGGGAACTTC CTGGCTTGCA GCGGCTATCC GGCGTGCAAG AACACCAAGG CGCCCGGTAC GGGAATCCCG TGTCCGCGCG AAGGATGTTC GGGGGAGTTG GTGGAACGGG TCAGCAGAGG CGGCCGGCAT TTCTTCGGCT GCAGCAGATA TCCGGAATGC AAGACGGCCT TTTCGGGGCG GCCGGTCCCG GGGAAATGCC CTTCATGCGG CACCGGGCCG TTGATTGAAA AGGGGGGCAA GGGAGGGAGT GTGAAGCGGG TCTGCGTCAA TCCGTCCTGC AAGTATGTGG AAACCGTTCC CGCCGCGGCG GACCGGAAGG CCGCAAAGGA TTGA
|
Protein sequence | MSKSLLIVES PTKAKTLGRY LGKDFIVKAS VGHVKDLPKN RLGINLEKDF QPEYQVIRGK KKVISELHEA AAKSGAIFLG PDPDREGEAI AWHIAEEIGA TDKPVYRVLF YELTRKAIQE ALAKPDRLNR ELYEAQQARR ILDRLVGYMI SPILWQKVKR GLSAGRVQSV ALRLICAREK EIRDFDSREY WTITALLGTQ ASADAPAKSA SRRFKAELFR CGKKKCTIST GEEARELVDR LRPLDYRVSK VERRKKKRHP APPFITSTLQ QEAARKLHFS ARQTMNVAQR LYEGLELGKE GAVGLITYMR TDSTRLSADA VQAVRDYIAG HWDKAYLPAK PAAYKSKAGA QGAHEAIRPT DVNRTPETVA GFLTKEQLKL YTLIWKRFTA CQMAPAVLDQ TSVDIAAGDY VLRASGSIVE FPGFMTLYVE GRENGDEDSE TEGLLPELKE GEVLRLEDLK ADQHFTQPPP RYTEASLIKE LEDLGIGRPS TYATILSTIL DREYAVVRKK SLFPSELGWL IDGLMVENFP SVVDVDFTAK MEKSLDEIEQ GQHPYRNLLA EFYEQFSKTL ESARTNMVNL KAVGRRTDLQ CPQCGLPLHI RWSRNGPFLA CSGYPDCRFS SDYRRDEKGN IEPVAEESTG ETCEKCGRPM ILKKGRFGNF LACSGYPACK NTKAPGTGIP CPREGCSGEL VERVSRGGRH FFGCSRYPEC KTAFSGRPVP GKCPSCGTGP LIEKGGKGGS VKRVCVNPSC KYVETVPAAA DRKAAKD
|
| |