Gene Sfum_0213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_0213 
Symbol 
ID4461476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp249092 
End bp251395 
Gene Length2304 bp 
Protein Length767 aa 
Translation table11 
GC content61% 
IMG OID639700967 
ProductDNA topoisomerase I 
Protein accessionYP_844349 
Protein GI116747662 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00304522 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.312261 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAAGT CATTATTGAT TGTCGAATCC CCGACAAAAG CGAAGACACT TGGGAGATAC 
CTTGGAAAAG ATTTCATTGT AAAAGCCTCT GTGGGGCATG TGAAGGATCT TCCCAAGAAC
AGGCTCGGAA TCAATCTGGA AAAGGATTTC CAGCCGGAAT ACCAGGTAAT ACGCGGCAAG
AAGAAGGTCA TCAGCGAACT GCACGAGGCC GCCGCAAAGT CGGGAGCGAT CTTTCTCGGT
CCGGACCCCG ACCGCGAGGG TGAGGCCATT GCGTGGCATA TCGCGGAAGA AATCGGTGCC
ACGGACAAGC CCGTATACCG GGTGCTTTTC TACGAGCTGA CCCGGAAAGC GATACAGGAA
GCTCTTGCCA AGCCCGACAG GCTGAACCGG GAGCTCTACG AAGCCCAGCA GGCCAGGCGC
ATTCTGGACC GCCTGGTAGG ATATATGATT TCCCCGATCC TGTGGCAGAA AGTGAAGCGA
GGGTTGAGCG CCGGGAGGGT GCAGTCGGTG GCCCTTCGGT TGATCTGCGC GCGGGAAAAG
GAGATCCGGG ATTTCGATTC CAGGGAGTAC TGGACCATAA CCGCTTTGCT GGGGACGCAG
GCATCGGCCG ATGCGCCCGC GAAGAGCGCG TCCCGGCGGT TCAAGGCGGA GCTGTTCCGT
TGCGGGAAGA AGAAATGCAC GATTTCCACG GGAGAGGAAG CCCGCGAGCT GGTAGACCGG
CTGCGGCCGC TCGACTATCG GGTGAGCAAG GTCGAACGCA GGAAGAAGAA ACGCCATCCG
GCGCCTCCCT TCATCACCAG CACGCTGCAG CAGGAGGCGG CCAGGAAGCT GCATTTCTCT
GCCAGGCAGA CCATGAATGT GGCCCAGCGG CTCTACGAGG GGCTCGAACT GGGAAAGGAA
GGGGCCGTCG GCCTGATCAC GTACATGCGT ACCGACTCGA CACGGCTGTC CGCGGATGCC
GTTCAGGCGG TCCGGGACTA CATCGCCGGA CATTGGGACA AGGCCTATCT GCCGGCCAAG
CCCGCCGCGT ACAAGAGCAA AGCGGGCGCC CAGGGAGCGC ACGAGGCGAT CCGGCCCACG
GACGTGAATC GGACCCCGGA AACCGTTGCG GGCTTTCTGA CAAAAGAGCA GCTCAAGCTC
TATACGCTCA TCTGGAAACG TTTCACGGCA TGCCAGATGG CGCCTGCCGT TCTCGACCAG
ACCTCGGTGG ATATCGCGGC CGGGGACTAC GTCTTGCGTG CGTCCGGCTC GATCGTCGAG
TTTCCGGGTT TCATGACGCT GTATGTCGAG GGTCGGGAGA ACGGGGATGA GGATTCGGAG
ACCGAGGGGC TGCTGCCCGA GCTGAAGGAA GGGGAGGTCC TGAGGCTGGA AGACCTGAAG
GCAGATCAGC ATTTCACCCA GCCTCCGCCG CGGTATACCG AAGCCTCCCT GATCAAGGAG
CTCGAAGATC TCGGCATCGG GCGGCCCAGC ACTTATGCCA CGATCCTTTC GACGATCCTG
GATCGGGAAT ATGCCGTGGT CCGTAAGAAG AGCCTCTTCC CCAGCGAATT GGGATGGCTG
ATCGACGGCC TGATGGTGGA AAACTTCCCC AGCGTGGTGG ACGTCGATTT TACCGCCAAA
ATGGAAAAAA GCCTGGACGA AATCGAACAG GGGCAGCACC CTTATCGCAA CCTTTTGGCG
GAATTTTACG AGCAGTTTTC GAAGACGCTC GAATCCGCGC GGACCAACAT GGTGAACCTC
AAGGCGGTCG GACGCCGGAC CGATCTCCAG TGCCCGCAGT GCGGCCTGCC GCTGCACATC
CGGTGGAGTC GCAACGGGCC GTTCCTGGCC TGCAGCGGCT ATCCGGACTG CCGGTTCTCG
TCGGACTACA GGCGGGATGA AAAGGGAAAC ATCGAGCCGG TGGCCGAGGA ATCCACCGGC
GAGACGTGTG AGAAGTGCGG GCGACCGATG ATCCTGAAGA AGGGGCGTTT CGGGAACTTC
CTGGCTTGCA GCGGCTATCC GGCGTGCAAG AACACCAAGG CGCCCGGTAC GGGAATCCCG
TGTCCGCGCG AAGGATGTTC GGGGGAGTTG GTGGAACGGG TCAGCAGAGG CGGCCGGCAT
TTCTTCGGCT GCAGCAGATA TCCGGAATGC AAGACGGCCT TTTCGGGGCG GCCGGTCCCG
GGGAAATGCC CTTCATGCGG CACCGGGCCG TTGATTGAAA AGGGGGGCAA GGGAGGGAGT
GTGAAGCGGG TCTGCGTCAA TCCGTCCTGC AAGTATGTGG AAACCGTTCC CGCCGCGGCG
GACCGGAAGG CCGCAAAGGA TTGA
 
Protein sequence
MSKSLLIVES PTKAKTLGRY LGKDFIVKAS VGHVKDLPKN RLGINLEKDF QPEYQVIRGK 
KKVISELHEA AAKSGAIFLG PDPDREGEAI AWHIAEEIGA TDKPVYRVLF YELTRKAIQE
ALAKPDRLNR ELYEAQQARR ILDRLVGYMI SPILWQKVKR GLSAGRVQSV ALRLICAREK
EIRDFDSREY WTITALLGTQ ASADAPAKSA SRRFKAELFR CGKKKCTIST GEEARELVDR
LRPLDYRVSK VERRKKKRHP APPFITSTLQ QEAARKLHFS ARQTMNVAQR LYEGLELGKE
GAVGLITYMR TDSTRLSADA VQAVRDYIAG HWDKAYLPAK PAAYKSKAGA QGAHEAIRPT
DVNRTPETVA GFLTKEQLKL YTLIWKRFTA CQMAPAVLDQ TSVDIAAGDY VLRASGSIVE
FPGFMTLYVE GRENGDEDSE TEGLLPELKE GEVLRLEDLK ADQHFTQPPP RYTEASLIKE
LEDLGIGRPS TYATILSTIL DREYAVVRKK SLFPSELGWL IDGLMVENFP SVVDVDFTAK
MEKSLDEIEQ GQHPYRNLLA EFYEQFSKTL ESARTNMVNL KAVGRRTDLQ CPQCGLPLHI
RWSRNGPFLA CSGYPDCRFS SDYRRDEKGN IEPVAEESTG ETCEKCGRPM ILKKGRFGNF
LACSGYPACK NTKAPGTGIP CPREGCSGEL VERVSRGGRH FFGCSRYPEC KTAFSGRPVP
GKCPSCGTGP LIEKGGKGGS VKRVCVNPSC KYVETVPAAA DRKAAKD