Gene Dbac_2741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDbac_2741 
Symbol 
ID8378425 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfomicrobium baculatum DSM 4028 
KingdomBacteria 
Replicon accessionNC_013173 
Strand
Start bp3129233 
End bp3131473 
Gene Length2241 bp 
Protein Length746 aa 
Translation table11 
GC content60% 
IMG OID645001966 
ProductDNA topoisomerase I 
Protein accessionYP_003159233 
Protein GI256830505 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.123661 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCAAGG ACTTAATCAT CGTCGAATCA CCCGCCAAAA TCAAAACCAT CAAAAAGTTT 
CTGGGCGGCG GGTATGAAGT GGAAGCATCC GTGGGACACG TGCGTGACCT GCCGACCAAA
ACCCTGGGCG TGGACGAGGG CAATGACTTT GCCCCGGACT ATCAGATCAT CCCCGGCAAG
GCCAAGGTCG TCAGCAAACT CAAATCCGCG GCCAAGGCTG CGGACACGGT CTATCTGGCA
CCCGACCCGG ACCGTGAAGG GGAGGCCATC GCTTGGCACG TGGCCGAAGT CATCCGGACT
TCCAATCCGA ATGTGAAACG CATTCAGTTC AACGAGATCA CGGCCAAGGC GGTCAAAGAA
GCCTTGGCCA ACCCCACGGA ACTGCGCAAG CCCCTCTTCG ATTCCCAGCA GGCCAGACGC
ATTCTGGATC GCCTGGTGGG CTACAAAATT TCTCCGCTGC TCTGGAAAAA GGTCAAACGC
GGCCTGTCCG CCGGCCGGGT CCAATCCGTG GCCCTGCGTC TCATCGTGGA CCGCGAACGT
GAACGCCAGG CCTTTGTTTC CGAAGAATAC TGGGTTTTCA AGATCACGGT CCAGGGCGAG
ACCCCTCCGC CCTTTGACGC CGACCTGTGG AAAGTGGACG GGGAAAAACC CGTCATCGGC
GACGAAAAGA CGGCCCTGGC CCTGGAAAGC CGCGTCGACG GACAGTCTTA TGTGGTACGG
GACATCGTCG AGAAGGAACG CCAGCGCCAC CCTCGTCCGC CGTTCATCAC CTCCACCCTG
CAGCAGGATG CCAGCAACAA GCTCGGCTTC AACGCCAAAC GGACCATGTC CGTGGCTCAG
CGCCTCTACG AGGGCGTAGA GCTTGGCGAA CGCGGAACCA CGGCACTCAT CACCTACATG
CGTACCGACT CGGTGCGCAT TTCTGATGAG GCCAAGGACG GCGCCCGCGA GTGGATCACC
CGCACCCTGG GGCCGGAATT CTATCCCAAG GAGCCCCGGG TCTACAAAAG CAAGGGCAGC
GCCCAGGACG CGCACGAAGC CATTCGCCCC GTCGATCCGA GCCTGACTCC CGATTCCATC
AAAAACAACC TGCCGCCCGA ACAGTACAAG CTCTACCGGC TCATCTGGGA GCGCTTCATG
GCCTCGCAGA TGGCTTCCGC CCGCTTCTGG GATACCGTGG CCACCATTGA AAGCGGCCCG
GCCCAATGGC GCGCCAAAGG CGAGAGACTC GTCTTCCCCG GTTTTCTGCA GATCTGGCCG
CAATCCTCGG ACAACCAGAG CGCCTTGCTG CCGAAACTTG TGACCGGACA GGAACTGCGT
CTGGAGAAGC TGCACAAGGA ACAGAAATTC ACCCAGCCCC CGGCTCGCTT CTCGGAAGCG
TCCCTGGTGC ACAAGCTCGA AGAGCTGGGC ATCGGCCGAC CCTCGACCTA TGCGGCCATC
ATCTCCACCC TGACAGAGCG CGACTACGTC CACATCGAGG AAAAACATTT CCAGCCCACG
GACCTGGGCG TCATTGTCTG CGACCTTCTG GTGGAACATT TCGCCCACCT CATGGACGCC
GGCTTCACGG CGCGCATGGA AGAAAGCCTC GACCATGTCG CCGAGGGCGA AACCGACTGG
GTCGCCCTGC TCCGGGATTT CACCCTGGAC TTCAACCCGA CCCTGGACAA GGCGCGGGAA
AACATGACCC AGGTCAAGGC CGGCATGGAC ACGGGACTGT CCTGTCCCCA ATGCGAGGAT
GGCAGGCTTG TGGTCAAATT CGGCAAGAAC GGAACTTTTC TGGGCTGCGC CAACTATCCG
GCCTGTTCCT TTACCAGCAA CTACATCCGC AACAACGCCG GGGAAATCGA GCTGGTCAAG
GAAGAAGCGC CCGAAGAGCT CGGGCCCTGC CCCAAATGCG AAGACGGTCG GTTGGTGGTC
AAAAAGACCA AGACCGGCGG CAGATTTGTA GCCTGCTCCA ATTATCCCGC CTGCCGCCAC
ACCAAATCCG TGAGTACCGG CGTGGCCTGC CCCAAGGACG GCTGCGACGG CGAACTGGTC
GAAAAAACCA GCCGCCGCGG GAAGCCCTTC TATTCCTGCA GCAAATATCC CAAATGCGAT
TACGCGGTCT GGGATTTCCC CGTGGCCAAG CCCTGTCCGC TGTGCGAATC AAAAATCCTG
GTGCGCAAAG AGACCAAGGC CAGGGGAGCG CACCTGGCCT GTCCGGTCAA GGATTGCGGA
TACTGGGAAA AATTGGACTA G
 
Protein sequence
MGKDLIIVES PAKIKTIKKF LGGGYEVEAS VGHVRDLPTK TLGVDEGNDF APDYQIIPGK 
AKVVSKLKSA AKAADTVYLA PDPDREGEAI AWHVAEVIRT SNPNVKRIQF NEITAKAVKE
ALANPTELRK PLFDSQQARR ILDRLVGYKI SPLLWKKVKR GLSAGRVQSV ALRLIVDRER
ERQAFVSEEY WVFKITVQGE TPPPFDADLW KVDGEKPVIG DEKTALALES RVDGQSYVVR
DIVEKERQRH PRPPFITSTL QQDASNKLGF NAKRTMSVAQ RLYEGVELGE RGTTALITYM
RTDSVRISDE AKDGAREWIT RTLGPEFYPK EPRVYKSKGS AQDAHEAIRP VDPSLTPDSI
KNNLPPEQYK LYRLIWERFM ASQMASARFW DTVATIESGP AQWRAKGERL VFPGFLQIWP
QSSDNQSALL PKLVTGQELR LEKLHKEQKF TQPPARFSEA SLVHKLEELG IGRPSTYAAI
ISTLTERDYV HIEEKHFQPT DLGVIVCDLL VEHFAHLMDA GFTARMEESL DHVAEGETDW
VALLRDFTLD FNPTLDKARE NMTQVKAGMD TGLSCPQCED GRLVVKFGKN GTFLGCANYP
ACSFTSNYIR NNAGEIELVK EEAPEELGPC PKCEDGRLVV KKTKTGGRFV ACSNYPACRH
TKSVSTGVAC PKDGCDGELV EKTSRRGKPF YSCSKYPKCD YAVWDFPVAK PCPLCESKIL
VRKETKARGA HLACPVKDCG YWEKLD