Gene Emin_0158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0158 
Symbol 
ID6262860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp168296 
End bp170593 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content41% 
IMG OID642610622 
ProductDNA topoisomerase I 
Protein accessionYP_001875060 
Protein GI187250578 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACCA AGAAAACAAC AACAAAAAAA ACGGAGTCTC TTTCTAAAGG TAAAAACCTT 
GTTATAGTCG AGTCACCTAC AAAACAAAAA ACAATAAGCA AAATTTTAGG GGCCGATTAT
GTTGTAAAAA GTTCCTTCGG GCATGTTAGG GATTTGCCTT CCAAAGAAAT AGGCGTTGAC
GAAAAAAACG GTTTTAAACC CAAATATGTT CCTGTGGAAA AAGCTAAAAA AATGGTTTCC
GAGCTTGAAA AACTTGCTAA AGGTTCCGAA TATGTTTACT TGGCCACTGA CCCTGACCGC
GAAGGAGAAG CTATTGCCTG GCATTTGGTT GAACTTTTAA AAATACCTGT TGAAAAAATA
CGCCGTATTT TCTTTCATGA AATTACGCCC GCGGCTGTTA AAGCCAGTTT TGACCATGCC
AGAAATATTA ATAAAGATTT GGTTGACGCA CAGCAGGCAA GGCGCGTACT TGACCGTTTG
GTTGGGTACA AACTTTCGCC GCTTCTTTGG AAGAAAATTA CGGGCGGTCT TTCCGCGGGC
AGAGTGCAAA GCGTAGCGGT AAGGCTTCTT GCCGAGCGTG CCAAAGAAAT AGAAAATTTT
AGGGAAGAAG AATATTATTC TTTAACCTCC GAGCTTGAAA AGCAAGGTGA AACGCCCAAA
TTTAACGCGC GCATGCTTAA ATGGAGGGGC AAAAATACTG AAATTATTAC AACATACCAT
TTGTTCGCCG AAGACTATAA AGTAAAAACA ACAGTCTTTA AAAAACCTGA GGATCTTGCC
CCTGTTAATT CTTTATTAAG ACAAGGGCCC TTAACGGTAA GTAAAATTGA AAAAAAAGAA
GTAAAACAAA AAGCCAAACC ACCTTTTATA ACCAGCTCTT TACAGCAGGA AGCGTATAAT
AAAATAGGTT TCCCTTCACA AAAAACTATG ATGACGGCGC AAAGCCTTTA TGAAGGCGTT
GAGATAGCGG GTGAAGTTGT AGGTTTAATT ACGTATATGA GAACAGACTC TTTTAACGTA
TCAAAAGATT TGCAATCCCA AACCAAAAAG TTTATAGCCG GCAAATACGG GGATGATTTT
GTCCCCCCAA CTCCAAATTT TTTTAAAAGC AAAGTAAAAG GCGCTCAGGA AGCGCACGAG
TCTATTCACC CGACTGATGT TTATAAAACG CCAGCTAGTA TAAAAGATTA TTTGAGCGCG
GACCAGTACA AACTTTACGA ACTTATTTGG TTAAGGTTTA TTGCCAGCCA AATGGCGGAC
GCTGTTTTTA ACACAGTATC TGTCGACATA ACGGCAGGCA AAGCTGAAGA ATGCGTTTTA
AGAGCAACGG GCCGCACAGT TAAATTCCCG GGCTTTTTAT CCGTTTATAA AGAAGACGAT
TCGGAAGAAG AGGACGAAGG CTCAGCCTTG CTTCCTAATC TTACGGAAGG CGATGATTTA
AACCTTATTG ACATTGTAAC CAAATCACAT AAAACAGCCC CGCCGCCGAA CTATAATGAG
GCGAGCTTGA TTAAAACGCT TGAAAAGCAC GGTATCGGGC GTCCTTCCAC TTACGCTCCT
ACAATTAAAA CTATTTTGGA CAGGAAATAT ATTATCCGCC AGCCGAAAAC CAACAAACTG
ATTGTGACTG ATTTGGGCGT AACGGTGACA GACCAGTTAA AAGACTTTTT TAAAGATATT
ATGGACCTTT CTTATACTGC GGGCATTGAA GAAAAACTCG ACGATATAGC AGAAGGCGAT
AATGACTGGG TTAAAGTTAT AGGCGATTTT TATGAAGGTT TTAAAAAAGA TTTAGCTACG
GCAGATAAAG ACATGCAGCG CGCCGCGCCC AAACCTTCCG ATGAAAAATG CCCATTATGC
GGCAGCCCCA TGGTAATAAG AAGAAGCAGA TTCGGCGAAT ACCTGGCCTG CTCCACCTAT
CCGGAATGTA AGGGTAAAAT TAATTTAACT TCTTCAGGCG AAAAATTGGC GCCTGAAGTA
ACCGAGGAAA AATGTGAAAA ATGCGGTAGC CCCATGGTTA TACGTTCGGG GCGCAGGGGT
AAGTTTATGG CTTGCAGTGC GTTTCCTAAA TGCAAAAACA CATTTTCTAT TGATGCGAGC
GGCAATAAGG TAGCTTCCTC CGGCCCTATT GAAACAAAAA TTAAATGTGA AAAATGCGGC
AAACCTATGC TGCTTCGCGC AAGCAAACGC GGCGAATTTT TAGGCTGCAG CGGTTACCCT
AAATGTAAAA CAATAGTATC TGTTTCACCT GAGGAAATCG CCAAAATAAA AAAAGAGCAC
GAAGCAGAAA ATAAATAA
 
Protein sequence
MATKKTTTKK TESLSKGKNL VIVESPTKQK TISKILGADY VVKSSFGHVR DLPSKEIGVD 
EKNGFKPKYV PVEKAKKMVS ELEKLAKGSE YVYLATDPDR EGEAIAWHLV ELLKIPVEKI
RRIFFHEITP AAVKASFDHA RNINKDLVDA QQARRVLDRL VGYKLSPLLW KKITGGLSAG
RVQSVAVRLL AERAKEIENF REEEYYSLTS ELEKQGETPK FNARMLKWRG KNTEIITTYH
LFAEDYKVKT TVFKKPEDLA PVNSLLRQGP LTVSKIEKKE VKQKAKPPFI TSSLQQEAYN
KIGFPSQKTM MTAQSLYEGV EIAGEVVGLI TYMRTDSFNV SKDLQSQTKK FIAGKYGDDF
VPPTPNFFKS KVKGAQEAHE SIHPTDVYKT PASIKDYLSA DQYKLYELIW LRFIASQMAD
AVFNTVSVDI TAGKAEECVL RATGRTVKFP GFLSVYKEDD SEEEDEGSAL LPNLTEGDDL
NLIDIVTKSH KTAPPPNYNE ASLIKTLEKH GIGRPSTYAP TIKTILDRKY IIRQPKTNKL
IVTDLGVTVT DQLKDFFKDI MDLSYTAGIE EKLDDIAEGD NDWVKVIGDF YEGFKKDLAT
ADKDMQRAAP KPSDEKCPLC GSPMVIRRSR FGEYLACSTY PECKGKINLT SSGEKLAPEV
TEEKCEKCGS PMVIRSGRRG KFMACSAFPK CKNTFSIDAS GNKVASSGPI ETKIKCEKCG
KPMLLRASKR GEFLGCSGYP KCKTIVSVSP EEIAKIKKEH EAENK