Gene EcDH1_1879 details


Gene Information

Locus tag: EcDH1_1879
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 2031518
End bp: 2033479
Gene Length: 1962 bp
Protein Length: 653 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: DNA topoisomerase III
Protein accession: ACX39537
Protein GI: 260449115
COG category:
COG ID:
TIGRFAM ID:
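
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon (both assumptions are consistent with the values in this record; the function names are illustrative only):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2031518, 2033479)
print(length, protein_length_aa(length))
```

For this record the two functions give 1962 bp and 653 aa, matching the Gene Length and Protein Length fields.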


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 4.11002e-09
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGTTGT TTATTGCCGA AAAACCGAGT CTGGCGCGCG CCATTGCTGA TGTCCTGCCC 
AAACCGCACC GGAAAGGCGA TGGCTTTATC GAGTGCGGTA ATGGTCAGGT GGTGACCTGG
TGTATCGGTC ACCTGCTTGA GCAGGCGCAG CCAGACGCCT ACGACAGCCG CTATGCGCGC
TGGAATCTTG CGGATTTGCC GATTGTCCCG GAAAAGTGGC AATTACAGCC CCGACCCTCC
GTGACCAAAC AACTTAACGT CATCAAACGG TTCCTGCATG AAGCCAGCGA AATCGTTCAC
GCCGGGGACC CGGATCGTGA AGGGCAATTG CTGGTGGATG AAGTGCTGGA CTATCTGCAA
CTGGCACCGG AAAAGCGCCA GCAGGTACAG CGTTGCTTGA TAAACGACCT GAACCCGCAG
GCGGTTGAGC GGGCGATCGA CCGTCTTCGT TCCAACAGTG AGTTTGTACC GCTGTGCGTT
TCTGCGCTGG CGCGAGCGCG TGCCGACTGG CTGTACGGCA TCAATATGAC CCGTGCGTAT
ACCATTCTCG GTCGCAATGC CGGTTATCAG GGCGTACTTT CCGTGGGACG CGTGCAGACG
CCCGTGCTTG GGCTGGTGGT GCGCCGCGAT GAAGAGATTG AAAACTTCGT GGCGAAAGAC
TTCTTTGAAG TCAAAGCACA TATCGTGACA CCTGCCGATG AGCGGTTTAC CGCTATCTGG
CAACCGAGCG AAGCGTGTGA ACCGTACCAG GATGAAGAAG GGCGCTTGTT ACATCGTCCA
CTGGCGGAGC ATGTGGTTAA CCGCATTAGT GGTCAACCGG CTATTGTCAC CAGCTATAAC
GATAAACGGG AATCAGAATC CGCGCCGCTG CCTTTTTCGC TTTCAGCGTT GCAGATTGAA
GCGGCAAAAC GTTTTGGTCT GAGTGCGCAG AACGTGCTTG ATATCTGCCA GAAACTGTAC
GAAACGCACA AGCTAATCAC TTATCCGCGT TCTGATTGTC GCTATTTGCC AGAAGAACAT
TTTGCCGGAC GCCACGCGGT GATGAATGCC ATCAGTGTTC ATGCACCGGA TCTGTTGCCG
CAGCCAGTGG TAGATCCAGA TATACGCAAC CGCTGTTGGG ATGACAAAAA GGTCGATGCG
CACCACGCCA TCATTCCGAC CGCACGGAGT TCTGCGATCA ACCTGACGGA GAACGAAGCG
AAGGTCTATA ACCTGATTGC CCGTCAGTAT CTGATGCAAT TCTGCCCGGA TGCGGTGTTC
CGCAAGTGTG TTATCGAACT GGACATTGCC AAAGGCAAAT TTGTCGCTAA AGCGCGTTTT
CTTGCTGAAG CAGGCTGGCG CACGCTGTTA GGCAGCAAAG AGCGCGATGA AGAAAACGAC
GGCACGCCAC TGCCTGTGGT GGCGAAAGGC GATGAGTTGC TGTGTGAAAA AGGTGAAGTG
GTAGAGCGGC AAACCCAGCC GCCGCGCCAT TTTACCGATG CAACACTGCT TTCGGCGATG
ACCGGGATCG CGCGCTTTGT GCAGGATAAA GATCTGAAAA AGATCCTCCG TGCGACCGAT
GGTCTGGGGA CAGAGGCAAC GCGTGCCGGG ATTATTGAAC TGTTGTTCAA GCGTGGTTTC
CTGACCAAAA AAGGGCGCTA TATCCACTCC ACCGACGCCG GAAAAGCGCT ATTCCATTCG
CTGCCGGAGA TGGCGACGCG ACCGGACATG ACCGCGCACT GGGAATCGGT GCTGACGCAA
ATCAGCGAAA AGCAGTGTCG CTATCAGGAC TTTATGCAGC CGCTGGTGGG GACGCTATAT
CAGCTTATTG ATCAAGCCAA ACGTACGCCG GTGCGGCAGT TTCGCGGCAT TGTGGCTCCG
GGCAGTGGTG GCAGTGCTGA TAAGAAAAAG GCTGCACCGC GTAAACGTAG TGCGAAAAAA
AGTCCGCCAG CAGATGAAGT CGGAAGCGGG GCGATAGCGT AA
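
The GC content field can be checked against the gene sequence. A minimal sketch, assuming GC content is the percentage of G and C bases over the whole sequence, and that the spaces and line breaks in the listing are display formatting only (`gc_percent` is a hypothetical helper name, not part of the database):

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for display."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, as listed above.
print(gc_percent("ATGCGGTTGT TTATTGCCGA"))
```

Passing the full 1962-base sequence instead of the two-block excerpt gives the value to compare with the GC content field.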
 
Protein sequence
MRLFIAEKPS LARAIADVLP KPHRKGDGFI ECGNGQVVTW CIGHLLEQAQ PDAYDSRYAR 
WNLADLPIVP EKWQLQPRPS VTKQLNVIKR FLHEASEIVH AGDPDREGQL LVDEVLDYLQ
LAPEKRQQVQ RCLINDLNPQ AVERAIDRLR SNSEFVPLCV SALARARADW LYGINMTRAY
TILGRNAGYQ GVLSVGRVQT PVLGLVVRRD EEIENFVAKD FFEVKAHIVT PADERFTAIW
QPSEACEPYQ DEEGRLLHRP LAEHVVNRIS GQPAIVTSYN DKRESESAPL PFSLSALQIE
AAKRFGLSAQ NVLDICQKLY ETHKLITYPR SDCRYLPEEH FAGRHAVMNA ISVHAPDLLP
QPVVDPDIRN RCWDDKKVDA HHAIIPTARS SAINLTENEA KVYNLIARQY LMQFCPDAVF
RKCVIELDIA KGKFVAKARF LAEAGWRTLL GSKERDEEND GTPLPVVAKG DELLCEKGEV
VERQTQPPRH FTDATLLSAM TGIARFVQDK DLKKILRATD GLGTEATRAG IIELLFKRGF
LTKKGRYIHS TDAGKALFHS LPEMATRPDM TAHWESVLTQ ISEKQCRYQD FMQPLVGTLY
QLIDQAKRTP VRQFRGIVAP GSGGSADKKK AAPRKRSAKK SPPADEVGSG AIA
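
The protein sequence is the translation of the gene sequence under translation table 11. A minimal sketch of that step, with the codon table built in the conventional T, C, A, G base order. Table 11 uses the same amino-acid assignments as the standard code and differs only in which codons may serve as starts; this sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The name `translate_cds` is illustrative only:

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence listed above.
print(translate_cds("ATGCGGTTGT TTATTGCCGA AAAACCGAGT"))  # MRLFIAEKPS
```

The printed residues match the first block of the protein sequence, and the final codons of the gene (GCG ATA GCG TAA) translate to the terminal AIA followed by a stop.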