Gene EcSMS35_4867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4867 
Symbol 
ID6146905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4977307 
End bp4979256 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content40% 
IMG OID641619671 
ProductDNA-binding transcriptional regulator DhaR 
Protein accessionYP_001746778 
Protein GI170682161 
COG category[K] Transcription
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3284] Transcriptional activator of acetoin/glycerol metabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.608152 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATTA TTATAATGAA TAAAGAGTCG TATCATAATG ATCTAAAAAA TAAATGGAAA 
ATGTTTGTTA AGCATGGTTG GGTCGCGACA AATTCGACTA ATCATGTGAT GCTGAGGTCA
TGGCAAAAAT GTCTCAAGCA TTGTGACCCT CGCCACTGGA ACACGCCTGT CAAAGCTTCC
GGGCAAACAT TGCAGACGAT TTTTTCTCGT AACGAAGAGT TTATTAGAAT TTCACAAAGA
GTTGTTGAAG ACCATTTCAC CCTGGCTGGT GACGACCGGT TGGCATTTTT GATCATTGAC
CCACACGGCT GGGTTCTATC GTTGAATGCA GCAGGCGATT ATTCCAGTCA ATTGCGAGAG
TTAGGAATTG AATCTGGTAT GTCATGGGCC GAAGATGGCA TTGGAACCAA TGTATATAGT
CTTTGTCGGG AAACAAATTT ATATACACAA CTGGAAGGAG CAGAACATTT TAGTGAACAG
CTACATTGTT ATGCGATGAG CGCTGCTCCG GTCATTGATT ATTATGGTAA TATCCATGGA
TACATTGTAT GTATAATTGA AACTACCGCC GAGCTTGTTA AATTAAAAAC ATCATATTCA
TGTGCAACAG AAATCGCTAA TTATATCTAT ATTGAGAATG AACAAAAGTT AATAAACAAA
GTACTTTGCC AGCATAATGC TGTGATTGAA TGCATGGATG ACGGTTTTAT TTGTTGGAAT
AGTCATTCTT TAATTACGAT GGTTAATTCT CAAGCACAAA CACTACTGAA TATTGATAAA
GAAAGTTTAA TTGGTCAGAA CATCCGAAAA GGATTCGTAT TTCCGCCGAT TTTGAATGAG
GCAATTACAC AACGGAACAA ACTATCGCAA AAGCAAATTG TCCTCGAATG CCGTGGCGAA
TTTATTGAAC TTATGGTTAC CCTTCGCCCG TTAAGCGATG GTTCGTTTTT GCTTTTTCTT
CATCCATTAG ACAAAATCAG GAAAATAGCC CAACAGCAAA TAAGCACTAA TGCAAATTTT
ACCTTTGACA GTTTACATGC GGCTTCAGGT GGTATGAAGC AGGTATTACT TATCGCTCGC
CGGGCAATTA AATCCATCTC TCCGATTTTG ATCAATGGCG AAGAAGGTGT GGGAAAATTG
AGTTTGGCGA TGGCAATACA TAATGAGAGC GAGCAACGTG ATGGGCCATT TATTTCTGTA
GATTGTCAGA TGCTATCACC AGAAAATATC TTACACGAAC TTCTTGGCTC TGATGTTGGT
CCTTCGCCAT CGAAATTTGA ACTGGCTCAT AATGGCACCT TATATCTGGA TAAAGTCGAA
TATCTATCAG GGGAAGTTCA GAGTGTTTTA TTGAAAGTAT TGAAAACGGG GCTTGTTACT
CGCTCAGACA GTCATCGTTT GATCCCCGTA CGCTTTCGCC TGATTACATG TACCAGTAGT
TCTTTACGTG AGTACGTGCA ACAAGGGGCT TTTAGCCGAC AGCTATATTA TGAGATCTCC
ATGAATGAAA TTGAAATTCC GCCATTGCGC AAACGTCGTG AAGATCTCAA GCAAATGATT
GACGATATTA TTGATAAGTA TCAGGAGCGC ACTCGAAAAA AAATGACAAT CACGCCTGAC
GCAAATTCAG TTCTGCTTGA GTACCGTTGG CCTGGAAACA TCTCCGAGTT CAAAAATCGA
ATGGAGAAGG TATTTATTAA CTGCAATCGG CTTGTCCTCG GATTAGAGAA TATTCCTCTG
GATATCCGAC AAAATAACAG TAGTGGCGAC GATGATATCC CTCATCTTAC TTCACTGGCA
GAATTGGAGA TGCAAGCTAT TGAGCATACA TGTCGTGTCT GCGAATGGAA TCTAACTAAA
GCAGCTGAAG TATTAAAAAT TGGTCGTACA ACATTATGGC GCAAGCTTAA AATCTATAAT
CTCTATCCAA ATGTTGAGCA TGCAGATTGA
 
Protein sequence
MDIIIMNKES YHNDLKNKWK MFVKHGWVAT NSTNHVMLRS WQKCLKHCDP RHWNTPVKAS 
GQTLQTIFSR NEEFIRISQR VVEDHFTLAG DDRLAFLIID PHGWVLSLNA AGDYSSQLRE
LGIESGMSWA EDGIGTNVYS LCRETNLYTQ LEGAEHFSEQ LHCYAMSAAP VIDYYGNIHG
YIVCIIETTA ELVKLKTSYS CATEIANYIY IENEQKLINK VLCQHNAVIE CMDDGFICWN
SHSLITMVNS QAQTLLNIDK ESLIGQNIRK GFVFPPILNE AITQRNKLSQ KQIVLECRGE
FIELMVTLRP LSDGSFLLFL HPLDKIRKIA QQQISTNANF TFDSLHAASG GMKQVLLIAR
RAIKSISPIL INGEEGVGKL SLAMAIHNES EQRDGPFISV DCQMLSPENI LHELLGSDVG
PSPSKFELAH NGTLYLDKVE YLSGEVQSVL LKVLKTGLVT RSDSHRLIPV RFRLITCTSS
SLREYVQQGA FSRQLYYEIS MNEIEIPPLR KRREDLKQMI DDIIDKYQER TRKKMTITPD
ANSVLLEYRW PGNISEFKNR MEKVFINCNR LVLGLENIPL DIRQNNSSGD DDIPHLTSLA
ELEMQAIEHT CRVCEWNLTK AAEVLKIGRT TLWRKLKIYN LYPNVEHAD