Gene Information |
Locus tag | EcSMS35_4867 |
Symbol | |
ID | 6146905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4977307 |
End bp | 4979256 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641619671 |
Product | DNA-binding transcriptional regulator DhaR |
Protein accession | YP_001746778 |
Protein GI | 170682161 |
COG category | [K] Transcription [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3284] Transcriptional activator of acetoin/glycerol metabolism |
TIGRFAM ID | |
|
|
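The coordinates and lengths reported above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4977307, 4979256)
print(length)                     # 1950
print(protein_length_aa(length))  # 649
```

Both results match the Gene Length (1950 bp) and Protein Length (649 aa) fields.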
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.608152 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATATTA TTATAATGAA TAAAGAGTCG TATCATAATG ATCTAAAAAA TAAATGGAAA ATGTTTGTTA AGCATGGTTG GGTCGCGACA AATTCGACTA ATCATGTGAT GCTGAGGTCA TGGCAAAAAT GTCTCAAGCA TTGTGACCCT CGCCACTGGA ACACGCCTGT CAAAGCTTCC GGGCAAACAT TGCAGACGAT TTTTTCTCGT AACGAAGAGT TTATTAGAAT TTCACAAAGA GTTGTTGAAG ACCATTTCAC CCTGGCTGGT GACGACCGGT TGGCATTTTT GATCATTGAC CCACACGGCT GGGTTCTATC GTTGAATGCA GCAGGCGATT ATTCCAGTCA ATTGCGAGAG TTAGGAATTG AATCTGGTAT GTCATGGGCC GAAGATGGCA TTGGAACCAA TGTATATAGT CTTTGTCGGG AAACAAATTT ATATACACAA CTGGAAGGAG CAGAACATTT TAGTGAACAG CTACATTGTT ATGCGATGAG CGCTGCTCCG GTCATTGATT ATTATGGTAA TATCCATGGA TACATTGTAT GTATAATTGA AACTACCGCC GAGCTTGTTA AATTAAAAAC ATCATATTCA TGTGCAACAG AAATCGCTAA TTATATCTAT ATTGAGAATG AACAAAAGTT AATAAACAAA GTACTTTGCC AGCATAATGC TGTGATTGAA TGCATGGATG ACGGTTTTAT TTGTTGGAAT AGTCATTCTT TAATTACGAT GGTTAATTCT CAAGCACAAA CACTACTGAA TATTGATAAA GAAAGTTTAA TTGGTCAGAA CATCCGAAAA GGATTCGTAT TTCCGCCGAT TTTGAATGAG GCAATTACAC AACGGAACAA ACTATCGCAA AAGCAAATTG TCCTCGAATG CCGTGGCGAA TTTATTGAAC TTATGGTTAC CCTTCGCCCG TTAAGCGATG GTTCGTTTTT GCTTTTTCTT CATCCATTAG ACAAAATCAG GAAAATAGCC CAACAGCAAA TAAGCACTAA TGCAAATTTT ACCTTTGACA GTTTACATGC GGCTTCAGGT GGTATGAAGC AGGTATTACT TATCGCTCGC CGGGCAATTA AATCCATCTC TCCGATTTTG ATCAATGGCG AAGAAGGTGT GGGAAAATTG AGTTTGGCGA TGGCAATACA TAATGAGAGC GAGCAACGTG ATGGGCCATT TATTTCTGTA GATTGTCAGA TGCTATCACC AGAAAATATC TTACACGAAC TTCTTGGCTC TGATGTTGGT CCTTCGCCAT CGAAATTTGA ACTGGCTCAT AATGGCACCT TATATCTGGA TAAAGTCGAA TATCTATCAG GGGAAGTTCA GAGTGTTTTA TTGAAAGTAT TGAAAACGGG GCTTGTTACT CGCTCAGACA GTCATCGTTT GATCCCCGTA CGCTTTCGCC TGATTACATG TACCAGTAGT TCTTTACGTG AGTACGTGCA ACAAGGGGCT TTTAGCCGAC AGCTATATTA TGAGATCTCC ATGAATGAAA TTGAAATTCC GCCATTGCGC AAACGTCGTG AAGATCTCAA GCAAATGATT GACGATATTA TTGATAAGTA TCAGGAGCGC ACTCGAAAAA AAATGACAAT CACGCCTGAC GCAAATTCAG TTCTGCTTGA GTACCGTTGG CCTGGAAACA TCTCCGAGTT CAAAAATCGA ATGGAGAAGG TATTTATTAA CTGCAATCGG CTTGTCCTCG GATTAGAGAA TATTCCTCTG GATATCCGAC AAAATAACAG TAGTGGCGAC GATGATATCC CTCATCTTAC TTCACTGGCA 
GAATTGGAGA TGCAAGCTAT TGAGCATACA TGTCGTGTCT GCGAATGGAA TCTAACTAAA GCAGCTGAAG TATTAAAAAT TGGTCGTACA ACATTATGGC GCAAGCTTAA AATCTATAAT CTCTATCCAA ATGTTGAGCA TGCAGATTGA
|
Protein sequence | MDIIIMNKES YHNDLKNKWK MFVKHGWVAT NSTNHVMLRS WQKCLKHCDP RHWNTPVKAS GQTLQTIFSR NEEFIRISQR VVEDHFTLAG DDRLAFLIID PHGWVLSLNA AGDYSSQLRE LGIESGMSWA EDGIGTNVYS LCRETNLYTQ LEGAEHFSEQ LHCYAMSAAP VIDYYGNIHG YIVCIIETTA ELVKLKTSYS CATEIANYIY IENEQKLINK VLCQHNAVIE CMDDGFICWN SHSLITMVNS QAQTLLNIDK ESLIGQNIRK GFVFPPILNE AITQRNKLSQ KQIVLECRGE FIELMVTLRP LSDGSFLLFL HPLDKIRKIA QQQISTNANF TFDSLHAASG GMKQVLLIAR RAIKSISPIL INGEEGVGKL SLAMAIHNES EQRDGPFISV DCQMLSPENI LHELLGSDVG PSPSKFELAH NGTLYLDKVE YLSGEVQSVL LKVLKTGLVT RSDSHRLIPV RFRLITCTSS SLREYVQQGA FSRQLYYEIS MNEIEIPPLR KRREDLKQMI DDIIDKYQER TRKKMTITPD ANSVLLEYRW PGNISEFKNR MEKVFINCNR LVLGLENIPL DIRQNNSSGD DDIPHLTSLA ELEMQAIEHT CRVCEWNLTK AAEVLKIGRT TLWRKLKIYN LYPNVEHAD
|
| |
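The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over all bases and that the spaces in the displayed sequence are only formatting (blocks of 10 bases). The function name `gc_content` is hypothetical, and the record does not state how its own 40% figure was rounded.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace is ignored so that a sequence printed in space-separated
    blocks can be pasted in directly.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence above: 2 of 10 bases are G or C.
print(gc_content("ATGGATATTA"))  # 20.0
```

Passing the full gene sequence, with or without the spaces, gives the value for the whole gene.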