Gene EcE24377A_1347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_1347 
SymboldhaR 
ID5585922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp1341748 
End bp1343667 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content51% 
IMG OID640925043 
ProductDNA-binding transcriptional regulator DhaR 
Protein accessionYP_001462452 
Protein GI157159109 
COG category[K] Transcription
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3284] Transcriptional activator of acetoin/glycerol metabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGGCG CTTTTAACAA CGATGGTCGG GGCATATCTC CCTTAATTGC AACCTCCTGG 
GAGCGATGCA ATAAGCTGAT GAAACGGGAG ACATGGAACG TACCACATCA GGCCCAGGGC
GTGACATTTG CTTCTATTTA TCGGCGTAAG AAAGCGATGC TGACGCTCGG GCAGGCTGCG
CTGGAAGATG CCTGGGAATA TATGGCACCG CGAGAGTGTG CGCTGTTTAT CCTCGATGAA
ACCGCCTGCA TTCTCAGCCG TAATGGCGAT CCGCAAACCT TGCAGCAGCT AAGTGCACTG
GGATTCAATG ACGGCACGTA TTGCGCCGAG GGAATTATTG GTACTTGTGC GCTATCGTTA
GCGGCTATCT CTGGTCAGGC CGTGAAAACG ATGGCCGATC AACATTTCAA ACAGGCACTC
TGGAACTGGG CCTTTTGTGC AACGCCGTTG TTTGACAGCA AGGGCCGATT GACGGGAACA
ATAGCGCTGG CGTGTCCGGT TGAGCAAACT ACCGCAGCTG ATTTGCCGTT GACGTTGGCA
ATCGCCCGCG AGGTCGGAAA TTTACTGCTG ACGGACAGTT TGCTCGCTGA AACTAACCGT
CATTTAAATC AACTTAATGC CCTGTTAGAA AGTATGGATG ATGGCGTGAT TAGCTGGGAC
GAGCAGGGTA ATTTGCAATT TATTAATGCC CAGGCGGCGC GGGTCTTGCG CCTTGACGCG
ACGGCAAGTC AGGGAAGGGC AATCACTGAA CTCTTAACGT TACCCGCCGT ATTGCAACAA
GCAATAAAAC AGGCACATCC GCTCAAACAC GTAGAAGCAA CCTTTGAAAG TCAGCATCAG
TTTATTGATG CGGTGATAAC CCTTAAACCG ATAATAGAAA CGCAGGGAAC CAGCTTTATT
TTGTTGCTCC ATCCTGTGGA ACAGATGCGG CAGTTGATGA CCAGTCAATT AGGAAAAGTC
AGCCATACCT TCGCTCATAT GCCACAGGAC GATCCGCAAA CCCGCCGCTT GATTCATTTT
GGTCGCCAGG CGGCGCGCAG TAGCTTTCCT GTCCTGCTTT GTGGAGAAGA GGGCGTGGGA
AAGGCACTGC TAAGTCAGGC AATTCATAAT GAAAGCGAGC GTGCTGCAGG TCCTTATATC
GCCGTCAATT GTGAGTTATA TGGTGATGCG GCGCTGGCGG AAGAATTTAT TGGTGGCGAT
CGCACGGACA ATGAAAATGG CCGTCTGAGT CGGCTGGAAC TGGCACACGG CGGCACGCTG
TTTCTTGAAA AGATTGAATA TCTGGCGGTG GAGTTACAGT CTGCTTTGCT TCAGGTTATC
AAGCAGGGTG TTATCACGCG ACTGGATGCG CGGCGTTTAA TACCAGTTGA TGTCAAAGTG
ATTGCCACAA CGACCGCGGA CCTCGCAATG CTGGTGGAAC AAAATCGTTT TAGTCGCCAG
CTGTATTACG CGTTGCATGC ATTTGAAATT ACCATCCCGC CTCTGCGTAT GCGGCGTGGC
AGCATTCCGG CGCTGGTGAA TAACAAATTA CGCAGTCTTG AAAAACGCTT CTCTACGCGG
CTGAAAATTG ATGACGATGC CCTCGCTCGC CTGGTTTCCT GTGCATGGCC TGGCAACGAT
TTTGAACTTT ACAGCGTCAT CGAGAATCTT GCTCTGAGTA GTGATAATGG GCGCATTCGC
GTCAGTGATT TGCCGGAACA TCTGTTTACC GAGCAGGCAA CAGATGATGT CAGCGCCACA
CGCCTTTCCA CCAGTCTGTC ATTTGCGGAA GTTGAAAAAG AGGCAATTAT TAACGCAGCC
CAGGTCACAG GCGGTCGCAT TCAGGAAATG TCGGCTTTAC TTGGGATCGG CCGCACTACG
CTGTGGCGGA AAATGAAGCA ACATGGCATT GATGCAGGGC AGTTTAAGCG CCGGGTATGA
 
Protein sequence
MSGAFNNDGR GISPLIATSW ERCNKLMKRE TWNVPHQAQG VTFASIYRRK KAMLTLGQAA 
LEDAWEYMAP RECALFILDE TACILSRNGD PQTLQQLSAL GFNDGTYCAE GIIGTCALSL
AAISGQAVKT MADQHFKQAL WNWAFCATPL FDSKGRLTGT IALACPVEQT TAADLPLTLA
IAREVGNLLL TDSLLAETNR HLNQLNALLE SMDDGVISWD EQGNLQFINA QAARVLRLDA
TASQGRAITE LLTLPAVLQQ AIKQAHPLKH VEATFESQHQ FIDAVITLKP IIETQGTSFI
LLLHPVEQMR QLMTSQLGKV SHTFAHMPQD DPQTRRLIHF GRQAARSSFP VLLCGEEGVG
KALLSQAIHN ESERAAGPYI AVNCELYGDA ALAEEFIGGD RTDNENGRLS RLELAHGGTL
FLEKIEYLAV ELQSALLQVI KQGVITRLDA RRLIPVDVKV IATTTADLAM LVEQNRFSRQ
LYYALHAFEI TIPPLRMRRG SIPALVNNKL RSLEKRFSTR LKIDDDALAR LVSCAWPGND
FELYSVIENL ALSSDNGRIR VSDLPEHLFT EQATDDVSAT RLSTSLSFAE VEKEAIINAA
QVTGGRIQEM SALLGIGRTT LWRKMKQHGI DAGQFKRRV