Gene EcHS_A1305 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1305 
SymboldhaR 
ID5593171 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1299505 
End bp1301424 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content51% 
IMG OID640920462 
ProductDNA-binding transcriptional regulator DhaR 
Protein accessionYP_001458023 
Protein GI157160705 
COG category[K] Transcription
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3284] Transcriptional activator of acetoin/glycerol metabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones53 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGGCG CTTTTAACAA CGATGGTCGG GGCATATCTC CCTTAATTGC AACCTCCTGG 
GAGCGATGCA ATAAGCTGAT GAAACGGGAG ACATGGAACG TACCACATCA GGCCCAGGGC
GTGACATTTG CTTCTATTTA TCGGCGTAAG AAAGCGATGC TGACGCTCGG GCAGGCTGCG
CTGGAAGATG CCTGGGAATA TATGGCACCG CGAGAGTGTG CGCTGTTTAT CCTCGATGAA
ACCGCCTGCA TTCTCAGCCG TAATGGCGAT CCGCAAACCT TGCAGCAGCT AAGTGCACTG
GGATTCAATG ACGGCACGTA TTGCGCCGAG GGAATTATTG GTACTTGTGC GCTATCGTTA
GCGGCTATCT CTGGTCAGGC CGTGAAAACG ATGGCCGATC AACATTTCAA ACAGGCACTC
TGGAACTGGG CCTTTTGTGC AACGCCGTTG TTTGACAGCA AGGGCCGATT GACGGGAACA
ATAGCGCTGG CGTGTCCGGT TGAGCAAACT ACCGCAGCTG ATTTGCCGTT GACGTTGGCA
ATCGCCCGCG AGGTCGGAAA TTTACTGCTG ACGGACAGTT TGCTCGCTGA AACTAACCGT
CATTTAAATC AACTTAATGC CCTGTTAGAA AGTATGGATG ATGGCGTGAT TAGCTGGGAT
GAGCAGGGTA ATTTGCAATT TATTAATGCC CAGGCGGCGC GGGTCTTGCG CCTTGACGCG
ACGGCAAGTC AGGGACGGGC AATCACTGAA CTCTTAACGT TACCCGCCGT ATTGCAACAA
GCAATAAAAC AGGCACATCC GCTCAAACAC GTAGAAGCAA CCTTTGAAAG CCAGCACCAG
TTTATTGATG CGGTGATAAC CCTTAAACCG ATAATAGAAA CGCAGGGAAC CAGCTTTATT
TTGTTGCTCC ATCCTGTGGA ACAGATGCGG CAGTTAATGA CCAGTCAATT AGGAAAAGTC
AGCCATACCT TCGCTCATAT GCCACAGGAC GATCCGCAAA CCCGCCGCTT GATTCATTTT
GGTCGCCAGG CGGCGCGCAG TAGCTTTCCT GTCCTGCTTT GTGGAGAAGA GGGCGTGGGC
AAGGCACTGC TAAGTCAGGC AATTCATAAT GAAAGCGAGC GTGCTGCAGG TCCTTATATC
GCCGTCAATT GTGAGTTATA TGGTGATGCT GCGCTGGCGG AAGAATTTAT TGGTGGCGAT
CGCACGGACA ATGAAAATGG CCGTCTGAGT CGGCTGGAAC TGGCACACGG CGGCACGCTG
TTTCTTGAAA AGATTGAATA TCTGGCGGTG GAGTTACAGT CTGCTTTGCT TCAGGTTATC
AAGCAGGGGG TTATCACGCG ACTGGATGCG CGGCGTTTAA TACCAATTGA TGTCAAAGTG
ATTGCCACAA CGACCGCGGA CCTCGCAATG CTGGTGGAAC AAAATCGTTT TAGTCGCCAG
CTGTATTACG CGCTGCATGC ATTTGAAATT ACCATCCCGC CTCTGCGTAT GCGGCGTGGC
AGCATTCCGG CGCTGGTGAA TAACAAATTA CGCAGTCTTG AAAAACGCTT CTCTACGCGG
CTGAAAATTG ATGACGATGC CCTCGCTCGC CTGGTTTCTT GTGCATGGCC AGGCAACGAT
TTTGAACTTT ACAGCGTCAT CGAGAATCTT GCTCTGAGTA GTGATAACGG GCGCATTCGC
GTCAGTGATT TGCCGGAACA TCTGTTTACC GAGCAGGCGA CAGATGATGT CAGCGCCACC
CGCCTTTCCA CCAGTCTGTC ATTTGCGGAA GTTGAAAAAG AGGCAATTAT TAACGCAGCC
CAGGTCACAG GCGGTCGCAT TCAGGAAATG TCGGCTTTAC TTGGGATCGG CCGCACTACG
CTGTGGCGGA AAATGAAGCA ACATGGCATT GATGCAGGGC AGTTTAAGCG CCGGGTATGA
 
Protein sequence
MSGAFNNDGR GISPLIATSW ERCNKLMKRE TWNVPHQAQG VTFASIYRRK KAMLTLGQAA 
LEDAWEYMAP RECALFILDE TACILSRNGD PQTLQQLSAL GFNDGTYCAE GIIGTCALSL
AAISGQAVKT MADQHFKQAL WNWAFCATPL FDSKGRLTGT IALACPVEQT TAADLPLTLA
IAREVGNLLL TDSLLAETNR HLNQLNALLE SMDDGVISWD EQGNLQFINA QAARVLRLDA
TASQGRAITE LLTLPAVLQQ AIKQAHPLKH VEATFESQHQ FIDAVITLKP IIETQGTSFI
LLLHPVEQMR QLMTSQLGKV SHTFAHMPQD DPQTRRLIHF GRQAARSSFP VLLCGEEGVG
KALLSQAIHN ESERAAGPYI AVNCELYGDA ALAEEFIGGD RTDNENGRLS RLELAHGGTL
FLEKIEYLAV ELQSALLQVI KQGVITRLDA RRLIPIDVKV IATTTADLAM LVEQNRFSRQ
LYYALHAFEI TIPPLRMRRG SIPALVNNKL RSLEKRFSTR LKIDDDALAR LVSCAWPGND
FELYSVIENL ALSSDNGRIR VSDLPEHLFT EQATDDVSAT RLSTSLSFAE VEKEAIINAA
QVTGGRIQEM SALLGIGRTT LWRKMKQHGI DAGQFKRRV