Gene Information |
Locus tag | EcHS_A2034 |
Symbol | |
ID | 5593656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2030237 |
End bp | 2031034 |
Gene Length | 798 bp |
Protein Length | 265 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640921178 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001458723 |
Protein GI | 157161405 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 51 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCTGGCAA AAGATAAGAC TAACCTAAAG ATTGAAGAAA TACGGATGCA TAAACATCAT GAGATTCACA GGGTTAAGCC TCTTATGCCA GCTTTATGTC GTATCCGTCA GGGAAAAAAA GTTATCAATT GGGAGACGCA TAGTCTAACT GTTGATAATA ATCAGATAAT ATTATTTCCC TGTGGATATG AATTTTATAT TGCGAATTAT CCAGAAGCAG GTCTTTATCT TGCAGAAATG CTTTATTATC CCATTGATCT AATTGAAAAG TTTCAAAAAT TTTATGCGAT AACTGATCAA ATTCGTAACA CGACAGGTTT CTGCTTACCT CAGAACCCCG AGTTAATATA TTGTTGGGAG CAACTAAAAA CATCTATTTC CCGAGGCTTC TCAACGCAAA TTCAGGAACA CTTAGCAATG GGCGTTCTAC TTTCATTAGG AGCACATCAT GTTAATTGTT TACTTTTATC AGATAGTAAA CAATCATTAA TAAGTCGTTG TTATAACCTA ATGCTATCCG AACCTGGAAC AAAATGGACA GCAAACAAGG TAGCGAGATA TCTCTACATT TCTGTTTCCA CATTGCATCG CCGTCTGGCA AGCGAGGGAG TAAGTTTTCA AAGTATATTG GACGACGTGA GGTTAAATAA TGCGTTGTCT GCTATACAAA CGACAGTAAA ACCCATCAGC GAGATTGCCA GGGAAAATGG TTACAAGTGT CCTTCTCGTT TTACTGAAAG GTTCCATAAT CGTTTTAAGA TAACACCAAG AGAGCTAAGA AAAGCGTCCA GAGAGTAA
|
Protein sequence | MLAKDKTNLK IEEIRMHKHH EIHRVKPLMP ALCRIRQGKK VINWETHSLT VDNNQIILFP CGYEFYIANY PEAGLYLAEM LYYPIDLIEK FQKFYAITDQ IRNTTGFCLP QNPELIYCWE QLKTSISRGF STQIQEHLAM GVLLSLGAHH VNCLLLSDSK QSLISRCYNL MLSEPGTKWT ANKVARYLYI SVSTLHRRLA SEGVSFQSIL DDVRLNNALS AIQTTVKPIS EIARENGYKC PSRFTERFHN RFKITPRELR KASRE
|
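The length fields in this record follow from the coordinates: the Start bp and End bp values are inclusive, and the Protein Length excludes the terminal stop codon. A minimal Python sketch of that arithmetic is given here as a consistency check. The function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers introduced for illustration and are not part of the source database. The GC helper assumes a plain A/C/G/T sequence with optional whitespace, as in the Gene sequence field.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Values taken from the Start bp / End bp fields of this record.
    length = gene_length(2030237, 2031034)
    print(length)                  # 798, matching Gene Length
    print(protein_length(length))  # 265, matching Protein Length
```

Passing the Gene sequence field to `gc_percent` gives a value that can be compared with the reported GC content. The sequence is listed in coding orientation (it begins with ATG and ends with the stop codon TAA), so no reverse complement is needed even though the gene lies on the minus strand.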