Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5360 |
Symbol | rhaR |
ID | 6968742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 5001629 |
End bp | 5002567 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643389015 |
Product | transcriptional activator RhaR |
Protein accession | YP_002273424 |
Protein GI | 209396565 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.73393 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTTCT GCAATAACGC GAATCTTCTC AACGTATTTG TACGCCATAT TGCGAATAAT CAATTTCGTT CTCTGGCCGA GGTAGCCACG GTGGCGCATC AGTTAAAACT TCTCAAAGAT GATTTTTTTG CCAGCGACCA GCAGGCAGTC GCTGTGGCTG ACCGTTATCC GCAAGATGTC TTTGCTGAAC ATACACATGA TTTTTGTGAG CTGGTGATTG TCTGGCGCGG TAATGGTCTG CATGTACTCA ACGATCGCCC TTATCGCATT ACCCGTGGCG ATCTCTTTTA CATTCATGCT GACGATAAAC ACTCCTACGC TTCCGTTAAC GATCTGGTTT TGCAGAATAT TATTTATTGC CCGGAGCGGC TGAAGCTGAA TCTTGACTGG CAGGGGGCGA TTCCGGGATT TAGCGCCAGC GCAGGGCAAC CACACTGGCG CTTAGGTAGC ATGGGGATGG CACAGGCGCG GCAGGTTATC GGTCAGCTTG AGCATGAAAG TAGTCAGCAT GTGCCGTTTG CTAACGAAAT GGCAGAGTTG CTGTTCGGGC AGTTGGTGAT GTTGCTGAAT CGCCATCGTT ACACCAGTGA TTCGTTGCCG CCAACATCCA GCGAAACGTT GCTGGATAAG CTGATTACCC GGTTGGCGGC TAGCCTGAAA AGTCCCTTTG CGCTGGATAA ATTTTGTGAT GAGGCATCGT GCAGTGAGCG CGTTTTGCGT CAGCAATTTC GCCAGCAGAC TGGAATGACC ATCAATCAAT ATCTGCGGCA GGTCAGAGTG TGTCATGCGC AATATCTTCT CCAGCATAGC CGCCTGTTAA TCAGTGATAT TTCGACCGAA TGTGGCTTTG AAGATAGTAA CTATTTTTCG GTGGTGTTTA CCCGGGAAAC CGGGATGACG CCCAGCCAGT GGCGTCATCT CAATTCGCAG AAAGATTAA
|
Protein sequence | MAFCNNANLL NVFVRHIANN QFRSLAEVAT VAHQLKLLKD DFFASDQQAV AVADRYPQDV FAEHTHDFCE LVIVWRGNGL HVLNDRPYRI TRGDLFYIHA DDKHSYASVN DLVLQNIIYC PERLKLNLDW QGAIPGFSAS AGQPHWRLGS MGMAQARQVI GQLEHESSQH VPFANEMAEL LFGQLVMLLN RHRYTSDSLP PTSSETLLDK LITRLAASLK SPFALDKFCD EASCSERVLR QQFRQQTGMT INQYLRQVRV CHAQYLLQHS RLLISDISTE CGFEDSNYFS VVFTRETGMT PSQWRHLNSQ KD
|
| |