Gene Information |
Locus tag | EcHS_A4136 |
Symbol | rhaR |
ID | 5591247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 4126565 |
End bp | 4127503 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640923238 |
Product | transcriptional activator RhaR |
Protein accession | YP_001460697 |
Protein GI | 157163379 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
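The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length = gene_length(4126565, 4127503)
print(length)                           # 939
print(expected_protein_length(length))  # 312
```

Both results agree with the Gene Length (939 bp) and Protein Length (312 aa) rows.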
|
|
Plasmid Coverage information |
Num covering plasmid clones | 61 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTTCT GCAATAACGC GAATCTTCTC AACGTATTTG TACGCCATAT TGCGAATAAT CAACTTCGTT CTCTGGCCGA GGTAGCCACG GTGGCGCATC AGTTAAAACT TCTCAAAGAT GATTTTTTTG CCAGCGACCA GCAGGCAGTC GCTGTGGCTG ACCGTTATCC GCAAGATGTC TTTGCTGAAC ATACACATGA TTTTTGTGAG CTGGTGATTG TCTGGCGCGG TAATGGCCTG CATGTACTCA ACGATCGCCC TTATCGCATT ACCCGTGGCG ATCTCTTTTA CATTCATGCT GACGATAAAC ACTCCTACGC TTCCGTTAAC GATCTGGTTT TGCAGAATAT TATTTATTGC CCGGAGCGTC TGAAGCTGAA TCTTGACTGG CAGGGGGCGA TTCCGGGATT TAACGCCAGC GCAGGGCAAC CACACTGGCG CTTAGGTAGC ATGGGGATGG CGCAGGCGCG GCAGGTTATC GGTCAGCTTG AGCATGAAAG TAGTCAGCAT GTGCCGTTTG CTAACGAAAT GGCTGAGTTG CTGTTCGGGC AGTTGGTGAT GTTGCTGAAT CGCCATCGTT ACACCAGTGA TTCGTTGCCG CCAACATCCA GCGAAACGTT GCTGGATAAG CTGATTACCC GGCTGGCGGC TAGCCTGAAA AGTCCCTTTG CGCTGGATAA ATTTTGTGAT GAGGCATCGT GCAGTGAGCG CGTTTTGCGT CAGCAATTTC GCCAGCAGAC TGGAATGACC ATCAATCAAT ATCTGCGACA GGTCAGAGTG TGTCATGCGC AATATCTTCT CCAGCATAGC CGCCTGTTAA TCAGTGATAT TTCGACCGAA TGTGGCTTTG AAGATAGTAA CTATTTTTCG GTGGTGTTTA CCCGGGAAAC CGGGATGACG CCCAGCCAGT GGCGTCATCT CAATTCGCAG AAAGATTAA
|
Protein sequence | MAFCNNANLL NVFVRHIANN QLRSLAEVAT VAHQLKLLKD DFFASDQQAV AVADRYPQDV FAEHTHDFCE LVIVWRGNGL HVLNDRPYRI TRGDLFYIHA DDKHSYASVN DLVLQNIIYC PERLKLNLDW QGAIPGFNAS AGQPHWRLGS MGMAQARQVI GQLEHESSQH VPFANEMAEL LFGQLVMLLN RHRYTSDSLP PTSSETLLDK LITRLAASLK SPFALDKFCD EASCSERVLR QQFRQQTGMT INQYLRQVRV CHAQYLLQHS RLLISDISTE CGFEDSNYFS VVFTRETGMT PSQWRHLNSQ KD
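The relationship between the gene sequence and the protein sequence can be checked with a short script. This is a minimal sketch, not the database's own method: it assumes the sequence is written 5' to 3' on the coding strand, that the blocks of ten characters are separated only by whitespace, and that the amino-acid assignments of translation table 11 match the standard code (table 11 differs from it in its set of permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); amino acids as in the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and last nine bases of the gene sequence above.
print(translate("ATGGCTTTCT GCAATAACGC GAATCTTCTC"))  # MAFCNNANLL
print(translate("AAAGATTAA"))                         # KD
```

Passing the full gene sequence to `translate` should reproduce the protein sequence once the spaces are removed from it, and `gc_fraction` on the full sequence can be compared against the GC content row.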