Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4439 |
Symbol | rhaR |
ID | 5589229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 4426229 |
End bp | 4427167 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640928053 |
Product | transcriptional activator RhaR |
Protein accession | YP_001465397 |
Protein GI | 157156879 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTTTCT GCAATAACGC GAATCTTCTC AACGTATTTG TACGCCATAT TGCGAATAAT CAACTTCGTT CTCTGGCCGA GGTAGCCACG GTGGCGCATC AGTTAAAACT TCTCAAAGAT GATTTTTTTG CCAGCGACCA GCAGGCAGTC GCTGTGGCTG ACCGTTATCC GCAAGATGTT TTTGCCGAAC ATACACATGA TTTTTGTGAG CTGGTGATTG TCTGGCGCGG TAATGGTCTG CATGTACTCA ACGATCGCCC TTATCGCATT ACCCGTGGCG ATCTCTTTTA CATTCATGCT GACGATAAAC ACTCCTACGC TTCCGTTAAC GATCTGGTTT TGCAGAATAT TATTTATTGC CCGGAGCGGC TGAAGCTGAA TCTTGACTGG CAGGGAGCGA TTCCGGGATT TAGCGCCAGC GCAGGGCAAC CACACTGGCG CTTAGGTAGC GTGGGGATGG CGCAGGCGCG GCAGGTTATC GGTCAGCTTG AGCATGAAAG TAGTCAGCAT GTGTCGTTTG CTAACGAAAT GGCTGAGTTG CTGTTCGGGC AGTTGGTGAT GTTGCTGAAT CGCCATCGTT ACACCAGTGA TTCGTTGCCG CCAACATCCA GCGAAACGTT GCTGGATAAG CTGATTACCC GGCTGGCGGC TAGCCTGAAA AGTCCCTTTG CGCTGGATAA ATTTTGTGAT GAGGCATCGT GCAGTGAGCG CGTTTTGCGT CAGCAATTTC GCCAGCAGAC TGGAATGACC ATCAATCAAT ATCTGCGGCA GGTCAGAGTG TGTCATGCGC AATATCTTCT CCAGCATAGC CGCCTGTTAA TCAGTGATAT TTCGACCGAA TGTGGCTTTG AAGATAGTAA CTATTTTTCG GTGGTGTTTA CCCGGGAAAC CGGGATGACG CCCAGCCAGT GGCGTCATCT CAATTCGCAG AAAGATTAA
|
Protein sequence | MVFCNNANLL NVFVRHIANN QLRSLAEVAT VAHQLKLLKD DFFASDQQAV AVADRYPQDV FAEHTHDFCE LVIVWRGNGL HVLNDRPYRI TRGDLFYIHA DDKHSYASVN DLVLQNIIYC PERLKLNLDW QGAIPGFSAS AGQPHWRLGS VGMAQARQVI GQLEHESSQH VSFANEMAEL LFGQLVMLLN RHRYTSDSLP PTSSETLLDK LITRLAASLK SPFALDKFCD EASCSERVLR QQFRQQTGMT INQYLRQVRV CHAQYLLQHS RLLISDISTE CGFEDSNYFS VVFTRETGMT PSQWRHLNSQ KD
|
| |