Gene Information |
Locus tag | EcSMS35_4299 |
Symbol | rhaR |
ID | 6143463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4400984 |
End bp | 4401922 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641619119 |
Product | transcriptional activator RhaR |
Protein accession | YP_001746243 |
Protein GI | 170680171 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
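The coordinate and length fields above agree with one another, which a short check can illustrate. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that Gene Length includes the stop codon; `gene_length` and `protein_length` are hypothetical helper names, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length_bp = gene_length(4400984, 4401922)
print(length_bp)                  # 939
print(protein_length(length_bp))  # 312
```

Both results match the Gene Length (939 bp) and Protein Length (312 aa) fields, which supports the inclusive-coordinate assumption for this record.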
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.0471375 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTTCT GCAATAACGC GAATCTTCTC AACGTATTTG TACGCCATAT TGCGAATAAT CAACTTCGTT CTCTGACCGA GGTAGCCACG GTGGCGCATC AGTTAAAACT TCTCAAAGAT GATTTTTTTG CCAGCGACCA GCAGGCAGTC GCTGTGGCTG ACCGTTATCC GCAAGATGTC TTTGCTGAAC ATACACATGA TTTTTGTGAG CTGGTAATTG TCTGGCGCGG TAATGGTCTG CATGTACTCA ACGATCGCCC TTATCGCATT ACCCGTGGCG ATCTCTTTTA CATTCATGCT GACGATAAAC ACTCCTACGC TTCCGTTAAC GATCTGGTTT TGCAGAATAT TATTTATTGC CCGGAGCGGC TGAAGCTGAA TCTTGACTGG CAGGGGGCGA TTCCGGGATT TAGCGCCAGC GCAGGGCAAC CACACTGGCG CTTAGGTAGC ATGGGGATGG CACAGGCGCG GCAGATTATC GGTCAGCTTG AGCATGAAAG TAGTCAGCAT GTGCCGTTTG CTAACGAAAT GGCAGAGTTG CTGTTCGGGC AGTTGGTGAT GTCGCTGAAT CGCCATCGTT ACACCAGCGA TTCGTTGCCG CCAACATCCA GCGAAACGTT GCTGGATAAG CTGATTACCC GGCTGGCGGC CAGCCTGAAA AGTCCCTTTG CGCTGGATGA GTTTTGTGAT GAGGCATCGT GCAGTGAGCG CGTTTTGCGT CAGCAATTTC GCCAGCAGAC TGGAATGACC ATCAATCAAT ATCTGCGGCA GGTCAGAGTG TGTCATGCGC AATATCTTCT CCAGCATAGC CGCCTGTTAA TCAGTGATAT TTCAACCGAA TGTGGCTTTG AAGATAGTAA CTATTTTTCG GTGGTGTTTA CCCGGGAAAC CGGGATGACG CCCAGCCAGT GGCGTCATCT CAATTCGCAG AAAGATTAA
|
Protein sequence | MAFCNNANLL NVFVRHIANN QLRSLTEVAT VAHQLKLLKD DFFASDQQAV AVADRYPQDV FAEHTHDFCE LVIVWRGNGL HVLNDRPYRI TRGDLFYIHA DDKHSYASVN DLVLQNIIYC PERLKLNLDW QGAIPGFSAS AGQPHWRLGS MGMAQARQII GQLEHESSQH VPFANEMAEL LFGQLVMSLN RHRYTSDSLP PTSSETLLDK LITRLAASLK SPFALDEFCD EASCSERVLR QQFRQQTGMT INQYLRQVRV CHAQYLLQHS RLLISDISTE CGFEDSNYFS VVFTRETGMT PSQWRHLNSQ KD
|
| |
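The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch of how the two sequences relate: it strips the spacing, computes GC content, and translates codons until the first stop. The helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code; table 11 differs in the start codons it permits, and this sketch does not model that.

```python
from itertools import product

# Amino acid assignments in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned, non-empty sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned DNA sequence in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
first_blocks = clean_sequence("ATGGCTTTCT GCAATAACGC GAATCTTCTC")
print(translate(first_blocks))  # MAFCNNANLL
```

The printed residues match the first block of the protein sequence. Applying `translate` and `gc_content` to the full cleaned gene sequence would be the natural way to cross-check the Protein Length and GC content fields.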