Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1590 |
Symbol | rstB |
ID | 6144669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1578306 |
End bp | 1579607 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616467 |
Product | sensor protein RstB |
Protein accession | YP_001743645 |
Protein GI | 170682743 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00520331 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAC TGTTTATCCA GTTTTACCTG TTATTGTTTG TCTGCTTCCT TGTGATGTCT CTGCTGGTCG GGCTGGTGTA CAAATTTACC GCCGAACGCG CAGGCAAACA GTCGCTGGAT GATTTGATGA ACAGTTCGCT GTATCTGATG CGCAGCGAAT TGCGTGAGAT CCCCCCACAC GACTGGGGTA AGACGCTGAA AGAGATGGAT TTAAATCTCT CTTTCGATCT GCGTGTCGAG CCGCTGAGTA AATACCATCT TGATGATATT TCCATGCACC GGCTGCGTGG CGGCGAAATT GTCGCTCTGG ACGATCAGTA CACCTTTTTG CAACGTATCC CGCGCAGCCA CTACGTGCTG GCAGTTGGTC CTGTTCCTTA TCTTTATTAC CTCCATCAGA TGCGATTACT GGATATCGCC CTGATCGCTT TTATTGCTAT TTCCCTCGCC TTTCCGGTGT TTATCTGGAT GCGTCCGCAC TGGCAGGATA TGTTAAAACT GGAAGCAGCG GCGCAACGAT TTGGCGATGG GCATCTCAAT GAACGTATCC ACTTTGATGA GGGTTCGAGC TTTGAACGAC TTGGTGTTGC ATTTAACCAG ATGGCGGACA ATATCAACGC CTTGATTGCC AGCAAAAAAC AGCTTATTGA CGGTATCGCT CACGAACTGC GAACACCGTT AGTGCGCCTG CGTTATCGAC TGGAGATGAG CGATAACCTG AGCGCCGCCG AATCCCAGGC GTTGAACCGC GATATCAGTC AACTTGAAGC TTTAATTGAA GAGCTTCTGA CTTATGCCCG ACTCGATCGC CCACAAAATG AGCTTCATCT TAGCGAACCA GACCTGCCGT TGTGGCTGTC AACGCATCTG GCAGATATTC AGGCAGTAAC GCCCGATAAA ACGGTACGGA TAAAAACGCT CATGCAAGGC CATTATGCGG CGTTGGATAT GCGCTTAATG GAGCGCGTGC TGGATAATTT GCTCAATAAC GCCCTGCGCT ACTGTCATTC AACGGTTGAA ACCAGCCTGC TACTGTCGGG GAATAGAGCG ACATTAATTG TTGAGGATGA TGGCCCAGGG ATTGCCCCAG AAAACCGCGA ACATATCTTT GAACCTTTTG TTCGCCTCGA TCCCAGCCGG GATCGCTCAA CCGGCGGCTG CGGGCTGGGG CTGGCAATTG TCCACTCTAT AGCACTGGCA ATGGGCGGTA CGGTTAATTG TGACACCAGC GAACTGGGTG GTGCCCGCTT CTCGTTTAGC TGGCCGTTAT GGCATAACAT CCCGCAATTT ACCTCTGCCT GA
|
Protein sequence | MKKLFIQFYL LLFVCFLVMS LLVGLVYKFT AERAGKQSLD DLMNSSLYLM RSELREIPPH DWGKTLKEMD LNLSFDLRVE PLSKYHLDDI SMHRLRGGEI VALDDQYTFL QRIPRSHYVL AVGPVPYLYY LHQMRLLDIA LIAFIAISLA FPVFIWMRPH WQDMLKLEAA AQRFGDGHLN ERIHFDEGSS FERLGVAFNQ MADNINALIA SKKQLIDGIA HELRTPLVRL RYRLEMSDNL SAAESQALNR DISQLEALIE ELLTYARLDR PQNELHLSEP DLPLWLSTHL ADIQAVTPDK TVRIKTLMQG HYAALDMRLM ERVLDNLLNN ALRYCHSTVE TSLLLSGNRA TLIVEDDGPG IAPENREHIF EPFVRLDPSR DRSTGGCGLG LAIVHSIALA MGGTVNCDTS ELGGARFSFS WPLWHNIPQF TSA
|
| |