Gene Information |
Locus tag | EcSMS35_3685 |
Symbol | envZ |
ID | 6145130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3742121 |
End bp | 3743473 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641618512 |
Product | osmolarity sensor protein |
Protein accession | YP_001745652 |
Protein GI | 170681185 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
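The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 3742121
END_BP = 3743473
GENE_LENGTH_BP = 1353
PROTEIN_LENGTH_AA = 450


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1353
    print(expected_protein_length(GENE_LENGTH_BP))  # 450
```

Under those assumptions the fields agree: 3743473 - 3742121 + 1 = 1353 bp, and 1353 / 3 - 1 = 450 aa.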
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.596737 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCGAT TGCGCTTCTC GCCACGAAGT TCATTTGCCC GCACGTTATT GCTCATCGTC ACCTTGCTGT TCGTCAGCCT GGTGACGACT TATCTGGTGG TGCTGAACTT CGCGATTTTG CCGAGCCTCC AGCAGTTTAA TAAAGTCCTC GCGTACGAAG TGCGTATGTT GATGACCGAC AAACTGCAAC TGGAGGACGG CACGCAGTTG GTTGTGCCTC CCGCTTTCCG TCGGGAGATC TACCGTGAGC TGGGGATCTC TCTCTATTCC AACGAAGCTG CCGAAGAGGC AGGTCTGCGT TGGGCACAAC ACTATGAATT CTTAAGCCAT CAGATGGCGC AGCAACTGGG CGGCCCGACG GAAGTGCGCG TCGAGGTCAA CAAAAGTTCG CCTGTCGTCT GGCTGAAAAC CTGGTTGTCG CCCAATATCT GGGTACGCGT GCCGCTAACC GAAATTCATC AGGGCGATTT CTCCCCGCTG TTCCGCTATA CGTTGGCGAT TATGCTATTG GCGATAGGCG GGGCGTGGCT GTTTATTCGT ATCCAGAACC GACCGTTGGT CGATCTCGAA CACGCAGCCT TGCAGGTGGG TAAAGGGATT ATTCCGCCGC CGCTGCGTGA ATATGGCGCT TCGGAGGTGC GTTCCGTTAC CCGCGCCTTT AACCATATGG CGGCTGGTGT TAAGCAACTG GCGGATGACC GTACGCTGCT GATGGCGGGG GTAAGCCATG ATCTGCGCAC GCCGCTGACG CGTATTCGTC TGGCGACCGA GATGATGAGC GAGCAGGATG GCTACCTGGC AGAATCGATC AATAAAGATA TCGAAGAGTG CAACGCCATC ATTGAGCAGT TTATCGACTA CCTGCGCACC GGGCAGGAGA TGCCGATGGA AATGGCGGAT CTCAACGCCG TGCTCGGTGA GGTGATTGCC GCCGAAAGTG GCTATGAGCG GGAAATTGAA ACCGCGCTTT ACCCCGGCAG CATTGAGGTG AAAATGCACC CGCTGTCGAT CAAACGCGCG GTGGCGAATA TGGTGGTCAA CGCCGCCCGT TACGGCAATG GCTGGATCAA AGTCAGCAGC GGCACGGAGC CGAATCGCGC CTGGTTCCAG GTAGAAGATG ACGGTCCGGG TATTGCGCCG GAACAACGTA AGCACCTGTT CCAGCCGTTT GTTCGTGGCG ACAGTGCGCG CACTATTAGC GGCACGGGAT TAGGGCTGGC GATTGTGCAG CGTATCGTGG ATAACCATAA CGGCATGTTG GAGCTTGGCA CCAGCGAGCG GGGCGGGCTT TCCATTCGTG CCTGGCTGCC AGTGCCGGTA ACGCGGGCGC AGGGCATGAC AAAAGAAGGG TAA
|
Protein sequence | MRRLRFSPRS SFARTLLLIV TLLFVSLVTT YLVVLNFAIL PSLQQFNKVL AYEVRMLMTD KLQLEDGTQL VVPPAFRREI YRELGISLYS NEAAEEAGLR WAQHYEFLSH QMAQQLGGPT EVRVEVNKSS PVVWLKTWLS PNIWVRVPLT EIHQGDFSPL FRYTLAIMLL AIGGAWLFIR IQNRPLVDLE HAALQVGKGI IPPPLREYGA SEVRSVTRAF NHMAAGVKQL ADDRTLLMAG VSHDLRTPLT RIRLATEMMS EQDGYLAESI NKDIEECNAI IEQFIDYLRT GQEMPMEMAD LNAVLGEVIA AESGYEREIE TALYPGSIEV KMHPLSIKRA VANMVVNAAR YGNGWIKVSS GTEPNRAWFQ VEDDGPGIAP EQRKHLFQPF VRGDSARTIS GTGLGLAIVQ RIVDNHNGML ELGTSERGGL SIRAWLPVPV TRAQGMTKEG
|
| |
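The gene sequence is displayed in space-separated blocks of 10 bases and, although the gene lies on the minus strand, it is shown in coding orientation (it begins with ATG). A minimal sketch of how the displayed text could be cleaned, its GC content computed for comparison with the listed 57%, and translated is given below. The helper names are hypothetical. The codon table is the standard one, whose codon-to-amino-acid assignments are shared by translation table 11; table 11 differs only in the start codons it permits, which this sketch does not model.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence field above.
    fragment = clean_sequence("ATGAGGCGAT TGCGCTTCTC GCCACGAAGT")
    print(translate(fragment))  # MRRLRFSPRS
```

The fragment translates to MRRLRFSPRS, which matches the first block of the protein sequence field. Passing the full gene sequence text through `clean_sequence` and then `gc_percent` or `translate` would allow the GC content and protein sequence fields to be checked in the same way.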