Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1358 |
Symbol | |
ID | 6146132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1347517 |
End bp | 1348398 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616236 |
Product | heat shock protein HtpX |
Protein accession | YP_001743416 |
Protein GI | 170682375 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0501] Zn-dependent protease with chaperone function |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00135277 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.0441345 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCGAA TCGCGCTCTT CCTGCTAACG AACCTGGCCG TAATGGTCGT TTTCGGGCTG GTACTGAGCC TGACAGGGAT ACAGTCGAGC AGCGTTCAGG GGCTGATGAT CATGGCCTTG CTGTTCGGTT TTGGTGGTTC CTTCGTTTCG CTTCTGATGT CCAAATGGAT GGCATTACGA TCTGTTGGCG GGGAAGTGAT CGAGCAACCG CGTAACGAAA GGGAACGTTG GCTGGTTAAT ACTGTAGCAA CCCAGGCTCG TCAGGCGGGG ATCGCTATGC CGCAAGTGGC TATCTACCAT GCGCCGGACA TCAATGCTTT TGCAACCGGT GCGCGCCGTG ATGCTTCTCT GGTTGCTGTC AGCACTGGTT TGCTGCAGAA CATGAGCCCG GATGAAGCCG AGGCGGTAAT TGCTCACGAA ATCAGCCACA TCGCCAATGG TGATATGGTC ACCATGACGC TGATTCAGGG CGTGGTGAAC ACCTTCGTTA TCTTTATTTC CCGTATTCTG GCGCAACTTG CCGCGGGTTT TATGGGCGGA AATCGTGATG AAGGTGAAGA GAGCAACGGC AACCCGCTGA TCTACTTTGC GGTTGCAACG GTTCTGGAAC TGGTGTTTGG TATTCTGGCG AGCATTATCA CCATGTGGTT CTCGCGTCAT CGTGAATTCC ACGCCGATGC GGGTTCAGCA AAACTGGTTG GTCGCGAGAA AATGATTGCT GCATTGCAAC GCCTGAAAAC CAGCTATGAA CCGCAAGAAG CAACCAGCAT GATGGCTTTC TGCATTAACG GGAAGTCGAA ATCGCTCAGT GAGTTGTTCA TGACTCACCC GCCACTGGAT AAACGTATTG AAGCTCTGCG TACGGGTGAA TACCTGAAGT AA
|
Protein sequence | MMRIALFLLT NLAVMVVFGL VLSLTGIQSS SVQGLMIMAL LFGFGGSFVS LLMSKWMALR SVGGEVIEQP RNERERWLVN TVATQARQAG IAMPQVAIYH APDINAFATG ARRDASLVAV STGLLQNMSP DEAEAVIAHE ISHIANGDMV TMTLIQGVVN TFVIFISRIL AQLAAGFMGG NRDEGEESNG NPLIYFAVAT VLELVFGILA SIITMWFSRH REFHADAGSA KLVGREKMIA ALQRLKTSYE PQEATSMMAF CINGKSKSLS ELFMTHPPLD KRIEALRTGE YLK
|
| |