Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0060 |
Symbol | imp |
ID | 6969484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 59726 |
End bp | 62047 |
Gene Length | 2322 bp |
Protein Length | 773 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643384141 |
Product | organic solvent tolerance protein |
Protein accession | YP_002268664 |
Protein GI | 209398538 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1452] Organic solvent tolerance protein OstA |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00055163 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGCCA CCGCCCTTTA TAGTCAACAG GGACTGGCAG CCGACCTCGC CTCACAGTGC ATGTTGGGCG TGCCAAGCTA TGACCGTCCT CTGGTACAGG GCGATACCAA TGACTTACCC GTGACTATCA ATGCTAACCA CGCGAAAGGG GACTACCCGG ATGACGCCGT GTTTACTGGC AGCGTGGATA TCATGCAGGG TAACAGCCGT CTGCAGGCCG ACGAAGTGCA GCTCCATCAA AAAGAGGCAC CAGGACAACC GGAGCCGGTA CGTACCGTTG ATGCGCTCGG TAATGTCCAT TACGACGATA ACCAGGTGAT CCTCAAAGGG CCGAAAGGCT GGGCGAATCT GAACACCAAA GATACCAACG TCTGGGAAGG TGATTACCAG ATGGTGGGTC GCCAGGGTCG CGGTAAAGCG GACCTGATGA AACAACGTGG CGAAAACCGC TATACCATTC TGGATAACGG TAGCTTTACC TCCTGTCTGC CGGGTTCTGA CACCTGGAGC GTGGTAGGTA GCGAAATTAT TCATGACCGC GAAGAACAAG TTGCGGAGAT CTGGAACGCC CGCTTTAAGG TGGGTCCGGT ATCGATCTTT TATAGCCCCT ATTTGCAGTT GCCGGTGGGT GACAAACGTC GCTCTGGTTT CTTGATCCCG AACGCCAAGT ACACCACCAC CAACTACTTT GAGTTCTACC TGCCATATTA CTGGAACATC GCGCCAAATA TGGATGCCAC CATCACGCCG CATTATATGC ATCGTCGTGG CAACATCATG TGGGAGAACG AATTCCGCTA CCTCTCCCAG GCGGGCGCTG GCTTGATGGA ACTGGACTAT CTGCCTTCAG ATAAAGTCTA TGAAGATGAA CACCCGAACG ATGACAGTTC ACGTCGTTGG TTATTCTACT GGAACCACTC CGGGGTCATG GATCAGGTGT GGCGTTTCAA CGTCGACTAC ACCAAGGTCA GCGATCCTAG CTACTTCAAT GATTTCGATA ACAAGTACGG TTCCAGTACT GACGGCTACG CAACGCAAAA ATTCAGCGTT GGCTATGCGG TGCAAAACTT CAATGCCACC GTTTCAACCA AGCAGTTCCA GGTTTTCAGC GAACAGAACA CCAGTAGCTA CTCGGCAGAG CCGCAGTTAG ACGTTAATTA CTACCAGAAT GATGTTGGTC CGTTTGATAC GCGTATTTAC GGCCAGGCAG TGCACTTTGT TAACACCAGA GACGACATGC CTGAAGCAAC CCGTGTTCAC CTGGAACCGA CCATCAATTT GCCGCTCTCT AATAACTGGG GCAGCATCAA TACCGAAGCG AAGTTGCTGG CAACCCATTA TCAGCAAACC AATCTTGACT GGTATAACTC CAGAAACACG ACCAAGCTGG ACGAATCCGT TAACCGCGTA ATGCCGCAAT TCAAAGTTGA CGGCAAAATG GTCTTTGAAC GCGATATGGA AATGCTGGCT CCGGGTTATA CCCAAACGCT GGAACCGCGC GCGCAGTATT TGTACGTGCC GTATCGCGAT CAGAGCGACA TCTATAACTA CGACTCGTCT CTGCTGCAAT CTGACTACTC TGGCCTGTTC CGGGACCGGA CTTACGGCGG TCTTGACCGT ATTGCCTCCG CTAACCAGGT GACGACCGGT GTCACATCTC GCATATATGA TGATGCTGCC GTTGAACGTT TTAATATTTC CGTTGGTCAA ATCTACTATT TCACGGAGTC TCGCACTGGC GATGACAACA TAACATGGGA GAATGACGAC AAAACGGGTT CACTGGTGTG GGCAGGCGAT ACTTACTGGC GTATCTCCGA GCGTTGGGGA TTGCGTGGCG GGATTCAGTA CGATACACGT CTGGATAACG TAGCGACCAG TAACTCCAGC ATTGAATACC GTCGGGATGA AGACCGTCTG GTACAGCTGA ATTACCGTTA CGCCAGCCCG GAATATATTC AGGCTACGCT GCCTAAGTAC TATTCCACTG CTGAGCAATA TAAGAATGGT ATTTCGCAGG TAGGTGCTGT CGCCAGCTGG CCAATTGCCG ATCGTTGGTC CATTGTTGGG GCCTACTACT ACGACACCAA TGCGAACAAG CAAGCCGACT CTATGTTAGG TGTGCAATAC AGCTCCTGCT GCTATGCAAT TCGCGTCGGT TACGAGCGGA AGCTGAACGG TTGGGATAAC GATAAACAAC ATGCGGTATA TGACAACGCA ATCGGCTTTA ACATCGAACT TCGCGGCCTG AGCTCCAACT ACGGTCTGGG TACGCAAGAG ATGCTGCGTT CGAACATTCT GCCGTATCAA AACACTTTGT GA
|
Protein sequence | MIATALYSQQ GLAADLASQC MLGVPSYDRP LVQGDTNDLP VTINANHAKG DYPDDAVFTG SVDIMQGNSR LQADEVQLHQ KEAPGQPEPV RTVDALGNVH YDDNQVILKG PKGWANLNTK DTNVWEGDYQ MVGRQGRGKA DLMKQRGENR YTILDNGSFT SCLPGSDTWS VVGSEIIHDR EEQVAEIWNA RFKVGPVSIF YSPYLQLPVG DKRRSGFLIP NAKYTTTNYF EFYLPYYWNI APNMDATITP HYMHRRGNIM WENEFRYLSQ AGAGLMELDY LPSDKVYEDE HPNDDSSRRW LFYWNHSGVM DQVWRFNVDY TKVSDPSYFN DFDNKYGSST DGYATQKFSV GYAVQNFNAT VSTKQFQVFS EQNTSSYSAE PQLDVNYYQN DVGPFDTRIY GQAVHFVNTR DDMPEATRVH LEPTINLPLS NNWGSINTEA KLLATHYQQT NLDWYNSRNT TKLDESVNRV MPQFKVDGKM VFERDMEMLA PGYTQTLEPR AQYLYVPYRD QSDIYNYDSS LLQSDYSGLF RDRTYGGLDR IASANQVTTG VTSRIYDDAA VERFNISVGQ IYYFTESRTG DDNITWENDD KTGSLVWAGD TYWRISERWG LRGGIQYDTR LDNVATSNSS IEYRRDEDRL VQLNYRYASP EYIQATLPKY YSTAEQYKNG ISQVGAVASW PIADRWSIVG AYYYDTNANK QADSMLGVQY SSCCYAIRVG YERKLNGWDN DKQHAVYDNA IGFNIELRGL SSNYGLGTQE MLRSNILPYQ NTL
|
| |