Gene ECH74115_0060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0060 
Symbolimp 
ID6969484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp59726 
End bp62047 
Gene Length2322 bp 
Protein Length773 aa 
Translation table11 
GC content51% 
IMG OID643384141 
Productorganic solvent tolerance protein 
Protein accessionYP_002268664 
Protein GI209398538 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1452] Organic solvent tolerance protein OstA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00055163 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGCCA CCGCCCTTTA TAGTCAACAG GGACTGGCAG CCGACCTCGC CTCACAGTGC 
ATGTTGGGCG TGCCAAGCTA TGACCGTCCT CTGGTACAGG GCGATACCAA TGACTTACCC
GTGACTATCA ATGCTAACCA CGCGAAAGGG GACTACCCGG ATGACGCCGT GTTTACTGGC
AGCGTGGATA TCATGCAGGG TAACAGCCGT CTGCAGGCCG ACGAAGTGCA GCTCCATCAA
AAAGAGGCAC CAGGACAACC GGAGCCGGTA CGTACCGTTG ATGCGCTCGG TAATGTCCAT
TACGACGATA ACCAGGTGAT CCTCAAAGGG CCGAAAGGCT GGGCGAATCT GAACACCAAA
GATACCAACG TCTGGGAAGG TGATTACCAG ATGGTGGGTC GCCAGGGTCG CGGTAAAGCG
GACCTGATGA AACAACGTGG CGAAAACCGC TATACCATTC TGGATAACGG TAGCTTTACC
TCCTGTCTGC CGGGTTCTGA CACCTGGAGC GTGGTAGGTA GCGAAATTAT TCATGACCGC
GAAGAACAAG TTGCGGAGAT CTGGAACGCC CGCTTTAAGG TGGGTCCGGT ATCGATCTTT
TATAGCCCCT ATTTGCAGTT GCCGGTGGGT GACAAACGTC GCTCTGGTTT CTTGATCCCG
AACGCCAAGT ACACCACCAC CAACTACTTT GAGTTCTACC TGCCATATTA CTGGAACATC
GCGCCAAATA TGGATGCCAC CATCACGCCG CATTATATGC ATCGTCGTGG CAACATCATG
TGGGAGAACG AATTCCGCTA CCTCTCCCAG GCGGGCGCTG GCTTGATGGA ACTGGACTAT
CTGCCTTCAG ATAAAGTCTA TGAAGATGAA CACCCGAACG ATGACAGTTC ACGTCGTTGG
TTATTCTACT GGAACCACTC CGGGGTCATG GATCAGGTGT GGCGTTTCAA CGTCGACTAC
ACCAAGGTCA GCGATCCTAG CTACTTCAAT GATTTCGATA ACAAGTACGG TTCCAGTACT
GACGGCTACG CAACGCAAAA ATTCAGCGTT GGCTATGCGG TGCAAAACTT CAATGCCACC
GTTTCAACCA AGCAGTTCCA GGTTTTCAGC GAACAGAACA CCAGTAGCTA CTCGGCAGAG
CCGCAGTTAG ACGTTAATTA CTACCAGAAT GATGTTGGTC CGTTTGATAC GCGTATTTAC
GGCCAGGCAG TGCACTTTGT TAACACCAGA GACGACATGC CTGAAGCAAC CCGTGTTCAC
CTGGAACCGA CCATCAATTT GCCGCTCTCT AATAACTGGG GCAGCATCAA TACCGAAGCG
AAGTTGCTGG CAACCCATTA TCAGCAAACC AATCTTGACT GGTATAACTC CAGAAACACG
ACCAAGCTGG ACGAATCCGT TAACCGCGTA ATGCCGCAAT TCAAAGTTGA CGGCAAAATG
GTCTTTGAAC GCGATATGGA AATGCTGGCT CCGGGTTATA CCCAAACGCT GGAACCGCGC
GCGCAGTATT TGTACGTGCC GTATCGCGAT CAGAGCGACA TCTATAACTA CGACTCGTCT
CTGCTGCAAT CTGACTACTC TGGCCTGTTC CGGGACCGGA CTTACGGCGG TCTTGACCGT
ATTGCCTCCG CTAACCAGGT GACGACCGGT GTCACATCTC GCATATATGA TGATGCTGCC
GTTGAACGTT TTAATATTTC CGTTGGTCAA ATCTACTATT TCACGGAGTC TCGCACTGGC
GATGACAACA TAACATGGGA GAATGACGAC AAAACGGGTT CACTGGTGTG GGCAGGCGAT
ACTTACTGGC GTATCTCCGA GCGTTGGGGA TTGCGTGGCG GGATTCAGTA CGATACACGT
CTGGATAACG TAGCGACCAG TAACTCCAGC ATTGAATACC GTCGGGATGA AGACCGTCTG
GTACAGCTGA ATTACCGTTA CGCCAGCCCG GAATATATTC AGGCTACGCT GCCTAAGTAC
TATTCCACTG CTGAGCAATA TAAGAATGGT ATTTCGCAGG TAGGTGCTGT CGCCAGCTGG
CCAATTGCCG ATCGTTGGTC CATTGTTGGG GCCTACTACT ACGACACCAA TGCGAACAAG
CAAGCCGACT CTATGTTAGG TGTGCAATAC AGCTCCTGCT GCTATGCAAT TCGCGTCGGT
TACGAGCGGA AGCTGAACGG TTGGGATAAC GATAAACAAC ATGCGGTATA TGACAACGCA
ATCGGCTTTA ACATCGAACT TCGCGGCCTG AGCTCCAACT ACGGTCTGGG TACGCAAGAG
ATGCTGCGTT CGAACATTCT GCCGTATCAA AACACTTTGT GA
 
Protein sequence
MIATALYSQQ GLAADLASQC MLGVPSYDRP LVQGDTNDLP VTINANHAKG DYPDDAVFTG 
SVDIMQGNSR LQADEVQLHQ KEAPGQPEPV RTVDALGNVH YDDNQVILKG PKGWANLNTK
DTNVWEGDYQ MVGRQGRGKA DLMKQRGENR YTILDNGSFT SCLPGSDTWS VVGSEIIHDR
EEQVAEIWNA RFKVGPVSIF YSPYLQLPVG DKRRSGFLIP NAKYTTTNYF EFYLPYYWNI
APNMDATITP HYMHRRGNIM WENEFRYLSQ AGAGLMELDY LPSDKVYEDE HPNDDSSRRW
LFYWNHSGVM DQVWRFNVDY TKVSDPSYFN DFDNKYGSST DGYATQKFSV GYAVQNFNAT
VSTKQFQVFS EQNTSSYSAE PQLDVNYYQN DVGPFDTRIY GQAVHFVNTR DDMPEATRVH
LEPTINLPLS NNWGSINTEA KLLATHYQQT NLDWYNSRNT TKLDESVNRV MPQFKVDGKM
VFERDMEMLA PGYTQTLEPR AQYLYVPYRD QSDIYNYDSS LLQSDYSGLF RDRTYGGLDR
IASANQVTTG VTSRIYDDAA VERFNISVGQ IYYFTESRTG DDNITWENDD KTGSLVWAGD
TYWRISERWG LRGGIQYDTR LDNVATSNSS IEYRRDEDRL VQLNYRYASP EYIQATLPKY
YSTAEQYKNG ISQVGAVASW PIADRWSIVG AYYYDTNANK QADSMLGVQY SSCCYAIRVG
YERKLNGWDN DKQHAVYDNA IGFNIELRGL SSNYGLGTQE MLRSNILPYQ NTL