Gene ECH74115_0286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0286 
SymbolphoE 
ID6971197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp298717 
End bp299772 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content45% 
IMG OID643384352 
Productouter membrane phosphoporin protein E 
Protein accessionYP_002268868 
Protein GI209399294 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3203] Outer membrane protein (porin) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.172379 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.530203 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA GCACTCTGGC ATTAGTGGTG ATGGGCATTG TGGCATCTGC ATCCGTACAG 
GCCGCAGAAA TATATAACAA AGACGGTAAT AAACTGGATG TCTATGGCAA AGTTAAAGCC
ATGCATTATA TGAGTGATAA CGACAGTAAA GATGGCGACC AGAGTTATAT CCGTTTTGGT
TTTAAAGGCG AAACACAAAT TAACGATCAA CTGACTGGTT ATGGTCGTTG GGAAGCGGAG
TTTGCCGGAA ATAAAGCGGA GAGTGATACT GCACAGCAAA AAACGCGTCT CGCTTTTGCC
GGATTGAAGT ATAAAGATTT GGGTTCTTTC GACTATGGCC GTAACCTGGG CGCGTTGTAT
GACGTGGAAG CCTGGACCGA TATGTTCCCG GAATTTGGTG GCGACTCCTC GGCGCAGACC
GACAACTTTA TGACCAAACG CGCCAGCGGT CTGGCGACGT ATCGGAACAC CGACTTCTTC
GGCGTTATCG ATGGCCTGAA CTTAACCCTG CAATATCAAG GGAAAAACGA AAACCGCGAC
GTTAAAAAGC AAAACGGCGA TGGCTTCGGC ACGTCATTGA CATATGACTT TGGCGGCAGC
GATTTCGCCA TTAGTGGGGC CTATACCAAC TCAGATCGCA CCAACGAGCA GAACCTGCAA
AGCCGTGGCA CAGGCAAGCG TGCAGAAGCT TGGGCTACAG GTCTGAAATA CGATGCCAAT
AATATTTATC TGGCAACTTT TTATTCTGAA ACACGCAAAA TGACGCCAAT AACTGGCGGC
TTTGCCAATA AGACACAGAA CTTTGAAGCG GTCGCTCAAT ACCAGTTTGA CTTTGGTCTG
CGTCCATCGC TGGGTTATGT CTTATCGAAA GGGAAAGATA TTGAAGGTAT CGGTGATGAA
GATCTGGTCA ATTATATCGA TGTCGGGGCT ACATATTATT TCAACAAAAA TATGTCAGCG
TTTGTTGATT ATAAAATCAA CCAACTGGAT AGCGATAACA AATTGAATAT TAATAATGAT
GATATTGTCG CGGTTGGCAT GACCTATCAG TTTTAA
 
Protein sequence
MKKSTLALVV MGIVASASVQ AAEIYNKDGN KLDVYGKVKA MHYMSDNDSK DGDQSYIRFG 
FKGETQINDQ LTGYGRWEAE FAGNKAESDT AQQKTRLAFA GLKYKDLGSF DYGRNLGALY
DVEAWTDMFP EFGGDSSAQT DNFMTKRASG LATYRNTDFF GVIDGLNLTL QYQGKNENRD
VKKQNGDGFG TSLTYDFGGS DFAISGAYTN SDRTNEQNLQ SRGTGKRAEA WATGLKYDAN
NIYLATFYSE TRKMTPITGG FANKTQNFEA VAQYQFDFGL RPSLGYVLSK GKDIEGIGDE
DLVNYIDVGA TYYFNKNMSA FVDYKINQLD SDNKLNINND DIVAVGMTYQ F