Gene ECH74115_1121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1121 
SymbolompA 
ID6969157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1151839 
End bp1152903 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content53% 
IMG OID643385126 
Productouter membrane protein A 
Protein accessionYP_002269625 
Protein GI209397392 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000116879 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.89417 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGATGATA ACGAGGCGCA AAAAATGAAA AAGACAGCTA TCGCGATTGC AGTGGCACTG 
GCTGGTTTCG CTACCGTAGC GCAGGCCGCT CCGAAAGATA ACACCTGGTA CACTGGTGCT
AAACTGGGCT GGTCCCAGTA CCATGACACT GGTTTCATCA ACAACAATGG CCCGACCCAT
GAAAACCAAC TGGGCGCTGG TGCTTTTGGT GGTTACCAGG TTAACCCGTA TGTTGGCTTT
GAAATGGGTT ACGACTGGTT AGGTCGTATG CCGTACAAAG GCAGCGTTGA AAACGGTGCA
TACAAAGCTC AGGGCGTTCA ACTGACCGCT AAACTGGGTT ACCCAATCAC TGACGACCTG
GACATCTACA CTCGTCTGGG TGGTATGGTA TGGCGTGCAG ACACTAAATC CAACGTTTAT
GGTAAAAACC ACGACACCGG CGTTTCTCCG GTCTTCGCTG GCGGTGTTGA GTACGCGATC
ACTCCTGAAA TCGCTACCCG TCTGGAATAC CAGTGGACCA ACAACATCGG TGACGCACAC
ACCATCGGCA CTCGTCCGGA CAACGGCATG CTGAGCCTGG GTGTTTCCTA CCGTTTCGGT
CAGGGCGAAG CAGCTCCAGT AGTTGCTCCG GCTCCAGCTC CGGCACCGGA AGTACAGACC
AAGCACTTCA CTCTGAAGTC TGACGTTCTG TTCAACTTCA ACAAAGCAAC CCTGAAACCG
GAAGGTCAGG CTGCTCTGGA TCAGCTGTAC AGCCAGCTGA GCAACCTGGA TCCGAAAGAC
GGTTCCGTAG TTGTTCTGGG TTACACCGAC CGCATCGGTT CTGACGCTTA CAACCAGGGT
CTGTCCGAGC GCCGTGCTCA GTCTGTTGTT GATTACCTGA TCTCCAAAGG TATCCCGGCA
GACAAGATCT CCGCACGTGG TATGGGCGAA TCCAACCCGG TTACTGGCAA CACCTGTGAC
AACGTGAAAC AGCGTGCTGC ACTGATCGAC TGCCTGGCTC CGGATCGTCG CGTAGAGATC
GAAGTTAAAG GTATCAAAGA CGTTGTAACT CAGCCGCAAG CTTAA
 
Protein sequence
MDDNEAQKMK KTAIAIAVAL AGFATVAQAA PKDNTWYTGA KLGWSQYHDT GFINNNGPTH 
ENQLGAGAFG GYQVNPYVGF EMGYDWLGRM PYKGSVENGA YKAQGVQLTA KLGYPITDDL
DIYTRLGGMV WRADTKSNVY GKNHDTGVSP VFAGGVEYAI TPEIATRLEY QWTNNIGDAH
TIGTRPDNGM LSLGVSYRFG QGEAAPVVAP APAPAPEVQT KHFTLKSDVL FNFNKATLKP
EGQAALDQLY SQLSNLDPKD GSVVVLGYTD RIGSDAYNQG LSERRAQSVV DYLISKGIPA
DKISARGMGE SNPVTGNTCD NVKQRAALID CLAPDRRVEI EVKGIKDVVT QPQA