Gene ECH74115_3353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3353 
SymbolompC 
ID6970891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3085995 
End bp3087098 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content51% 
IMG OID643387165 
Productouter membrane porin protein C 
Protein accessionYP_002271628 
Protein GI209400412 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3203] Outer membrane protein (porin) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.128498 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTTA AAGTACTGTC CCTCCTGGTC CCAGCTCTGC TGGTAGCAGG CGCAGCAAAC 
GCTGCTGAAG TTTACAACAA AGACGGCAAC AAATTAGATC TGTACGGTAA AGTAGACGGC
CTGCACTATT TCTCTGACGA CAAGTCTGTA GATGGCGACC AGACCTACAT GCGTCTTGGC
TTCAAAGGTG AAACTCAGGT TACTGACCAG CTGACCGGTT ACGGCCAGTG GGAATATCAG
ATCCAGGGCA ACAGCGCTGA AAACGAAAAC AACTCCTGGA CCCGTGTGGC ATTCGCAGGT
CTGAAATTCC AGGATGTAGG TTCTTTCGAC TACGGTCGTA ACTACGGCGT AGTTTACGAC
GTAACTTCCT GGACCGACGT TCTGCCAGAA TTCGGTGGTG ACACCTACGG TTCTGACAAC
TTCATGCAGC AGCGTGGTAA CGGTTTCGCG ACCTACCGTA ACACTGACTT CTTCGGTCTG
GTTGACGGCC TGAACTTTGC TGTTCAGTAC CAGGGCAAAA ACGGTAGCGT AAGCGGCGAA
GGCATGACTA ACAACGGTCG TGAAGCACTG CGTCAGAACG GCGACGGCGT TGGCGGATCT
ATCACTTATG ATTACGAAGG CTTCGGTATC GGTGCTGCAG TTTCCAGCTC CAAACGTACT
GATGATCAGA ACAGCCCGCT GTACATCGGT AACGGCGACC GTGCTGAAAC CTACACTGGT
GGTCTGAAAT ACGACGCTAA CAACATCTAT CTGGCTGCTC AGTACACCCA GACCTACAAC
GCAACTCGCG TAGGTTCCCT GGGTTGGGCG AACAAAGCAC AGAACTTCGA AGCTGTTGCT
CAGTACCAGT TCGACTTCGG TCTGCGTCCG TCCCTGGCTT ACCTGCAGTC TAAAGGTAAA
AACCTGGGTG TCATCAATGG TCGTAACTAC GACGACGAAG ATATCCTGAA ATATGTTGAT
GTTGGCGCGA CCTACTACTT CAACAAAAAC ATGTCCACCT ATGTTGACTA CAAAATCAAC
CTGCTGGACG ACAACCAGTT CACTCGTGAC GCTGGCATCA ACACTGATAA CATCGTAGCT
CTGGGTCTGG TTTACCAGTT CTAA
 
Protein sequence
MKVKVLSLLV PALLVAGAAN AAEVYNKDGN KLDLYGKVDG LHYFSDDKSV DGDQTYMRLG 
FKGETQVTDQ LTGYGQWEYQ IQGNSAENEN NSWTRVAFAG LKFQDVGSFD YGRNYGVVYD
VTSWTDVLPE FGGDTYGSDN FMQQRGNGFA TYRNTDFFGL VDGLNFAVQY QGKNGSVSGE
GMTNNGREAL RQNGDGVGGS ITYDYEGFGI GAAVSSSKRT DDQNSPLYIG NGDRAETYTG
GLKYDANNIY LAAQYTQTYN ATRVGSLGWA NKAQNFEAVA QYQFDFGLRP SLAYLQSKGK
NLGVINGRNY DDEDILKYVD VGATYYFNKN MSTYVDYKIN LLDDNQFTRD AGINTDNIVA
LGLVYQF