Gene ECH74115_2083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2083 
Symbol 
ID6966731 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1980951 
End bp1982051 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content44% 
IMG OID643385986 
Productgram-negative porin family protein 
Protein accessionYP_002270475 
Protein GI209396427 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3203] Outer membrane protein (porin) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTAA AAATAGTTGC GGTGGTTGTA ACTGGTTTGT TAGCTGCGAA CGTAGCACAC 
GCTGCCGAAG TCTATAACAA GGATGGTAAT AAACTCGACC TTTATGGCAA GGTTACCGCT
CTACGTTATT TTACTGATGA TAAGCGTGAC GATGGTGATA AAACTTATGC CCGTCTCGGC
TTTAAAGGAG AAACGCAAAT CAATGATCAA ATGATTGGTT TTGGTCACTG GGAATATGAT
TTTAAAGGCT ATAACGATGA AGCCAACGGC TCGCGCGACA ACAAGACCCG TCTGGCCTAT
GCTGGTTTAA AAATTAGTGA ATTTGGCTCT CTGGACTATG GCCGTAACTA CGGTGTCGGC
TATGACATTG GTTCATGGAC TGATATGTTG CCAGAATTTG GTGGCGATAC CTGGAGTCAG
AAAGATGTCT TCATGACATA CCGTACCACC GGTGTAGCAA CCTATCGCAA CTACGATTTC
TTTGGCTTAA TTGAAGGGCT GAACTTTGCC GCGCAATATC AAGGCAAAAA TGAACGTACT
GACAACAGTC ATCTTTATGG TGCTGACTAC ACGCGTGCCA ACGGTGACGG TTTCGGTATC
TCCTCAACTT ATGTTTATGA TGGCTTTGGT ATCGGAGCGG TGTATACCAA ATCCGATCGG
ACAAATGCGC AGGAAAGAGC CGCTGCTAAT CCTCTCAATG CCTCCGGTAA GAATGCAGAA
CTGTGGGCTA CAGGTATAAA ATATGATGCC AACAACATCT ACTTTGCAGC TAATTACGCT
GAAACATTAA ACATGACCAC CTATGGCGAT GGTTATATTT CTAACAAAGC ACAAAGTTTT
GAAGTGGTGG CGCAATATCA ATTCGACTTC GGCTTGCGCC CCTCACTCGC TTACCTGAAA
TCGAAAGGCA TAGATCTGGG CCGCTACGGC GATCAGGACA TGATTGAGTA TATCGACGTT
GGTGCGACGT ATTTCTTCAA CAAAAATATG TCGACCTATG TTGATTATAA AATCAACCTG
ATTGATGAAA GCGACTTTAC CCGTGCCGTA GATATTCGCA CCGATAACAT CGTCGCTACG
GGCATTACCT ATCAGTTCTA A
 
Protein sequence
MKLKIVAVVV TGLLAANVAH AAEVYNKDGN KLDLYGKVTA LRYFTDDKRD DGDKTYARLG 
FKGETQINDQ MIGFGHWEYD FKGYNDEANG SRDNKTRLAY AGLKISEFGS LDYGRNYGVG
YDIGSWTDML PEFGGDTWSQ KDVFMTYRTT GVATYRNYDF FGLIEGLNFA AQYQGKNERT
DNSHLYGADY TRANGDGFGI SSTYVYDGFG IGAVYTKSDR TNAQERAAAN PLNASGKNAE
LWATGIKYDA NNIYFAANYA ETLNMTTYGD GYISNKAQSF EVVAQYQFDF GLRPSLAYLK
SKGIDLGRYG DQDMIEYIDV GATYFFNKNM STYVDYKINL IDESDFTRAV DIRTDNIVAT
GITYQF