Gene ECH74115_4349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4349 
SymboltolC 
ID6970005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4027292 
End bp4028773 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content52% 
IMG OID643388076 
Productouter membrane channel protein 
Protein accessionYP_002272514 
Protein GI209400811 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1538] Outer membrane protein 
TIGRFAM ID[TIGR01844] type I secretion outer membrane protein, TolC family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.823303 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAT TGCTCCCCAT TCTTATCGGC CTGAGCCTTT CTGGGTTCAG TTCGTTGAGC 
CAGGCCGAGA ACCTGATGCA AGTTTATCAG CAAGCACGCC TTAGTAACCC GGAATTGCGT
AAGTCTGCCG CCGATCGTGA TGCTGCCTTT GAAAAAATTA ATGAAGCGCG CAGTCCATTA
CTGCCACAGC TAGGTTTAGG TGCAGATTAC ACCTATAGCA ACGGCTACCG CGACGCGAAC
GGCATCAACT CTAACGCGAC CAGTGCGTCC CTGCAGTTAA CTCAATCCAT TTTTGATATG
TCGAAATGGC GTGCGTTAAC GCTGCAGGAA AAAGCAGCAG GGATTCAGGA CGTCACGTAT
CAGACCGATC AGCAAACCTT GATCCTCAAC ACCGCGACCG CTTATTTCAA CGTGTTGAAT
GCTATTGACG TTCTTTCCTA TACACAGGCG CAAAAAGAAG CGATCTACCG TCAATTAGAT
CAAACCACCC AACGTTTTAA CGTGGGCCTG GTAGCGATCA CCGACGTGCA GAACGCCCGC
GCGCAGTACG ATACCGTGCT GGCGAACGAA GTGACCGCAC GTAATAACCT TGATAACGCG
GTAGAGCAGC TGCGCCAGAT CACCGGTAAC TACTATCCGG AACTGGCGGC GCTGAATGTC
GAAAACTTTA AAACCGACAA ACCACAGCCG GTTAACGCGC TGCTGAAAGA AGCCGAAAAA
CGCAACCTGT CGCTGTTACA GGCACGCTTG AGCCAGGACC TGGCGCGCGA GCAAATTCGC
CAGGCGCAGG ATGGTCACTT ACCGACTCTG GATTTAACGG CTTCTAGCGG GATTTCTGAC
ACCTCTTATA GCGGTTCGAA AACCCGTGGT GCCGCTGGTA CCCAGTATGA CGATAGCAAT
ATGGGCCAGA ACAAAGTTGG CCTGAGCTTC TCGCTGCCGA TTTATCAGGG CGGAATGGTT
AACTCGCAGG TGAAACAGGC ACAGTACAAC TTTGTTGGTG CCAGCGAGCA ACTGGAAAGC
GCGCATCGTA GCGTCGTGCA GACCGTACGT TCCTCTTTCA ACAACATTAA TGCTTCTATC
AGTAGTATTA ACGCCTACAA ACAAGCCGTA GTTTCCGCTC AAAGCTCATT AGACGCGATG
GAAGCGGGCT ACTCGGTCGG TACGCGTACC ATTGTTGATG TGTTGGATGC GACCACCACG
CTGTACAACG CCAAGCAAGA GCTGGCGAAT GCGCGTTATA ACTACCTGAT TAATCAGCTG
AATATTAAGT CAGCCCTGGG TACGTTGAAC GAGCAGGATC TGCTGGCACT GAACAATGCG
CTGAGCAAAC CGGTTTCCAC TAATCCGGAA AACGTTGCCC CGCAAACGCC GGAACAGAAT
GCTATTGCTG ATGGTTATGC GCCTGATAGC CCGGCACCCG TCGTTCAGCA AACATCCGCA
CGCACTACCA CCAGTAACGG TCATAACCCT TTCCGTAACT GA
 
Protein sequence
MKKLLPILIG LSLSGFSSLS QAENLMQVYQ QARLSNPELR KSAADRDAAF EKINEARSPL 
LPQLGLGADY TYSNGYRDAN GINSNATSAS LQLTQSIFDM SKWRALTLQE KAAGIQDVTY
QTDQQTLILN TATAYFNVLN AIDVLSYTQA QKEAIYRQLD QTTQRFNVGL VAITDVQNAR
AQYDTVLANE VTARNNLDNA VEQLRQITGN YYPELAALNV ENFKTDKPQP VNALLKEAEK
RNLSLLQARL SQDLAREQIR QAQDGHLPTL DLTASSGISD TSYSGSKTRG AAGTQYDDSN
MGQNKVGLSF SLPIYQGGMV NSQVKQAQYN FVGASEQLES AHRSVVQTVR SSFNNINASI
SSINAYKQAV VSAQSSLDAM EAGYSVGTRT IVDVLDATTT LYNAKQELAN ARYNYLINQL
NIKSALGTLN EQDLLALNNA LSKPVSTNPE NVAPQTPEQN AIADGYAPDS PAPVVQQTSA
RTTTSNGHNP FRN