Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4349 |
Symbol | tolC |
ID | 6970005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4027292 |
End bp | 4028773 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643388076 |
Product | outer membrane channel protein |
Protein accession | YP_002272514 |
Protein GI | 209400811 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1538] Outer membrane protein |
TIGRFAM ID | [TIGR01844] type I secretion outer membrane protein, TolC family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.823303 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAT TGCTCCCCAT TCTTATCGGC CTGAGCCTTT CTGGGTTCAG TTCGTTGAGC CAGGCCGAGA ACCTGATGCA AGTTTATCAG CAAGCACGCC TTAGTAACCC GGAATTGCGT AAGTCTGCCG CCGATCGTGA TGCTGCCTTT GAAAAAATTA ATGAAGCGCG CAGTCCATTA CTGCCACAGC TAGGTTTAGG TGCAGATTAC ACCTATAGCA ACGGCTACCG CGACGCGAAC GGCATCAACT CTAACGCGAC CAGTGCGTCC CTGCAGTTAA CTCAATCCAT TTTTGATATG TCGAAATGGC GTGCGTTAAC GCTGCAGGAA AAAGCAGCAG GGATTCAGGA CGTCACGTAT CAGACCGATC AGCAAACCTT GATCCTCAAC ACCGCGACCG CTTATTTCAA CGTGTTGAAT GCTATTGACG TTCTTTCCTA TACACAGGCG CAAAAAGAAG CGATCTACCG TCAATTAGAT CAAACCACCC AACGTTTTAA CGTGGGCCTG GTAGCGATCA CCGACGTGCA GAACGCCCGC GCGCAGTACG ATACCGTGCT GGCGAACGAA GTGACCGCAC GTAATAACCT TGATAACGCG GTAGAGCAGC TGCGCCAGAT CACCGGTAAC TACTATCCGG AACTGGCGGC GCTGAATGTC GAAAACTTTA AAACCGACAA ACCACAGCCG GTTAACGCGC TGCTGAAAGA AGCCGAAAAA CGCAACCTGT CGCTGTTACA GGCACGCTTG AGCCAGGACC TGGCGCGCGA GCAAATTCGC CAGGCGCAGG ATGGTCACTT ACCGACTCTG GATTTAACGG CTTCTAGCGG GATTTCTGAC ACCTCTTATA GCGGTTCGAA AACCCGTGGT GCCGCTGGTA CCCAGTATGA CGATAGCAAT ATGGGCCAGA ACAAAGTTGG CCTGAGCTTC TCGCTGCCGA TTTATCAGGG CGGAATGGTT AACTCGCAGG TGAAACAGGC ACAGTACAAC TTTGTTGGTG CCAGCGAGCA ACTGGAAAGC GCGCATCGTA GCGTCGTGCA GACCGTACGT TCCTCTTTCA ACAACATTAA TGCTTCTATC AGTAGTATTA ACGCCTACAA ACAAGCCGTA GTTTCCGCTC AAAGCTCATT AGACGCGATG GAAGCGGGCT ACTCGGTCGG TACGCGTACC ATTGTTGATG TGTTGGATGC GACCACCACG CTGTACAACG CCAAGCAAGA GCTGGCGAAT GCGCGTTATA ACTACCTGAT TAATCAGCTG AATATTAAGT CAGCCCTGGG TACGTTGAAC GAGCAGGATC TGCTGGCACT GAACAATGCG CTGAGCAAAC CGGTTTCCAC TAATCCGGAA AACGTTGCCC CGCAAACGCC GGAACAGAAT GCTATTGCTG ATGGTTATGC GCCTGATAGC CCGGCACCCG TCGTTCAGCA AACATCCGCA CGCACTACCA CCAGTAACGG TCATAACCCT TTCCGTAACT GA
|
Protein sequence | MKKLLPILIG LSLSGFSSLS QAENLMQVYQ QARLSNPELR KSAADRDAAF EKINEARSPL LPQLGLGADY TYSNGYRDAN GINSNATSAS LQLTQSIFDM SKWRALTLQE KAAGIQDVTY QTDQQTLILN TATAYFNVLN AIDVLSYTQA QKEAIYRQLD QTTQRFNVGL VAITDVQNAR AQYDTVLANE VTARNNLDNA VEQLRQITGN YYPELAALNV ENFKTDKPQP VNALLKEAEK RNLSLLQARL SQDLAREQIR QAQDGHLPTL DLTASSGISD TSYSGSKTRG AAGTQYDDSN MGQNKVGLSF SLPIYQGGMV NSQVKQAQYN FVGASEQLES AHRSVVQTVR SSFNNINASI SSINAYKQAV VSAQSSLDAM EAGYSVGTRT IVDVLDATTT LYNAKQELAN ARYNYLINQL NIKSALGTLN EQDLLALNNA LSKPVSTNPE NVAPQTPEQN AIADGYAPDS PAPVVQQTSA RTTTSNGHNP FRN
|
| |