Gene Information |
Locus tag | EcSMS35_3330 |
Symbol | tolC |
ID | 6142633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3406637 |
End bp | 3408118 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641618159 |
Product | outer membrane channel protein |
Protein accession | YP_001745309 |
Protein GI | 170682545 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1538] Outer membrane protein |
TIGRFAM ID | [TIGR01844] type I secretion outer membrane protein, TolC family |
|
|
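The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the record itself does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3406637
end_bp = 3408118

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1482
print(remainder)       # 0
print(protein_length)  # 493
```

Under these assumptions the computed values agree with the listed gene length (1482 bp) and protein length (493 aa).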
Plasmid Coverage Information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAT TGCTCCCCAT TCTTATCGGC CTGAGCCTTT CTGGTTTCAG TTCGTTGAGC CAGGCCGAGA ACCTGATGCA AGTTTATCAG CAAGCACGCC TTAGTAACCC GGAATTGCGT AAGTCTGCCG CCGATCGTGA TGCTGCCTTT GAAAAAATTA ATGAAGCGCG CAGTCCATTA CTGCCACAGC TAGGTTTAGG TGCAGATTAC ACCTATAGCA ACGGCTACCG CGACGCGAAC GGCATCAACT CTAACGCGAC CAGTGCGTCC CTGCAGTTAA CTCAATCCAT TTTTGATATG TCGAAATGGC GTGCGTTAAC GCTGCAGGAA AAAGCAGCTG GGATTCAGGA CGTCACGTAT CAGACCGATC AACAAACTTT GATCCTCAAC ACCGCGACCG CTTATTTCAA CGTGTTGAAT GCTATTGACG TTCTTTCCTA TACACAGGCG CAAAAAGAAG CGATCTACCG TCAATTAGAT CAAACCACCC AACGTTTTAA CGTGGGCCTG GTAGCGATCA CTGACGTGCA GAACGCCCGC GCACAGTACG ATACCGTGCT GGCGAACGAA GTGACCGCAC GTAATAACCT TGATAACGCG GTAGAGCAGC TGCGCCAGAT CACCGGTAAC TACTATCCGG AACTGGCGGC GCTGAATGTC GAAAACTTTA AAACCGACAA ACCACAGCCG GTTAACACGC TGCTGAAAGA AGCCGAAAAA CGCAACCTGT CGTTGTTACA GGCGCGCTTG AGCCAGGACC TGGCGCGCGA GCAAATTCGC CAGGCGCAGG ATGGTCACTT ACCGACTCTG GATTTAACGG CTTCTACCGG GATTTCTGAC ACCTCTTATA GCGGTTCGAA AACCCGTGGT GCCGCTGGTA CCCAGTATGA CGATAGCAAT ATGGGCCAGA ACAAAGTTGG CCTGAGCTTC TCGCTGCCGA TTTATCAGGG CGGAATGGTT AACTCGCAGG TGAAACAGGC CCAGTACAAC TTTGTTGGTG CCAGCGAGCA ACTGGAAAGC GCGCATCGTA GCGTCGTACA AACCGTACGT TCCTCCTTCA ACAACATTAA TGCTTCTATC AGTAGCATTA ACGCCTACAA ACAAGCCGTA GTTTCCGCTC AAAGCTCATT AGACGCGATG GAAGCGGGTT ACTCGGTCGG TACGCGTACC ATTGTTGATG TGTTGGATGC GACCACCACG CTGTACAACG CCAAGCAAGA GCTGGCAAAT GCGCGTTATA ACTACCTGAT TAACCAGTTG AATATTAAAT CAGCTCTGGG TACGTTGAAC GAGCAGGATC TTCTGGCACT GAACAATGCG CTGAGCAAAC CGGTTTCCAC TAATCCGGAA AACGTTGCCC CGCAAACGCC GGAACAGAAT GCTATTGCTG ATGGTTATGC GCCTGATAGC CCGGCACCCG TCGTTCAGCA AACATCCGCA CGCACTACCA CCAGTAACGG TCATAACCCT TTCCGTAACT GA
|
Protein sequence | MKKLLPILIG LSLSGFSSLS QAENLMQVYQ QARLSNPELR KSAADRDAAF EKINEARSPL LPQLGLGADY TYSNGYRDAN GINSNATSAS LQLTQSIFDM SKWRALTLQE KAAGIQDVTY QTDQQTLILN TATAYFNVLN AIDVLSYTQA QKEAIYRQLD QTTQRFNVGL VAITDVQNAR AQYDTVLANE VTARNNLDNA VEQLRQITGN YYPELAALNV ENFKTDKPQP VNTLLKEAEK RNLSLLQARL SQDLAREQIR QAQDGHLPTL DLTASTGISD TSYSGSKTRG AAGTQYDDSN MGQNKVGLSF SLPIYQGGMV NSQVKQAQYN FVGASEQLES AHRSVVQTVR SSFNNINASI SSINAYKQAV VSAQSSLDAM EAGYSVGTRT IVDVLDATTT LYNAKQELAN ARYNYLINQL NIKSALGTLN EQDLLALNNA LSKPVSTNPE NVAPQTPEQN AIADGYAPDS PAPVVQQTSA RTTTSNGHNP FRN
|
| |
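The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing, compute a GC fraction, and translate a coding sequence. It is an illustration under stated assumptions, not the method used to produce this record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the input is assumed to contain only A, C, G, and T, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in which start codons it permits).

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments (assumption: no ambiguous bases).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence listed above.
head = "ATGAAGAAAT TGCTCCCCAT TCTTATCGGC"
tail = "TTCCGTAACT GA"

print(translate(head))  # MKKLLPILIG
print(translate(tail))  # FRN
```

The two fragments translate to the first ten and last three residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content field.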