Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3213 |
Symbol | tolC |
ID | 5592899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 3222940 |
End bp | 3224427 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640922331 |
Product | outer membrane channel protein |
Protein accession | YP_001459829 |
Protein GI | 157162511 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1538] Outer membrane protein |
TIGRFAM ID | [TIGR01844] type I secretion outer membrane protein, TolC family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.00955114 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAATGA AGAAATTGCT CCCCATTCTT ATCGGCCTGA GCCTTTCTGG GTTCAGTTCG TTGAGCCAGG CCGAGAACCT GATGCAAGTT TATCAGCAAG CACGCCTTAG TAACCCGGAA TTGCGTAAGT CTGCCGCCGA TCGTGATGCT GCCTTTGAAA AAATTAATGA AGCGCGCAGT CCATTACTGC CACAGCTAGG TTTAGGTGCA GATTACACCT ATAGCAACGG CTACCGCGAC GCGAACGGCA TCAACTCTAA CGCGACCAGT GCGTCCTTGC AGTTAACTCA ATCCATTTTT GATATGTCGA AATGGCGTGC GTTAACGCTG CAGGAAAAAG CAGCAGGGAT TCAGGACGTC ACGTATCAGA CCGATCAGCA AACCTTGATC CTCAACACCG CGACCGCTTA TTTCAACGTG TTGAATGCTA TTGACGTTCT TTCCTATACA CAGGCACAAA AAGAAGCGAT CTACCGTCAA TTAGATCAAA CCACCCAACG TTTTAACGTG GGCCTGGTAG CGATCACCGA CGTGCAGAAC GCCCGCGCAC AGTACGATAC CGTGCTGGCG AACGAAGTGA CCGCACGTAA TAACCTTGAT AACGCGGTAG AGCAGCTGCG CCAGATCACC GGTAACTACT ATCCGGAACT GGCTGCGCTG AATGTCGAAA ACTTTAAAAC CGACAAACCA CAGCCGGTTA ACGCGCTGCT GAAAGAAGCC GAAAAACGCA ACCTGTCGCT GTTACAGGCA CGCTTGAGCC AGGACCTGGC GCGCGAGCAA ATTCGCCAGG CGCAGGATGG TCACTTACCG ACTCTGGATT TAACGGCTTC TACCGGGATT TCTGACACCT CTTATAGCGG TTCGAAAACC CGTGGTGCCG CTGGTACCCA GTATGACGAT AGCAATATGG GCCAGAACAA AGTTGGCCTG AGCTTCTCGC TGCCGATTTA TCAGGGCGGA ATGGTTAACT CGCAGGTGAA ACAGGCACAG TACAACTTTG TCGGTGCCAG CGAGCAACTG GAAAGTGCCC ATCGTAGCGT CGTGCAGACC GTGCGTTCCT CCTTCAACAA CATTAATGCA TCTATCAGTA GCATTAACGC CTACAAACAA GCCGTAGTTT CCGCTCAAAG CTCATTAGAC GCGATGGAAG CGGGCTACTC GGTCGGTACG CGTACCATTG TTGATGTGTT GGATGCGACC ACCACGTTGT ACAACGCCAA GCAAGAGCTG GCGAATGCGC GTTATAACTA CCTGATTAAT CAGCTGAATA TTAAGTCAGC CCTGGGTACG TTGAACGAGC AGGATCTGCT GGCACTGAAC AATGCGCTGA GCAAACCGGT TTCCACTAAT CCGGAAAACG TTGCCCCGCA AACGCCGGAA CAGAATGCTA TTGCTGATGG TTATGCGCCT GATAGCCCGG CCCCCGTCGT TCAGCAAACA TCCGCACGCA CTACCACCAG TAACGGTCAT AACCCTTTCC GTAACTGA
|
Protein sequence | MQMKKLLPIL IGLSLSGFSS LSQAENLMQV YQQARLSNPE LRKSAADRDA AFEKINEARS PLLPQLGLGA DYTYSNGYRD ANGINSNATS ASLQLTQSIF DMSKWRALTL QEKAAGIQDV TYQTDQQTLI LNTATAYFNV LNAIDVLSYT QAQKEAIYRQ LDQTTQRFNV GLVAITDVQN ARAQYDTVLA NEVTARNNLD NAVEQLRQIT GNYYPELAAL NVENFKTDKP QPVNALLKEA EKRNLSLLQA RLSQDLAREQ IRQAQDGHLP TLDLTASTGI SDTSYSGSKT RGAAGTQYDD SNMGQNKVGL SFSLPIYQGG MVNSQVKQAQ YNFVGASEQL ESAHRSVVQT VRSSFNNINA SISSINAYKQ AVVSAQSSLD AMEAGYSVGT RTIVDVLDAT TTLYNAKQEL ANARYNYLIN QLNIKSALGT LNEQDLLALN NALSKPVSTN PENVAPQTPE QNAIADGYAP DSPAPVVQQT SARTTTSNGH NPFRN
|
| |