Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E3468 |
Symbol | tolC |
ID | 6270785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 3222921 |
End bp | 3224408 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641727353 |
Product | outer membrane channel protein |
Protein accession | YP_001881802 |
Protein GI | 187730816 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1538] Outer membrane protein |
TIGRFAM ID | [TIGR01844] type I secretion outer membrane protein, TolC family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0175451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAATGA AGAAATTGCT CCCCATTCTT ATCGGCCTGA GCCTTTCTGG GTTCAGTTCG TTGAGCCAGG CCGAGAACCT GATGCAAGTT TATCAGCAAG CACGCCTTAG TAACCCGGAA TTGCGTAAGT CTGCCGCCGA TCGTGATGCT GCCTTTGAAA AAATTAATGA AGCGCGCAGT CCATTACTGC CACAGCTAGG TTTAGGTGCA GATTACACCT ATAGCAACGG CTACCGCGAC GCGAACGGCA TCAACTCGAA CGCGACCAGT GCGTCCCTGC AGTTAGCTCA ATCCATTTTT GATATGTCGA AATGGCGTGC GTTAACGCTG CAGGAAAAAG CAGCAGGGAT TCAGGACGTC ACATATCAGA CCGATCAGCA AACCTTGATC CTCAACACCG CGACCGCTTA TTTCAACGTG TTGAATGCTA TTGACGTTCT TTCCTATACA CAGGCACAAA AAGAAGCGAT CTACCGTCAA TTAGATCAAA CCACCCAACG TTTTAACGTG GGCCTGGTAG CGATCACCGA CGTGCAGAAC GCCCGCGCGC AGTACGATAC CGTGCTGGCG AACGAAGTGA CCGCACGTAA TAACCTTGAT AACGCGGTAG AGCAGCTGCG CCAGATCACC GGTAACTACT ATCCGGAACT GGCGGCGCTG AATGTCGAAA ACTTTAAAAC CGACAAACCA CAGCCGGTTA ACGCGCTGCT GAAAGAAGCC GAAAAACGCA ACCTGTCGCT GTTACAGGCA CGCTTGAGCC AGGACCTGGC GCGCGAGCAA ATTCGCCAGG CGCAGGATGG TCACTTACCG ACGCTGGATT TAACGGCTTC TACCGGGATT TCTGACACCT CTTATAGCGG TTCGAAAACT CGTGGTGCCG CTGGTACCCA GTATGACGAC AGCAATATGG GCCAGAACAA AGTGGGCCTG AGCTTCTCGC TGCCGATTTA TCAGGGCGGA ATGGTTAACT CGCAGGTGAA ACAGGCCCAG TACAACTTTG TTGGTGCCAG CGAGCAACTG GAAAGCGCGC ATCGTAGCAT CGTGCAAACC GTACGTTCCT CCTTCAACAA CATTAATGCA TCTATCAGTA GCATTAACGC CTACAAACAA GCCGTAGTTT CCGCTCAAAG CTCATTAGAC GCGATGGAAG CGGGCTACTC GGTCGGTACG CGTACCATTG TTGATGTGTT GGATGCAACC ACCACGCTGT ACAACGCTAA GCAAGAGCTG GCAAATGCGC GTTATAACTA CCTGATTAAT CAGCTGAATA TTAAGTCAGC CCTGGGTACG TTGAACGAGC AGGATCTGCT GGCACTGAAC AATGCGCTGA GCAAACCGGT TTCCACTAAT CCGGAAAACG TTGCCCCGCA AACGCCGGAA CAGAATGCTA TTGCTGATGG TTATGCGCCT GATAGCCCGG CACCCGTCGT TCAGCAAACA TCCGCACGCA CTACCACCAG TAACGGTCAT AACCCTTTCC GTAACTGA
|
Protein sequence | MQMKKLLPIL IGLSLSGFSS LSQAENLMQV YQQARLSNPE LRKSAADRDA AFEKINEARS PLLPQLGLGA DYTYSNGYRD ANGINSNATS ASLQLAQSIF DMSKWRALTL QEKAAGIQDV TYQTDQQTLI LNTATAYFNV LNAIDVLSYT QAQKEAIYRQ LDQTTQRFNV GLVAITDVQN ARAQYDTVLA NEVTARNNLD NAVEQLRQIT GNYYPELAAL NVENFKTDKP QPVNALLKEA EKRNLSLLQA RLSQDLAREQ IRQAQDGHLP TLDLTASTGI SDTSYSGSKT RGAAGTQYDD SNMGQNKVGL SFSLPIYQGG MVNSQVKQAQ YNFVGASEQL ESAHRSIVQT VRSSFNNINA SISSINAYKQ AVVSAQSSLD AMEAGYSVGT RTIVDVLDAT TTLYNAKQEL ANARYNYLIN QLNIKSALGT LNEQDLLALN NALSKPVSTN PENVAPQTPE QNAIADGYAP DSPAPVVQQT SARTTTSNGH NPFRN
|
| |