Gene Information |
Locus tag | EcSMS35_0762 |
Symbol | tolA |
ID | 6144526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 768925 |
End bp | 770205 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615651 |
Product | cell envelope integrity inner membrane protein TolA |
Protein accession | YP_001742850 |
Protein GI | 170684137 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3064] Membrane protein involved in colicin uptake |
TIGRFAM ID | [TIGR02794] TolA protein |
|
|
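The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which the reported 1281 bp length implies) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(768925, 770205)   # Start bp, End bp from the record
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1281 426
```

Both values match the Gene Length and Protein Length fields listed in the record.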
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000171784 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCAAAGG CAACCGAACA AAACGACAAG CTCAAACGGG CGATAATTAT TTCAGCAGTG CTGCATGTCA TCTTATTTGC GGCGCTGATC TGGAGTTCGT TCGATGAGAA TATAGAAGCT TCAGCCGGAG GCGGCGGTGG TTCGTCCATC GACGCTGTCA TGGTTGATTC AGGTGCGGTA GTTGAGCAGT ACAAACGCAT GCAAAGCCAG GAATCAAGCG CGAAGCGTTC TGATGAGCAG CGCAAGATGA AGGAACAGCA GGCTGCTGAA GAACTGCGTG AGAAACAAGC GGCTGAACAG GAACGCCTGA AGCAACTTGA GAAAGAGCGG TTAGCGGCTC AGGAACAGAA AAAGCAGGCT GAAGAAGCCG CAAAACAGGC CGAGTTAAAG CAGAAGCAAG CGGAAGAAGC GGCAGCGAAA GCGGCGGCAG ATGCTAAAGC GAAGGCTGAA GCGGATGCAA AAGCTGCGGA AGAAGCAGCG AAGAAAGCGG CTGCAGACGC GAAGAAAAAA GCAGAAGCAG AAGCCGCCAA AGCCGCAGCC GAAGCGCAGA AAAAAGCCGA GGCAGCCGCG GCGGCACTGA AGAAGAAAGC GGAAGCGGCA GAAGCAGCTG CAGCTGAAGC AAGAAAGAAA GCGGCAACTG AAGCTGCTGA AAAAGCCAAA GCAGAAGCTG AGAAGAAAGC GGCTGCTGAA AAGGCTGCAG CTGATAAGAA AGCGGCAGCA GAGAAAGCTG CAGCCGACAA AAAAGCAGCA GAAAAAGCGG CTGCTGAAAA GGCAGCAGCT GATAAGAAAG CAGCGGCAGA AAAAGCCGCC GCAGACAAAA AAGCGGCAGC TGCAAAAGCG GCAGCTGCAA AAGCAGCAGC TGAAAAAGCC GCTGCAGCAA AAGCTGCCGC AGAGGCAGAT GATATTTTCG GTGAGCTAAG CTCTGGTAAG AATGCACCGA AAACGGGGGG AGGGGCGAAA GGGAACAATG CTTCGCCAGC CGGGAGTGGT AATACTAAAA ACAATGGCGC ATCAGGGGCC GATATCAATA ACTATGCCGG GCAGATTAAA TCTGCTATCG AAAGTAAGTT CTATGACGCA TCGTCCTATG CAGGCAAAAC CTGTACGCTG CGCATAAAAC TGGCACCCGA TGGCATGTTA CTGGATATCA AACCTGAAGG TGGCGATCCC GCACTTTGTC AGGCTGCGTT AGCAGCAGCT AAACTTGCGA AGATCCCGAA ACCACCAAGC CAGGCAGTAT ATGAAGTGTT CAAAAACGCG CCATTGGACT TCAAACCGTA A
|
Protein sequence | MSKATEQNDK LKRAIIISAV LHVILFAALI WSSFDENIEA SAGGGGGSSI DAVMVDSGAV VEQYKRMQSQ ESSAKRSDEQ RKMKEQQAAE ELREKQAAEQ ERLKQLEKER LAAQEQKKQA EEAAKQAELK QKQAEEAAAK AAADAKAKAE ADAKAAEEAA KKAAADAKKK AEAEAAKAAA EAQKKAEAAA AALKKKAEAA EAAAAEARKK AATEAAEKAK AEAEKKAAAE KAAADKKAAA EKAAADKKAA EKAAAEKAAA DKKAAAEKAA ADKKAAAAKA AAAKAAAEKA AAAKAAAEAD DIFGELSSGK NAPKTGGGAK GNNASPAGSG NTKNNGASGA DINNYAGQIK SAIESKFYDA SSYAGKTCTL RIKLAPDGML LDIKPEGGDP ALCQAALAAA KLAKIPKPPS QAVYEVFKNA PLDFKP
|
| |
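The sequences above are displayed in space-separated blocks of 10 characters, so the whitespace has to be removed before any computation. The following is a minimal sketch of that clean-up plus a GC-fraction calculation, shown on the first two blocks of the gene sequence only. The helper names are assumptions made for illustration.

```python
def strip_blocks(seq: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    cleaned = strip_blocks(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in cleaned) / len(cleaned)


# First two 10-base blocks of the gene sequence in the record.
first_blocks = "GTGTCAAAGG CAACCGAACA"
print(len(strip_blocks(first_blocks)))  # prints: 20
print(gc_fraction(first_blocks))        # prints: 0.5
```

Applied to the full gene sequence, the same function gives a value that can be compared with the GC content field reported in the record.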