Gene Information |
Locus tag | EcHS_A0792 |
Symbol | tolA |
ID | 5593290 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 803195 |
End bp | 804460 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640919966 |
Product | cell envelope integrity inner membrane protein TolA |
Protein accession | YP_001457540 |
Protein GI | 157160222 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3064] Membrane protein involved in colicin uptake |
TIGRFAM ID | [TIGR02794] TolA protein |
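
The coordinate and length fields above can be cross-checked against each other. The sketch below is only a consistency check on the values listed in this record; it assumes the start and end positions are 1-based and inclusive, which the record itself does not state.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 803195
end_bp = 804460
gene_length_bp = 1266
protein_length_aa = 421

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1

# A CDS is a whole number of codons; the final codon is the stop codon,
# which is not counted in the protein length.
codons, remainder = divmod(span_bp, 3)
expected_protein_length_aa = codons - 1

print(span_bp, remainder, expected_protein_length_aa)
```

Under that assumption the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.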
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00000000000022577 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCAAAGG CAACCGAACA AAACGACAAG CTCAAGCGGG CGATAATTAT TTCAGCAGTG CTGCATGTCA TCTTATTTGC GGCGCTGATC TGGAGTTCGT TCGATGAGAA TATAGAAGCT TCAGCCGGAG GCGGCGGTGG TTCGTCCATC GACGCTGTCA TGGTTGATTC AGGTGCGGTA GTTGAGCAGT ACAAACGCAT GCAAAGCCAG GAATCAAGCG CGAAGCGTTC TGATGAACAG CGCAAGATGA AGGAACAGCA GGCTGCTGAA GAACTGCGTG AGAAACAAGC GGCTGAACAG GAACGCCTGA AGCAACTTGA GAAAGAGCGG TTAGCGGCTC AGGAGCAGAA AAAGCAGGCT GAAGAAGCCG CAAAACAGGC CGAGTTAAAG CAGAAGCAAG CTGAAGAGGC GGCAGCGAAA GCGGCGGCAG ATGCTAAAGC GAAGGCCGAA GCAGATGCTA AAGCTGCGGA AGAAGCAGCG AAGAAAGCGG CTGCAGACGC AAAGAAAAAA GCAGAAGCAG AAGCCGCCAA AGCCGCAGCC GAAGCGCAGA AAAAAGCCGA GGCAGCCGCT GCGGCACTGA AGAAGAAAGC GGAAGCGGCA GAAGCAGCTG CAGCTGAAGC AAGAAAGAAA GCGGCAACTG AAGCTGCTGA AAAAGCCAAA GCAGAAGCTG AGAAGAAAGC GGCTGCTGAA AAGGCTGCAG CTGATAAGAA AGCGGCAGCA GAGAAAGCTG CAGCCGACAA AAAAGCAGCA GAAAAAGCGG CTGCTGAAAA GGCAGCAGCT GATAAGAAAG CAGCGGCAGA AAAAGCCGCC GCAGACAAAA AAGCGGCAGC GGCAAAAGCT GCAGCTGAAA AAGCCGCTGC AGCAAAAGCG GCCGCAGAGG CAGATGATAT TTTCGGTGAG CTAAGCTCTG GTAAGAATGC ACCGAAAACG GGGGGAGGGG CGAAAGGGAA CAATGCTTCG CCTGCCGGGA GTGGTAATAC TAAAAACAAT GGCGCATCAG GGGCCGATAT CAATAACTAT GCCGGGCAGA TTAAATCTGC TATCGAAAGT AAGTTCTATG ACGCATCGTC CTATGCAGGC AAAACCTGTA CGCTGCGCAT AAAACTGGCA CCCGATGGTA TGTTACTGGA TATCAAACCT GAAGGTGGCG ATCCCGCACT TTGTCAGGCT GCGTTGGCAG CAGCTAAACT TGCGAAGATC CCGAAACCAC CAAGCCAGGC AGTATATGAA GTGTTCAAAA ACGCGCCATT GGACTTCAAA CCGTAA
|
Protein sequence | MSKATEQNDK LKRAIIISAV LHVILFAALI WSSFDENIEA SAGGGGGSSI DAVMVDSGAV VEQYKRMQSQ ESSAKRSDEQ RKMKEQQAAE ELREKQAAEQ ERLKQLEKER LAAQEQKKQA EEAAKQAELK QKQAEEAAAK AAADAKAKAE ADAKAAEEAA KKAAADAKKK AEAEAAKAAA EAQKKAEAAA AALKKKAEAA EAAAAEARKK AATEAAEKAK AEAEKKAAAE KAAADKKAAA EKAAADKKAA EKAAAEKAAA DKKAAAEKAA ADKKAAAAKA AAEKAAAAKA AAEADDIFGE LSSGKNAPKT GGGAKGNNAS PAGSGNTKNN GASGADINNY AGQIKSAIES KFYDASSYAG KTCTLRIKLA PDGMLLDIKP EGGDPALCQA ALAAAKLAKI PKPPSQAVYE VFKNAPLDFK P
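
The gene sequence begins with GTG, while the protein sequence begins with M. Translation table 11 uses the same amino-acid assignments as the standard genetic code but accepts GTG as an initiation codon, which is then read as methionine. The following is a minimal sketch of that translation, not the tool used to build this record; the function name `translate_cds` is hypothetical, and the code simply treats the first codon as the initiator without validating it.

```python
# Compact encoding of the codon table, with bases ordered T, C, A, G.
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as Met and stopping at a stop codon.

    Simplifying assumption: the first codon is taken to be a valid
    initiation codon and is not checked against the table's start codons.
    """
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append("M" if pos == 0 else residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
first_30_nt = "GTGTCAAAGG CAACCGAACA AAACGACAAG"
first_10_aa = translate_cds(first_30_nt)
print(first_10_aa)  # MSKATEQNDK, the first group of the protein sequence
```

The same function can be applied to the full gene sequence to compare its output with the listed protein sequence.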