Gene EcSMS35_0762 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0762 
SymboltolA 
ID6144526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp768925 
End bp770205 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content52% 
IMG OID641615651 
Productcell envelope integrity inner membrane protein TolA 
Protein accessionYP_001742850 
Protein GI170684137 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3064] Membrane protein involved in colicin uptake 
TIGRFAM ID[TIGR02794] TolA protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000171784 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCAAAGG CAACCGAACA AAACGACAAG CTCAAACGGG CGATAATTAT TTCAGCAGTG 
CTGCATGTCA TCTTATTTGC GGCGCTGATC TGGAGTTCGT TCGATGAGAA TATAGAAGCT
TCAGCCGGAG GCGGCGGTGG TTCGTCCATC GACGCTGTCA TGGTTGATTC AGGTGCGGTA
GTTGAGCAGT ACAAACGCAT GCAAAGCCAG GAATCAAGCG CGAAGCGTTC TGATGAGCAG
CGCAAGATGA AGGAACAGCA GGCTGCTGAA GAACTGCGTG AGAAACAAGC GGCTGAACAG
GAACGCCTGA AGCAACTTGA GAAAGAGCGG TTAGCGGCTC AGGAACAGAA AAAGCAGGCT
GAAGAAGCCG CAAAACAGGC CGAGTTAAAG CAGAAGCAAG CGGAAGAAGC GGCAGCGAAA
GCGGCGGCAG ATGCTAAAGC GAAGGCTGAA GCGGATGCAA AAGCTGCGGA AGAAGCAGCG
AAGAAAGCGG CTGCAGACGC GAAGAAAAAA GCAGAAGCAG AAGCCGCCAA AGCCGCAGCC
GAAGCGCAGA AAAAAGCCGA GGCAGCCGCG GCGGCACTGA AGAAGAAAGC GGAAGCGGCA
GAAGCAGCTG CAGCTGAAGC AAGAAAGAAA GCGGCAACTG AAGCTGCTGA AAAAGCCAAA
GCAGAAGCTG AGAAGAAAGC GGCTGCTGAA AAGGCTGCAG CTGATAAGAA AGCGGCAGCA
GAGAAAGCTG CAGCCGACAA AAAAGCAGCA GAAAAAGCGG CTGCTGAAAA GGCAGCAGCT
GATAAGAAAG CAGCGGCAGA AAAAGCCGCC GCAGACAAAA AAGCGGCAGC TGCAAAAGCG
GCAGCTGCAA AAGCAGCAGC TGAAAAAGCC GCTGCAGCAA AAGCTGCCGC AGAGGCAGAT
GATATTTTCG GTGAGCTAAG CTCTGGTAAG AATGCACCGA AAACGGGGGG AGGGGCGAAA
GGGAACAATG CTTCGCCAGC CGGGAGTGGT AATACTAAAA ACAATGGCGC ATCAGGGGCC
GATATCAATA ACTATGCCGG GCAGATTAAA TCTGCTATCG AAAGTAAGTT CTATGACGCA
TCGTCCTATG CAGGCAAAAC CTGTACGCTG CGCATAAAAC TGGCACCCGA TGGCATGTTA
CTGGATATCA AACCTGAAGG TGGCGATCCC GCACTTTGTC AGGCTGCGTT AGCAGCAGCT
AAACTTGCGA AGATCCCGAA ACCACCAAGC CAGGCAGTAT ATGAAGTGTT CAAAAACGCG
CCATTGGACT TCAAACCGTA A
 
Protein sequence
MSKATEQNDK LKRAIIISAV LHVILFAALI WSSFDENIEA SAGGGGGSSI DAVMVDSGAV 
VEQYKRMQSQ ESSAKRSDEQ RKMKEQQAAE ELREKQAAEQ ERLKQLEKER LAAQEQKKQA
EEAAKQAELK QKQAEEAAAK AAADAKAKAE ADAKAAEEAA KKAAADAKKK AEAEAAKAAA
EAQKKAEAAA AALKKKAEAA EAAAAEARKK AATEAAEKAK AEAEKKAAAE KAAADKKAAA
EKAAADKKAA EKAAAEKAAA DKKAAAEKAA ADKKAAAAKA AAAKAAAEKA AAAKAAAEAD
DIFGELSSGK NAPKTGGGAK GNNASPAGSG NTKNNGASGA DINNYAGQIK SAIESKFYDA
SSYAGKTCTL RIKLAPDGML LDIKPEGGDP ALCQAALAAA KLAKIPKPPS QAVYEVFKNA
PLDFKP