Gene EcHS_A0792 details


Gene Information

Locus tag: EcHS_A0792
Symbol: tolA
ID: 5593290
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 803195
End bp: 804460
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 52%
IMG OID: 640919966
Product: cell envelope integrity inner membrane protein TolA
Protein accession: YP_001457540
Protein GI: 157160222
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3064] Membrane protein involved in colicin uptake
TIGRFAM ID: [TIGR02794] TolA protein
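
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the last codon
    is a stop codon that does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(803195, 804460)
print(gene_len, protein_len)  # 1266 421
```

Under these assumptions the result agrees with the Gene Length (1266 bp) and Protein Length (421 aa) fields of the record.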


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000000022577
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCAAAGG CAACCGAACA AAACGACAAG CTCAAGCGGG CGATAATTAT TTCAGCAGTG 
CTGCATGTCA TCTTATTTGC GGCGCTGATC TGGAGTTCGT TCGATGAGAA TATAGAAGCT
TCAGCCGGAG GCGGCGGTGG TTCGTCCATC GACGCTGTCA TGGTTGATTC AGGTGCGGTA
GTTGAGCAGT ACAAACGCAT GCAAAGCCAG GAATCAAGCG CGAAGCGTTC TGATGAACAG
CGCAAGATGA AGGAACAGCA GGCTGCTGAA GAACTGCGTG AGAAACAAGC GGCTGAACAG
GAACGCCTGA AGCAACTTGA GAAAGAGCGG TTAGCGGCTC AGGAGCAGAA AAAGCAGGCT
GAAGAAGCCG CAAAACAGGC CGAGTTAAAG CAGAAGCAAG CTGAAGAGGC GGCAGCGAAA
GCGGCGGCAG ATGCTAAAGC GAAGGCCGAA GCAGATGCTA AAGCTGCGGA AGAAGCAGCG
AAGAAAGCGG CTGCAGACGC AAAGAAAAAA GCAGAAGCAG AAGCCGCCAA AGCCGCAGCC
GAAGCGCAGA AAAAAGCCGA GGCAGCCGCT GCGGCACTGA AGAAGAAAGC GGAAGCGGCA
GAAGCAGCTG CAGCTGAAGC AAGAAAGAAA GCGGCAACTG AAGCTGCTGA AAAAGCCAAA
GCAGAAGCTG AGAAGAAAGC GGCTGCTGAA AAGGCTGCAG CTGATAAGAA AGCGGCAGCA
GAGAAAGCTG CAGCCGACAA AAAAGCAGCA GAAAAAGCGG CTGCTGAAAA GGCAGCAGCT
GATAAGAAAG CAGCGGCAGA AAAAGCCGCC GCAGACAAAA AAGCGGCAGC GGCAAAAGCT
GCAGCTGAAA AAGCCGCTGC AGCAAAAGCG GCCGCAGAGG CAGATGATAT TTTCGGTGAG
CTAAGCTCTG GTAAGAATGC ACCGAAAACG GGGGGAGGGG CGAAAGGGAA CAATGCTTCG
CCTGCCGGGA GTGGTAATAC TAAAAACAAT GGCGCATCAG GGGCCGATAT CAATAACTAT
GCCGGGCAGA TTAAATCTGC TATCGAAAGT AAGTTCTATG ACGCATCGTC CTATGCAGGC
AAAACCTGTA CGCTGCGCAT AAAACTGGCA CCCGATGGTA TGTTACTGGA TATCAAACCT
GAAGGTGGCG ATCCCGCACT TTGTCAGGCT GCGTTGGCAG CAGCTAAACT TGCGAAGATC
CCGAAACCAC CAAGCCAGGC AGTATATGAA GTGTTCAAAA ACGCGCCATT GGACTTCAAA
CCGTAA
 
Protein sequence
MSKATEQNDK LKRAIIISAV LHVILFAALI WSSFDENIEA SAGGGGGSSI DAVMVDSGAV 
VEQYKRMQSQ ESSAKRSDEQ RKMKEQQAAE ELREKQAAEQ ERLKQLEKER LAAQEQKKQA
EEAAKQAELK QKQAEEAAAK AAADAKAKAE ADAKAAEEAA KKAAADAKKK AEAEAAKAAA
EAQKKAEAAA AALKKKAEAA EAAAAEARKK AATEAAEKAK AEAEKKAAAE KAAADKKAAA
EKAAADKKAA EKAAAEKAAA DKKAAAEKAA ADKKAAAAKA AAEKAAAAKA AAEADDIFGE
LSSGKNAPKT GGGAKGNNAS PAGSGNTKNN GASGADINNY AGQIKSAIES KFYDASSYAG
KTCTLRIKLA PDGMLLDIKP EGGDPALCQA ALAAAKLAKI PKPPSQAVYE VFKNAPLDFK
P
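
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. This is a minimal sketch, not the database's own pipeline: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon-to-amino-acid string is the standard one shared by translation tables 1 and 11, and the set of alternative start codons is an assumption based on the commonly listed table 11 initiators. An initiator codon such as GTG is read as M only at the first position.

```python
# Standard codon table in TCAG order (same amino acids for tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: initiator codons commonly listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS, reading an initiator first codon as M and stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        codon = s[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if pos == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "GTGTCAAAGG CAACCGAACA AAACGACAAG"
print(translate(first_block))  # MSKATEQNDK
```

The printed peptide matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.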