Gene ECH74115_0073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0073 
SymboltbpA 
ID6969330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp79297 
End bp80280 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content53% 
IMG OID643384153 
Productthiamine transporter substrate binding subunit 
Protein accessionYP_002268676 
Protein GI209399031 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4143] ABC-type thiamine transport system, periplasmic component 
TIGRFAM ID[TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily
[TIGR01276] thiamine ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTAAAAA AATGTCTTCC CCTGCTGTTG CTGTGCACAG CGCCCGTTTT CGCTAAACCC 
GTTCTGACTG TTTATACCTA CGATTCCTTC GCCGCCGACT GGGGGCCTGG TCCGGTGGTT
AAAAAAGCCT TTGAAGCCGA CTGTAATTGC GAACTGAAAC TGGTGGCGCT GGAAGATGGC
GTTTCGCTTC TCAACCGTCT ACGGATGGAA GGCAAAAACA GTAAAGCCGA TGTGGTGCTG
GGGCTGGATA ACAATCTGTT AGACGCCGCC AGTAAAACCG GACTGTTTGC CAAAAGCGGT
GTGGCAGCGG ATGCCGTTAA CGTTCCCGGC GGCTGGAATA ATGACATTTT CGTACCGTTT
GATTATGGCT ACTTCGCCTT CGTTTATGAC AAGAACAAAC TGAAAAACCC GCCACAAAGC
CTGAAAGAAC TGGTTGAGAG CGATCAAAAC TGGCGGGTGA TTTATCAGGA TCCGCGCACC
AGTACACCGG GGCTGGGTCT GTTGCTATGG ATGCAAAAAG TCTATGGCGA TGACGCCCCA
CAAGCCTGGC AGAAACTGGC GAAGAAAACG GTCACGGTCA CCAAAGGCTG GAGCGAAGCC
TACGGCCTGT TTTTAAAAGG TGAAAGCGAT CTGGTACTGA GTTACACCAC CTCTCCGGCT
TATCACATTC TCGAAGAGAA GAAAGATAAC TACGCCGCCG CGAACTTCAG CGAAGGTCAC
TATCTGCAAG TGGAAGTCGC CGCTCGCACC GCTGCCAGCA AGCAGCCGGA GCTGGCGCAA
AAATTCCTCC AGTTTATGGT TTCTCCGGCT TTCCAGAATG CGATCCCAAC CGGCAACTGG
ATGTATCCGG TGGCAAACGT CACGCTGCCT GCCGGTTTTG AAAAATTGAC CAAACCCGCA
ACCACGCTGG AGTTCACGCC AGCCGAAGTG GCGGCACAAC GTCAGGCATG GATTAGCGAA
TGGCAACGCG CCGTCAGCCG TTAA
 
Protein sequence
MLKKCLPLLL LCTAPVFAKP VLTVYTYDSF AADWGPGPVV KKAFEADCNC ELKLVALEDG 
VSLLNRLRME GKNSKADVVL GLDNNLLDAA SKTGLFAKSG VAADAVNVPG GWNNDIFVPF
DYGYFAFVYD KNKLKNPPQS LKELVESDQN WRVIYQDPRT STPGLGLLLW MQKVYGDDAP
QAWQKLAKKT VTVTKGWSEA YGLFLKGESD LVLSYTTSPA YHILEEKKDN YAAANFSEGH
YLQVEVAART AASKQPELAQ KFLQFMVSPA FQNAIPTGNW MYPVANVTLP AGFEKLTKPA
TTLEFTPAEV AAQRQAWISE WQRAVSR