Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0072 |
Symbol | tbpA |
ID | 6146880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 81343 |
End bp | 82326 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641614973 |
Product | thiamine transporter substrate binding subunit |
Protein accession | YP_001742189 |
Protein GI | 170682457 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4143] ABC-type thiamine transport system, periplasmic component |
TIGRFAM ID | [TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily [TIGR01276] thiamine ABC transporter, periplasmic binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.338985 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTAAAAA AATATCTTCC CCTGCTGTTG CTGTGCACAG CGCCCGTTTT CGCTAAACCC GTTCTGACCG TCTACACCTA CGATTCCTTC GCCGCCGACT GGGGGCCTGG TCCGGTGGTT AAAAAAGCCT TTGAAGCCGA CTGTAATTGC GAACTGAAAC TGGTGGCTCT GGAAGATGGC GTTTCGTTTC TCAACCGTCT ACGGATGGAA GGCAAAAACA GCAAAGCCGA TGTGGTGCTG GGGCTGGATA ACAATCTGTT AGATGCCGCC AGCAAAACCG GGCTGTTTGC TAAAAGCGGT GTGGCTGCGG ATGCCGTTAA CGTTCCCGGC GGCTGGAATA ATGACACTTT CGTCCCGTTT GATTACGGCT ATTTCGCCTT CGTCTATGAC AAGAACAAAC TGAAAAATCC GCCGCAAAGC CTGAAAGAGC TGGTTGAAAG CGATCAAAAC TGGCGGGTGA TTTATGAAGA TCCACGTACC AGTACACCGG GGCTGGGTCT GTTGCTATGG ATGCAAAAAG TCTATGGCGA TAACGCCCCG CAAGCCTGGC AGAAACTGGC GAAGAAAACT GTCACGGTCA CCAAAGGCTG GAGCGAAGCC TACGGCCTGT TTTTAAAAGG CGAAAGCGAT CTGGTTCTGA GTTACACCAC CTCTCCGGCT TATCACATTC TCGAAGAGAA GAAAGATAAC TACGCCGCCG CGAACTTCAG CGAAGGTCAC TATCTGCAGG TGGAAGTCGC CGCCCGCACC GCTGCCAGCA AGCAGCCAGA GCTGGCGCAA AAATTTCTCC AGTTTATGGT TTCTCCGGCT TTCCAGAATG CGATCCCAAC CGGCAACTGG ATGTATCCGG TGGCAAATGT CACGCTGCCT GCCGGGTTTG AACAATTGAC CAAACCGGCA ACCACGCTGG AGTTCACGCC AGCCGAAGTG GCGGCACAAC GTCAGGCATG GATTAGCGAA TGGCAACGCG CCGTCAGCCG TTAA
|
Protein sequence | MLKKYLPLLL LCTAPVFAKP VLTVYTYDSF AADWGPGPVV KKAFEADCNC ELKLVALEDG VSFLNRLRME GKNSKADVVL GLDNNLLDAA SKTGLFAKSG VAADAVNVPG GWNNDTFVPF DYGYFAFVYD KNKLKNPPQS LKELVESDQN WRVIYEDPRT STPGLGLLLW MQKVYGDNAP QAWQKLAKKT VTVTKGWSEA YGLFLKGESD LVLSYTTSPA YHILEEKKDN YAAANFSEGH YLQVEVAART AASKQPELAQ KFLQFMVSPA FQNAIPTGNW MYPVANVTLP AGFEQLTKPA TTLEFTPAEV AAQRQAWISE WQRAVSR
|
| |