Gene Information |
Locus tag | EcolC_3589 |
Symbol | tbpA |
ID | 6066442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3924101 |
End bp | 3925084 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641603007 |
Product | thiamine transporter substrate binding subunit |
Protein accession | YP_001726530 |
Protein GI | 170021576 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4143] ABC-type thiamine transport system, periplasmic component |
TIGRFAM ID | [TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily; [TIGR01276] thiamine ABC transporter, periplasmic binding protein |
|
|
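The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the stated lengths) and that the final codon of the CDS is a stop codon. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3924101
end_bp = 3925084

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 984 327
```

Both results agree with the Gene Length (984 bp) and Protein Length (327 aa) fields.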
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000386995 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTTAAAAA AATGTCTGCC CCTGCTGTTG CTGTGCACAG CGCCCGTTTT CGCTAAACCC GTTCTGACTG TTTATACCTA CGATTCCTTC GCCGCCGACT GGGGGCCTGG TCCGGTGGTT AAAAAAGCCT TTGAAGCCGA CTGTAATTGC GAACTGAAAC TGGTGGCGCT GGAAGATGGC GTTTCGCTTC TCAACCGTCT ACGGATGGAA GGCAAAAACA GCAAAGCCGA TGTAGTTCTG GGGCTGGATA ACAACCTGTT AGACGCCGCC AGTAAAACCG GACTGTTTGC CAAAAGCGGC GTGGCAGCGG ATGCCATTAA CGTTCCCGGC GGCTGGAATA ATGACACTTT CGTACCGTTT GATTACGGCT ACTTCGCCTT CGTCTACGAC AAGAACAAAC TGAAAAACCC GCCCCAAAGC CTGAAAGAGC TGGTTGAGAG CGATCAAAAC TGGCGAGTGA TTTATCAGGA TCCGCGCACC AGTACGCCGG GGCTGGGTCT GCTGTTGTGG ATGCAAAAAG TCTATGGCGA TGACACCCCG CAGGCCTGGC AGAAACTGGC GAAGAAAACT GTCACCGTCA CCAAAGGCTG GAGCGAAGCC TACGGCCTGT TTTTAAAAGG TGAAAGCGAT CTGGTACTGA GTTACACCAC CTCTCCGGCT TATCACATTC TCGAAGAGAA AAAAGATAAC TACGCCGCCG CGAACTTTAG CGAAGGTCAC TATCTGCAGG TGGAAGTCGC CGCCCGCACC GCTGCCAGCA AGCAGCCGGA GCTGGCGCAA AAATTTCTCC AGTTTATGGT TTCTCCGGCT TTCCAGAATG CGATCCCAAC CGGCAACTGG ATGTATCCGG TGGCAAATGT CACGCTGCCT GCCGGGTTTG AACAATTGAC CAAACCAGCA ACCACGCTGG AGTTCACGCC AGCCGAAGTG GCGGCACAAC GTCAGGCATG GATTAGCGAA TGGCAACGCG CCGTCAGCCG TTAA
|
Protein sequence | MLKKCLPLLL LCTAPVFAKP VLTVYTYDSF AADWGPGPVV KKAFEADCNC ELKLVALEDG VSLLNRLRME GKNSKADVVL GLDNNLLDAA SKTGLFAKSG VAADAINVPG GWNNDTFVPF DYGYFAFVYD KNKLKNPPQS LKELVESDQN WRVIYQDPRT STPGLGLLLW MQKVYGDDTP QAWQKLAKKT VTVTKGWSEA YGLFLKGESD LVLSYTTSPA YHILEEKKDN YAAANFSEGH YLQVEVAART AASKQPELAQ KFLQFMVSPA FQNAIPTGNW MYPVANVTLP AGFEQLTKPA TTLEFTPAEV AAQRQAWISE WQRAVSR
|
| |
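The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is an illustrative sketch rather than the method used to build the record. The helper names `gc_fraction` and `translate_cds` are hypothetical. The codon table is the standard one, whose amino acid assignments translation table 11 shares, and the code assumes the input begins at the annotated start codon, so the first codon (here GTG) is read as methionine. Only the first 30 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard codon table in TCAG order; translation table 11 uses the same
# amino acid assignments and differs in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten bases."""
    return dna.replace(" ", "").upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C (hypothetical helper)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate_cds(dna: str) -> str:
    """Translate a CDS up to the first stop codon (hypothetical helper).

    Assumption: the input starts at the annotated start codon, so the first
    residue is reported as M even when the codon is GTG.
    """
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        residues[0] = "M"
    return "".join(residues).split("*")[0]


# First 30 bases of the gene sequence listed above.
excerpt = "GTGTTAAAAA AATGTCTGCC CCTGCTGTTG"
print(translate_cds(excerpt))  # prints: MLKKCLPLLL
print(round(gc_fraction(excerpt), 3))
```

The translated excerpt matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same functions gives values that can be compared against the GC content and Protein Length fields.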