Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0072 |
Symbol | thiP |
ID | 6970863 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 77711 |
End bp | 79321 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643384152 |
Product | thiamine transporter membrane protein |
Protein accession | YP_002268675 |
Protein GI | 209398294 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1178] ABC-type Fe3+ transport system, permease component |
TIGRFAM ID | [TIGR01253] thiamine ABC transporter, permease protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACGC GCCGTCAGCC GTTAATTCCC GGCTGGTTAA TTCCAGGTGT AAGCGCCGCC ACGCTGGTGG TAGCGGTTGC GCTGGCGGCG TTTCTCGCCC TGTGGTGGAA CGCGCCGCAG GGTGACTGGT CGGCAGTCTG GCAGGACAGC TACCTGTGGC ATGTGGTGCG CTTCTCCTTC TGGCAGGCGT TTCTCTCGGC GCTACTCTCT GTCGTACCCG CGATATTCCT CGCCCGCGCG CTCTATCGCA GGCGCTTTCC GGGTCGGCTG GCGCTGTTGC GTCTGTGCGC AATGACCTTG ATCCTCCCGG TGCTGGTTGC TGTTTTCGGC ATTCTTAGCG TCTATGGTCG CCAGGGCTGG CTGGCATCGC TCTGCCAATC GCTCGGTCTG GAGTGGACCT TTTCGCCCTA CGGCCTGCAA GGCATTTTGC TGGCCCATGT GTTTTTTAAT CTGCCGATGG CGAGCCGCTT ATTACTCCAG GCACTGGAAA ACATTCCCGG CGAACAACGT CAGCTTGCCG CCCAGCTTGG GATGCGCGGC TGGCATTTTT TCCGCTTCGT CGAATGGCCG TGGTTACGGC GACAAATCCC GCCGGTTACC GCGCTTATCT TTATGCTCTG TTTCGCCAGC TTCGCCACCG TGTTATCGCT GGGCGGCGGT CCGCAGGCAA CCACTATCGA GCTGGCAATC TATCAGGCGC TGAGTTACGA CTACGATCCT GCCCGCGCGG CGATGCTGGC GCTGATCCAG ATGGTGTGTT GCCTTGGGCT GGTGCTGTTG AGTCAGCGAC TGAGTAAGGC CATTGCGCCC GGCACCACGC TGCTGCAAGG CTGGCGCGAC CCGGACGATC GTCTGCATAG CCGCATTTGC GACACGATGT TAATTGTGCT GGCGCTGCTG CTGTTGCTGC CACCGTTGCT GGCGGTGATC GTCGATGGGG TAAATCGCCA GTTGCCGGAA GTGCTGGCAC AACCGGTGCT GTGGCAGGCG CTGTGGACCT CGTTGCGTAT TGCGCTGGCG GCAGGTGTAT TGTGCGTAGT GCTGACCATG ATGCTGCTAT GGAGCAGTCG CGAACTGCGG GCGCGGCAAA AAATGCTGGC GGGCCAGGCG CTGGAGATGA GCGGCATGTT GATCCTCGCT ATGCCGGGGA TTGTGCTGGC AACGGGCTTC TTTTTACTGC TCAACAACAC TATCGGCCTG CCACAATCTG CTGACGGCAT TGTGATTTTC ACCAATGCGT TAATGGCGAT CCCCTACGCC CTGAAAGTGC TGGAAAACCC GATGCGCGAT ATCACCGCCC GCTACAGCAT GTTATGTCAG TCGCTGGGGA TTGAAGGCTG GTCACGCTTA AAAGTAGTGG AGCTGCGCGC CCTGAAACGC CCGCTGGCGC AGGCGCTGGC CTTTGCCTGC GTGCTGTCGA TTGGTGATTT TGGCGTGGTG GCGTTGTTCG GTAACGATGA TTTCCGCACC CTGCCGTTTT ATCTCTACCA GCAAATTGGC TCCTATCGCA GCCAGGACGG CGCGGTCACC GCGTTAATTC TGCTGCTGCT CTGTTTTCTG CTGTTTAGCG TGATTGAAAA AATACCGGGG CGAAATGTTA AAACTGACTG A
|
Protein sequence | MATRRQPLIP GWLIPGVSAA TLVVAVALAA FLALWWNAPQ GDWSAVWQDS YLWHVVRFSF WQAFLSALLS VVPAIFLARA LYRRRFPGRL ALLRLCAMTL ILPVLVAVFG ILSVYGRQGW LASLCQSLGL EWTFSPYGLQ GILLAHVFFN LPMASRLLLQ ALENIPGEQR QLAAQLGMRG WHFFRFVEWP WLRRQIPPVT ALIFMLCFAS FATVLSLGGG PQATTIELAI YQALSYDYDP ARAAMLALIQ MVCCLGLVLL SQRLSKAIAP GTTLLQGWRD PDDRLHSRIC DTMLIVLALL LLLPPLLAVI VDGVNRQLPE VLAQPVLWQA LWTSLRIALA AGVLCVVLTM MLLWSSRELR ARQKMLAGQA LEMSGMLILA MPGIVLATGF FLLLNNTIGL PQSADGIVIF TNALMAIPYA LKVLENPMRD ITARYSMLCQ SLGIEGWSRL KVVELRALKR PLAQALAFAC VLSIGDFGVV ALFGNDDFRT LPFYLYQQIG SYRSQDGAVT ALILLLLCFL LFSVIEKIPG RNVKTD
|
| |