Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0071 |
Symbol | thiP |
ID | 6145002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 79757 |
End bp | 81367 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641614972 |
Product | thiamine transporter membrane protein |
Protein accession | YP_001742188 |
Protein GI | 170682672 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1178] ABC-type Fe3+ transport system, permease component |
TIGRFAM ID | [TIGR01253] thiamine ABC transporter, permease protein |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.818645 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.387113 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACGC GCCGTCAGCC GTTAATTCCC GGCTGGTTAA TTCCAGGTGT AAGCGCCGCT ACGCTTGTGG TGGCGGTTGC GCTGGCGGCG TTTCTCGCCC TGTGGTGGAA CGCGCCGCAG GGTGACTGGG TGGCAGTCTG GCAGGACAGC TATCTGTGGC ATGTGGTGCG CTTCTCCTTC TGGCAGGCGT TTCTCTCGGC GCTGCTCTCT GTCGTTCCGG CGATATTCCT CGCCCGCGCC CTCTATCGCA GGCGCTTTCC AGGTCGGCTG GCGCTGTTGC GTCTGTGCGC AATGACCTTG ATCCTCCCGG TGCTGGTCGC TGTTTTCGGC ATTCTTAGCG TCTATGGTCG CCAGGGCTGG CTGGCCTCGC TCTGGCAATC ACTTGGTCTG GAGTGGACCT TTTCGCCCTA CGGCCTGCAA GGTATTTTGC TGGCGCACGT ATTTTTTAAT CTGCCGATGG CGAGCCGTTT ATTACTTCAG GCACTGGAAA ACATTCCCGG CGAACAACGT CAACTTGCCG CACAACTTGG GATGCGCGGC TGGCATTTTT TCCGCTTTGT CGAATGGCCG TGGTTACGTC GACAAATCCC GCCAGTTGCT GCGCTTATCT TTATGCTCTG TTTCGCCAGC TTCGCCACCG TGCTATCGCT GGGGGGCGGT CCGCAGGCGA CCACTATCGA GCTGGCTATT TATCAGGCGC TGAGTTACGA CTACGATCCT GCCCGCGCGG CGATGCTTGC GCTGATCCAG ATGGTGTGTT GCCTTGGGCT GGTGCTGCTG AGTCAGCGAT TGAGTAAGGC CATTGCGCCA GGCACCACGC TGCTGCAAGG CTGGCGCGAC CCGGACGATC GTCTGCATAG CCGCATTTGC GACACTGTGT TAATTGTGCT GGCGCTGCTG CTATTGCTGC CACCGTTACT GGCGGTGATC GTCGATGGGG TAAATCGCCA GTTGCCGGAA GTGCTGGCAC AACCGGTGCT GTGGCAGGCG CTGTGGACCT CGTTGCGTAT TGCGCTGGCG GCAGGCGTAT TGTGCGTAGT GCTGACCATG ATGCTGCTAT GGAGCAGTCG CGAACTGCGG GCGCGGCAGA AAATGCTGGC CGGCCAGGCG CTGGAGATGA GCGGCATGTT GATCCTCGCC ATGCCGGGGA TTGTGCTGGC AACGGGCTTC TTTTTACTGC TCAACAACAC TATCGGCCTG CCACAATCTG CTGACGGCAT TGTGATTTTC ACCAATGCGT TAATGGCGAT CCCTTATGCG CTGAAAGTGC TGGAAAACCC GATGCGCGAT ATCACCGCTC GCTACAACAT GTTATGTCAG TCGCTGGGCA TTGAAGGCTG GTCACGCTTA AAAGTGGTGG AGCTGCGCGC CCTGAAACGT CCGCTGGCGC AGGCACTGGC CTTTGCTTGT GTGCTGTCGA TTGGTGATTT TGGCGTGGTG GCGTTGTTCG GTAACGATGA TTTCCGTACC CTGCCGTTTT ATCTCTACCA GCAAATTGGC TCCTATCGCA GCCAGGACGG CGCGGTCACC GCGTTAATTA TGCTGCTACT CTGTTTTCTG CTGTTTACTG TGATTGAAAA ACTACCGGGG CGAAATGTTA AAACTGACTG A
|
Protein sequence | MATRRQPLIP GWLIPGVSAA TLVVAVALAA FLALWWNAPQ GDWVAVWQDS YLWHVVRFSF WQAFLSALLS VVPAIFLARA LYRRRFPGRL ALLRLCAMTL ILPVLVAVFG ILSVYGRQGW LASLWQSLGL EWTFSPYGLQ GILLAHVFFN LPMASRLLLQ ALENIPGEQR QLAAQLGMRG WHFFRFVEWP WLRRQIPPVA ALIFMLCFAS FATVLSLGGG PQATTIELAI YQALSYDYDP ARAAMLALIQ MVCCLGLVLL SQRLSKAIAP GTTLLQGWRD PDDRLHSRIC DTVLIVLALL LLLPPLLAVI VDGVNRQLPE VLAQPVLWQA LWTSLRIALA AGVLCVVLTM MLLWSSRELR ARQKMLAGQA LEMSGMLILA MPGIVLATGF FLLLNNTIGL PQSADGIVIF TNALMAIPYA LKVLENPMRD ITARYNMLCQ SLGIEGWSRL KVVELRALKR PLAQALAFAC VLSIGDFGVV ALFGNDDFRT LPFYLYQQIG SYRSQDGAVT ALIMLLLCFL LFTVIEKLPG RNVKTD
|
| |