Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3590 |
Symbol | thiP |
ID | 6067549 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3925060 |
End bp | 3926670 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641603008 |
Product | thiamine transporter membrane protein |
Protein accession | YP_001726531 |
Protein GI | 170021577 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1178] ABC-type Fe3+ transport system, permease component |
TIGRFAM ID | [TIGR01253] thiamine ABC transporter, permease protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.838581 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000463415 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAACGC GCCGTCAGCC GTTAATTCCC GGCTGGTTAA TTCCAGGTGT AAGCGCCGCC ACGCTGGTGG TAGCGGTTGC GCTCGCGGCG TTTCTCGCCC TGTGGTGGAA CGCGCCGCAG GGTAACTGGG TGGCAATCTG GCAGGACAGC TACCTGTGGC ATGTGGTGCG CTTCTCCTTC TGGCAGGCGT TTCTCTCGGC GCTGCTCTCT GTCGTACCCG CGATATTCCT CGCCCGCGCA CTCTATCGCA GGCGCTTTCC GGGTCGGCTG GCGCTGTTGC GTCTGTGCGC AATGACCTTG ATCCTCCCGG TGCTGGTTGC TGTTTTCGGC ATTCTTAGCG TCTATGGTCG TCAGGGCTGG CTGGCATCGC TCTGCCAATC GCTCGGTCTG GAGTGGACCT TTTCGCCCTA CGGCCTGCAA GGTATTTTGC TGGCGCACGT ATTTTTTAAT CTACCGATGG CGAGCCGCTT ATTACTCCAG GCACTGGAAA ACATTCCCGG CGAGCAACGT CAGCTTGCCG CCCAGCTTGG GATGCGCGGC TGGCATTTTT TCCGCTTCAT CGAATGGCCG TGGCTACGGC GACAAATCCC GCCGGTTGCC GCGCTTATCT TTATGCTCTG TTTCGCCAGC TTCGCCACCG TGCTGTCGCT GGGCGGCGGT CCGCAGGCAA CCACTATCGA GCTGGCAATC TATCAGGCGC TGAGTTACGA CTACGATCCT GCCCGCGCGG CGATGCTGGC GCTGATCCAG ATGGTGTGTT GCCTCGGGTT GGTGCTGCTG AGTCAGCGAT TGAGTAAGGC CATTGCGCCA GGCACCACGC TGCTGCAAGG CTGGCGCGAC CCGGACGATC GTCTGCATAG CCGCATTTGC GACACGGTGT TAATTGTGCT GGCGCTGCTG CTGTTGCTGC CACCGTTGCT GGCGGTGATC GTCGATGGGC TAAATCGCCA GTTGCCGGAA GTGCTGGCAC AACCGGTGCT GTGGCAGGCG CTGTGGACCT CGTTGCGTAT TGCGCTGGCG GCAGGTGTAT TGTGCGTAGT GCTGACCATG ATGCTGCTAT GGAGCAGTCG CGAACTTCGG GCGCGGCAGA AAATGCTGGC GGGTCAGGCG CTGGAGATGA GCGGCATGTT GATTCTCGCC ATGCCGGGGA TTGTGCTGGC TACCGGCTTC TTTTTACTGC TCAACAACAC CATTGGCCTG CCGCAATCTG CTGACGGCAT TGTGATTTTC ACCAATGCGT TAATGGCGAT CCCTTATGCG CTGAAAGTAC TGGAAAACCC GATGCGCGAT ATCACCGCCC GCTACAGTAT GTTGTGTCAG TCGCTGGGCA TTGAAGGCTG GTCGCGCTTA AAAGTGGTCG AGCTGCGCGC CCTGAAACGT CCACTGGCGC AGGCGCTGGC TTTTGCCTGC GTGCTGTCGA TTGGTGATTT TGGCGTAGTG GCGTTGTTCG GTAACGATGA TTTCCGTACC CTGCCGTTTT ATCTCTACCA GCAAATTGGC TCCTATCGCA GCCAGGACGG TGCGGTCACC GCGTTAATTC TGCTACTACT CTGTTTTCTG CTGTTTACCG TGATTGAAAA ACTACCGGGG CGAAATGTTA AAACTGACTG A
|
Protein sequence | MATRRQPLIP GWLIPGVSAA TLVVAVALAA FLALWWNAPQ GNWVAIWQDS YLWHVVRFSF WQAFLSALLS VVPAIFLARA LYRRRFPGRL ALLRLCAMTL ILPVLVAVFG ILSVYGRQGW LASLCQSLGL EWTFSPYGLQ GILLAHVFFN LPMASRLLLQ ALENIPGEQR QLAAQLGMRG WHFFRFIEWP WLRRQIPPVA ALIFMLCFAS FATVLSLGGG PQATTIELAI YQALSYDYDP ARAAMLALIQ MVCCLGLVLL SQRLSKAIAP GTTLLQGWRD PDDRLHSRIC DTVLIVLALL LLLPPLLAVI VDGLNRQLPE VLAQPVLWQA LWTSLRIALA AGVLCVVLTM MLLWSSRELR ARQKMLAGQA LEMSGMLILA MPGIVLATGF FLLLNNTIGL PQSADGIVIF TNALMAIPYA LKVLENPMRD ITARYSMLCQ SLGIEGWSRL KVVELRALKR PLAQALAFAC VLSIGDFGVV ALFGNDDFRT LPFYLYQQIG SYRSQDGAVT ALILLLLCFL LFTVIEKLPG RNVKTD
|
| |