Gene Information |
Locus tag | SNSL254_A0114 |
Symbol | thiP |
ID | 6484265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 125885 |
End bp | 127495 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642735556 |
Product | thiamine transporter membrane protein |
Protein accession | YP_002039338 |
Protein GI | 194442730 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1178] ABC-type Fe3+ transport system, permease component |
TIGRFAM ID | [TIGR01253] thiamine ABC transporter, permease protein |
|
|
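The coordinates, gene length, and protein length listed above agree with one another, and a short check makes the arithmetic explicit. This is a minimal sketch: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(125885, 127495)
print(length)                     # 1611
print(protein_length_aa(length))  # 536
```

Both values match the Gene Length and Protein Length rows of the record.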
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 0.576888 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACGC GCCGTCAGCC GTTAATTCCC GGCTGGCTGA TTCCGGGGCT GTGCGCCGCC GCGCTGATGA TAACCGTCTC GCTGGCGGCC TTTCTGGCGC TTTGGCTGAA TGCGCCGTCG GGCGCGTGGT CGACAATCTG GCGGGATAGC TACCTGTGGC ATGTGGTGCG CTTCTCATTC TGGCAGGCGT TTCTGTCCGC AGTGCTGTCT GTGGTCCCGG CGGTTTTTCT CGCCCGCGCG CTTTATCGAC GCCGTTTTCC TGGACGTCTG GCGCTGCTGC GTCTGTGCGC CATGACGCTG ATTCTGCCGG TACTGGTTGC CGTGTTCGGC ATTCTTAGCG TGTATGGCCG TCAGGGCTGG CTGGCCTCGC TGTGGCAGAT GCTGGGGCTT CAGTGGACAT TCTCCCCCTA CGGCTTGCAG GGCATTTTAC TGGCGCACGT CTTTTTTAAC CTGCCAATGG CGAGCCGTTT GTTGCTGCAA TCTTTGGAAA GCATTCCCGG CGAGCAGCGC CAGCTCGCCG CCCAGCTCGG TATGCGCGGC TGGCATTTTT TCCGTTTTGT CGAATGGCCG TGGCTGCGCC GCCAAATTCC GCCCGTCGCG GCGCTGATTT TTATGCTCTG CTTCGCCAGT TTCGCAACGG TTCTGTCGCT CGGCGGCGGC CCGCAGGCCA CCACTATCGA GCTGGCTATC TTTCAGGCGC TCAGTTATGA CTACGATCCC GCCCGCGCGG CGATGCTGGC ATTAATCCAG ATGGTCTGCT GCCTGGCGCT GGTACTGCTA AGTCAACGGT TGAGCAAAGC GATTGCGCCG GGGATGACGC TGACGCAGGG CTGGCGCGAT CCTGACGATC GTCTCCACAG CCGTCTGACG GACGCCTTAT TAATCGTGCT GGCGCTGCTG CTGCTGCTTC CGCCGCTGGT CGCCGTGGTG GTCGATGGCG TCAACCGCAG CCTGCCGGAG GTGCTGGCGC AACCCATTCT GTGGCAGGCT GTCTGGACAT CGCTACGCAT TGCGCTGGCG GCGGGCGTTC TGTGTGTGGT GCTGACCATG ATGCTACTGT GGAGTAGCCG CGAGCTACGC CAACGCCAGC AACTATTCGC CGGACAAACG CTGGAACTCA GCGGGATGTT GATCCTCGCG ATGCCGGGGA TCGTGCTGGC GACAGGCTTC TTTTTACTGC TTAATAACAG CGTTGGCTTA CCGGAATCCG CCGACGGCAT CGTGATTTTC ACCAATGCGC TGATGGCAAT CCCCTACGCG CTAAAAGTCC TGGAAAACCC AATGCGCGAT ATTACCGCCC GTTATGGAAT GCTGTGCCAG TCATTAGGAA TTGAAGGCTG GTCGCGGTTA AAGGTCGTGG AACTGCGGGC GCTGAAACGT CCGCTGGCGC AGGCGCTGGC GTTTGCCTGC GTACTTTCTA TCGGTGATTT TGGCGTCGTC GCGCTTTTCG GCAATGACAA TTTCCGCACG CTGCCGTTTT ATCTGTATCA GCAGATCGGC TCCTACCGCA GCCAGGACGG CGCGGTGACC GCGCTGATAC TGCTGCTGCT CTGCTTTACG TTATTTACCC TCATAGAGAA ACTACCGGGC CGACATGCTA AAACTGATTG A
|
Protein sequence | MATRRQPLIP GWLIPGLCAA ALMITVSLAA FLALWLNAPS GAWSTIWRDS YLWHVVRFSF WQAFLSAVLS VVPAVFLARA LYRRRFPGRL ALLRLCAMTL ILPVLVAVFG ILSVYGRQGW LASLWQMLGL QWTFSPYGLQ GILLAHVFFN LPMASRLLLQ SLESIPGEQR QLAAQLGMRG WHFFRFVEWP WLRRQIPPVA ALIFMLCFAS FATVLSLGGG PQATTIELAI FQALSYDYDP ARAAMLALIQ MVCCLALVLL SQRLSKAIAP GMTLTQGWRD PDDRLHSRLT DALLIVLALL LLLPPLVAVV VDGVNRSLPE VLAQPILWQA VWTSLRIALA AGVLCVVLTM MLLWSSRELR QRQQLFAGQT LELSGMLILA MPGIVLATGF FLLLNNSVGL PESADGIVIF TNALMAIPYA LKVLENPMRD ITARYGMLCQ SLGIEGWSRL KVVELRALKR PLAQALAFAC VLSIGDFGVV ALFGNDNFRT LPFYLYQQIG SYRSQDGAVT ALILLLLCFT LFTLIEKLPG RHAKTD
|
| |
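The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC fraction; the helper names are hypothetical, and the example input is only the first two display blocks of the gene sequence, not the full record.

```python
def strip_blocks(seq: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two display blocks of the gene sequence, used as a small example.
head = strip_blocks("ATGGCAACGC GCCGTCAGCC")
print(len(head))          # 20
print(gc_fraction(head))  # 0.7
```

Running `gc_fraction` on the full joined gene sequence gives a value that can be compared with the GC content row of the record.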