Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C0113 |
Symbol | thiP |
ID | 6490442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 119300 |
End bp | 120910 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642740401 |
Product | thiamine transporter membrane protein |
Protein accession | YP_002044075 |
Protein GI | 194449238 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1178] ABC-type Fe3+ transport system, permease component |
TIGRFAM ID | [TIGR01253] thiamine ABC transporter, permease protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 0.195449 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACGC GCCGTCAGCC GTTAATTCCC GGCTGGCTGA TTCCGGGGCT GTGCGCCGCC GCGCTGATGA TAACCGTCTC GCTGGCGGCC TTTCTGGCGC TTTGGCTGAA TGCGCCGTCG GGCGCGTGGT CGACAATCTG GCGGGATAGC TACCTGTGGC ATGTGGTGCG CTTCTCATTC TGGCAGGCGT TTCTGTCCGC AGTGCTGTCT GTGGTCCCGG CGGTTTTTCT CGCCCGCGCG CTTTATCGAC GCCGTTTTCC TGGACGTCTG GCGCTGCTGC GTCTGTGCGC CATGACGCTG ATTCTGCCGG TACTGGTTGC CGTGTTCGGC ATTCTTAGCG TGTATGGCCG TCAGGGCTGG CTGGCCTCGC TGTGGCAGAT GCTGGGGCTT CAGTGGACAT TCTCCCCCTA CGGCTTGCAG GGCATTTTAC TGGCGCACGT CTTTTTTAAC CTGCCAATGG CGAGCCGTTT GTTGCTGCAA TCTTTGGAAA GCATTCCCGG CGAGCAGCGC CAGCTCGCCG CCCAGCTCGG TATGCGCGGC TGGCATTTTT TCCGTTTTGT CGAATGGCCG TGGCTGCGCC GCCAAATTCC GCCTGTCGCG GCGCTGATTT TTATGCTCTG CTTCGCCAGT TTCGCAACGG TTCTGTCGCT CGGCGGCGGC CCGCAGGCCA CCACTATCGA ACTGGCTATC TTTCAGGCGC TCAGTTATGA CTACGATCCC GCCCGCGCGG CGATGCTGGC ATTAATCCAG ATGGTCTGCT GCCTGGCGCT GGTACTGCTA AGCCAACGGT TGAGCAAAGC GATTGCGCCG GGGATGACGC TGACGCAGGG CTGGCGCGAT CCTGACGATC GTCTCCACAG CCGTCTGACG GACGCCTTAT TAATCGTGCT GGCGCTGCTG CTGCTGCTTC CGCCGCTGGT CGCCGTGGTG GTCGATGGCG TCAACCGCAG CCTGCCGGAG GTGCTGGCGC AACCCATTCT GTGGCAGGCT GTCTGGACAT CGCTACGCAT TGCGCTGGCG GCGGGCGTTC TGTGTGTGGT GCTGACCATG ATGCTGCTGT GGAGTAGCCG CGAGCTACGC CAACGCCAGC AACTATTCGC CGGACAAACG CTGGAACTCA GCGGAATGTT GATCCTCGCG ATGCCGGGGA TCGTGCTGGC GACAGGCTTC TTTTTACTGC TCAATAACAG CGTTGGCTTA CCGGAATCCG CCGACGGCAT CGTGATTTTC ACCAATGCGC TGATGGCAAT CCCCTACGCG CTAAAAGTCC TGGAAAACCC AATGCGCGAT ATTACCGCCC GTTATGGAAT GCTGTGCCAG TCATTAGGAA TTGAAGGCTG GTCGCGGTTA AAGATCGTGG AACTGCGGGC GCTGAAACGT CCGCTGGCGC AGGCGCTGGC GTTTGCCTGC GTACTTTCTA TCGGTGATTT TGGCGTCGTC GCGCTTTTCG GCAATGACAA TTTCCGCACG CTGCCGTTTT ATCTGTATCA GCAGATCGGC TCCTACCGCA GCCAGGACGG CGCAGTGACC GCGCTGATAC TGCTGCTGCT CTGCTTTACG TTATTTACCC TCATAGAGAA ACTACCGGGC CGACATGCTA AAACTGATTG A
|
Protein sequence | MATRRQPLIP GWLIPGLCAA ALMITVSLAA FLALWLNAPS GAWSTIWRDS YLWHVVRFSF WQAFLSAVLS VVPAVFLARA LYRRRFPGRL ALLRLCAMTL ILPVLVAVFG ILSVYGRQGW LASLWQMLGL QWTFSPYGLQ GILLAHVFFN LPMASRLLLQ SLESIPGEQR QLAAQLGMRG WHFFRFVEWP WLRRQIPPVA ALIFMLCFAS FATVLSLGGG PQATTIELAI FQALSYDYDP ARAAMLALIQ MVCCLALVLL SQRLSKAIAP GMTLTQGWRD PDDRLHSRLT DALLIVLALL LLLPPLVAVV VDGVNRSLPE VLAQPILWQA VWTSLRIALA AGVLCVVLTM MLLWSSRELR QRQQLFAGQT LELSGMLILA MPGIVLATGF FLLLNNSVGL PESADGIVIF TNALMAIPYA LKVLENPMRD ITARYGMLCQ SLGIEGWSRL KIVELRALKR PLAQALAFAC VLSIGDFGVV ALFGNDNFRT LPFYLYQQIG SYRSQDGAVT ALILLLLCFT LFTLIEKLPG RHAKTD
|
| |