Gene SNSL254_A0114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0114 
SymbolthiP 
ID6484265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp125885 
End bp127495 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content60% 
IMG OID642735556 
Productthiamine transporter membrane protein 
Protein accessionYP_002039338 
Protein GI194442730 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1178] ABC-type Fe3+ transport system, permease component 
TIGRFAM ID[TIGR01253] thiamine ABC transporter, permease protein 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value0.576888 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACGC GCCGTCAGCC GTTAATTCCC GGCTGGCTGA TTCCGGGGCT GTGCGCCGCC 
GCGCTGATGA TAACCGTCTC GCTGGCGGCC TTTCTGGCGC TTTGGCTGAA TGCGCCGTCG
GGCGCGTGGT CGACAATCTG GCGGGATAGC TACCTGTGGC ATGTGGTGCG CTTCTCATTC
TGGCAGGCGT TTCTGTCCGC AGTGCTGTCT GTGGTCCCGG CGGTTTTTCT CGCCCGCGCG
CTTTATCGAC GCCGTTTTCC TGGACGTCTG GCGCTGCTGC GTCTGTGCGC CATGACGCTG
ATTCTGCCGG TACTGGTTGC CGTGTTCGGC ATTCTTAGCG TGTATGGCCG TCAGGGCTGG
CTGGCCTCGC TGTGGCAGAT GCTGGGGCTT CAGTGGACAT TCTCCCCCTA CGGCTTGCAG
GGCATTTTAC TGGCGCACGT CTTTTTTAAC CTGCCAATGG CGAGCCGTTT GTTGCTGCAA
TCTTTGGAAA GCATTCCCGG CGAGCAGCGC CAGCTCGCCG CCCAGCTCGG TATGCGCGGC
TGGCATTTTT TCCGTTTTGT CGAATGGCCG TGGCTGCGCC GCCAAATTCC GCCCGTCGCG
GCGCTGATTT TTATGCTCTG CTTCGCCAGT TTCGCAACGG TTCTGTCGCT CGGCGGCGGC
CCGCAGGCCA CCACTATCGA GCTGGCTATC TTTCAGGCGC TCAGTTATGA CTACGATCCC
GCCCGCGCGG CGATGCTGGC ATTAATCCAG ATGGTCTGCT GCCTGGCGCT GGTACTGCTA
AGTCAACGGT TGAGCAAAGC GATTGCGCCG GGGATGACGC TGACGCAGGG CTGGCGCGAT
CCTGACGATC GTCTCCACAG CCGTCTGACG GACGCCTTAT TAATCGTGCT GGCGCTGCTG
CTGCTGCTTC CGCCGCTGGT CGCCGTGGTG GTCGATGGCG TCAACCGCAG CCTGCCGGAG
GTGCTGGCGC AACCCATTCT GTGGCAGGCT GTCTGGACAT CGCTACGCAT TGCGCTGGCG
GCGGGCGTTC TGTGTGTGGT GCTGACCATG ATGCTACTGT GGAGTAGCCG CGAGCTACGC
CAACGCCAGC AACTATTCGC CGGACAAACG CTGGAACTCA GCGGGATGTT GATCCTCGCG
ATGCCGGGGA TCGTGCTGGC GACAGGCTTC TTTTTACTGC TTAATAACAG CGTTGGCTTA
CCGGAATCCG CCGACGGCAT CGTGATTTTC ACCAATGCGC TGATGGCAAT CCCCTACGCG
CTAAAAGTCC TGGAAAACCC AATGCGCGAT ATTACCGCCC GTTATGGAAT GCTGTGCCAG
TCATTAGGAA TTGAAGGCTG GTCGCGGTTA AAGGTCGTGG AACTGCGGGC GCTGAAACGT
CCGCTGGCGC AGGCGCTGGC GTTTGCCTGC GTACTTTCTA TCGGTGATTT TGGCGTCGTC
GCGCTTTTCG GCAATGACAA TTTCCGCACG CTGCCGTTTT ATCTGTATCA GCAGATCGGC
TCCTACCGCA GCCAGGACGG CGCGGTGACC GCGCTGATAC TGCTGCTGCT CTGCTTTACG
TTATTTACCC TCATAGAGAA ACTACCGGGC CGACATGCTA AAACTGATTG A
 
Protein sequence
MATRRQPLIP GWLIPGLCAA ALMITVSLAA FLALWLNAPS GAWSTIWRDS YLWHVVRFSF 
WQAFLSAVLS VVPAVFLARA LYRRRFPGRL ALLRLCAMTL ILPVLVAVFG ILSVYGRQGW
LASLWQMLGL QWTFSPYGLQ GILLAHVFFN LPMASRLLLQ SLESIPGEQR QLAAQLGMRG
WHFFRFVEWP WLRRQIPPVA ALIFMLCFAS FATVLSLGGG PQATTIELAI FQALSYDYDP
ARAAMLALIQ MVCCLALVLL SQRLSKAIAP GMTLTQGWRD PDDRLHSRLT DALLIVLALL
LLLPPLVAVV VDGVNRSLPE VLAQPILWQA VWTSLRIALA AGVLCVVLTM MLLWSSRELR
QRQQLFAGQT LELSGMLILA MPGIVLATGF FLLLNNSVGL PESADGIVIF TNALMAIPYA
LKVLENPMRD ITARYGMLCQ SLGIEGWSRL KVVELRALKR PLAQALAFAC VLSIGDFGVV
ALFGNDNFRT LPFYLYQQIG SYRSQDGAVT ALILLLLCFT LFTLIEKLPG RHAKTD