Gene SNSL254_A0115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0115 
SymboltbpA 
ID6482863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp127471 
End bp128454 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content55% 
IMG OID642735557 
Productthiamine transporter substrate binding subunit 
Protein accessionYP_002039339 
Protein GI194444243 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4143] ABC-type thiamine transport system, periplasmic component 
TIGRFAM ID[TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily
[TIGR01276] thiamine ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value0.390705 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTAAAAA AATATCTTCC ACTTTTACTC CTGTGCGCGG CGCCTGCTTT CGCCAAACCC 
GTTCTCACCG TCTATACCTA CGACTCGTTC GCCGCCGACT GGGGGCCAGG CCCGGCGGTG
AAAAAAGCGT TTGAAGCCGA TTGCAACTGC GAGCTGAAAC TGGTGGCGCT GGAGGATGGC
GTTTCACTGC TCAACCGCCT GCGGATGGAG GGGAAGAACA GCAAAGCCGA TGTGGTGTTG
GGGCTGGACA ACAATCTGCT GGAAGCGGCC ACGCAAACTA AACTTTTTGC CAAAAGCGGC
GTGGCGAATG AAGCGGTCAA GGTGCCCGGC GGCTGGAAAA ACGACACATT TGTGCCGTTC
GATTACGGCT ATTTCGCCTT TGTCTACGAT AAAAGCAAGC TGAAAAATCC GCCGAAAAGC
CTGAAAGAAC TGGTCGAGAG CGATCAAAAA TGGCGGGTGA TTTATCAGGA CCCGCGTACC
AGTACGCCAG GGCTGGGGCT GTTACTGTGG ATGCGCAAAG TCTATGGCGA TAACGCGCCG
CAGGCCTGGC AAAAACTGGC GGCCAAAACG GTGACGGTGA CGAAAGGCTG GAGCGAGGCC
TACGGTTTAT TTCTGAAAGG TGAAAGCGAT TTGGTGCTCA GTTACACCAC CTCTCCGGCG
TATCACATTA TTGAAGAGAA GAAGGACAAT TACGCCGCCG CGAACTTCAG CGAAGGCCAT
TACTTACAGG TAGAAGTCGC GGCGCGTACC GTCGCCAGTA AGCAGCCGGA ACTGGCGGAG
AAATTCCTCA AATTTATGGT TTCTCCGGCG TTTCAGAACG CCATACCCAC CGGCAACTGG
ATGTACCCGG TAGCGGACGT CGCCTTACCC GCAGGGTTTG AATCATTGGC CAAACCCGCC
ACAACGCTGG AATTCACGCC GCAACAAGTG GCAGCACAAC GCCAGGCATG GATTAGCGAA
TGGCAACGCG CCGTCAGCCG TTAA
 
Protein sequence
MLKKYLPLLL LCAAPAFAKP VLTVYTYDSF AADWGPGPAV KKAFEADCNC ELKLVALEDG 
VSLLNRLRME GKNSKADVVL GLDNNLLEAA TQTKLFAKSG VANEAVKVPG GWKNDTFVPF
DYGYFAFVYD KSKLKNPPKS LKELVESDQK WRVIYQDPRT STPGLGLLLW MRKVYGDNAP
QAWQKLAAKT VTVTKGWSEA YGLFLKGESD LVLSYTTSPA YHIIEEKKDN YAAANFSEGH
YLQVEVAART VASKQPELAE KFLKFMVSPA FQNAIPTGNW MYPVADVALP AGFESLAKPA
TTLEFTPQQV AAQRQAWISE WQRAVSR