Gene SeHA_C0114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C0114 
SymboltbpA 
ID6492061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp120886 
End bp121869 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content55% 
IMG OID642740402 
Productthiamine transporter substrate binding subunit 
Protein accessionYP_002044076 
Protein GI194449014 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4143] ABC-type thiamine transport system, periplasmic component 
TIGRFAM ID[TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily
[TIGR01276] thiamine ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.51564 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value0.157223 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTAAAAA AATATCTTCC ACTTTTACTC CTGTGCGCGG CGCCTGCTTT CGCCAAACCC 
GTTCTCACCG TCTATACCTA CGACTCGTTC GCCGCCGACT GGGGGCCAGG CCCGGCGGTG
AAAAAAGCGT TTGAAGCCGA TTGCAACTGC GAGCTGAAAC TGGTGGCGCT GGAGGATGGC
GTTTCGCTGC TCAACCGCCT GCGGATGGAG GGGAAGAACA GCAAAGCCGA TGTGGTGTTG
GGGCTGGACA ACAATCTGCT GGAAGCGGCC ACGCAAACTA AACTCTTTGC CAAAAGCGGC
GTGGCGAATG AAGCGGTCAA GGTGCCCGGC GGCTGGAAAA ACGACACATT TGTGCCATTC
GATTACGGCT ATTTCGCCTT TGTCTACGAT AAAAGCAAGC TGAAAAATCC GCCGAAAAGC
CTGAAAGAAC TGGTCGAGAG CGATCAAAAA TGGCGGGTGA TTTATCAGGA CCCACGTACC
AGTACGCCAG GGCTGGGGCT GTTACTGTGG ATGCGCAAAG TCTATGGCGA TAACGCGCCG
CAGGCCTGGC AAAAACTGGC GGCCAAAACG GTGACGGTGA CGAAAGGCTG GAGCGAGGCC
TACGGTTTAT TTCTGAAAGG TGAAAGCGAT TTGGTGCTCA GTTACACCAC CTCTCCGGCG
TATCACATTA TTGAAGAGAA GAAGGACAAT TACGCCGCCG CGAACTTCAG CGAAGGCCAT
TACTTACAGG TAGAAGTCGC GGCGCGTACC GTCGCCAGTA AGCAGCCGGA ACTGGCGGAG
AAATTCCTCA AATTTATGGT TTCTCCGGCG TTTCAGAACG CCATACCCAC CGGCAACTGG
ATGTACCCGG TAGCGGACGT CGCCTTACCC GCAGGGTTTG AATCATTGGC CAAACCCGCC
ACAACGCTGG AATTCACGCC GCAACAAGTG GCAGCACAAC GCCAGGCATG GATTAGCGAA
TGGCAACGCG CCGTCAGCCG TTAA
 
Protein sequence
MLKKYLPLLL LCAAPAFAKP VLTVYTYDSF AADWGPGPAV KKAFEADCNC ELKLVALEDG 
VSLLNRLRME GKNSKADVVL GLDNNLLEAA TQTKLFAKSG VANEAVKVPG GWKNDTFVPF
DYGYFAFVYD KSKLKNPPKS LKELVESDQK WRVIYQDPRT STPGLGLLLW MRKVYGDNAP
QAWQKLAAKT VTVTKGWSEA YGLFLKGESD LVLSYTTSPA YHIIEEKKDN YAAANFSEGH
YLQVEVAART VASKQPELAE KFLKFMVSPA FQNAIPTGNW MYPVADVALP AGFESLAKPA
TTLEFTPQQV AAQRQAWISE WQRAVSR