Gene SeD_A0112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0112 
SymboltbpA 
ID6874660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp120984 
End bp121967 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content55% 
IMG OID642783365 
Productthiamine transporter substrate binding subunit 
Protein accessionYP_002214059 
Protein GI198245931 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4143] ABC-type thiamine transport system, periplasmic component 
TIGRFAM ID[TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily
[TIGR01276] thiamine ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTAAAAA AATATCTTCC GCTTTTACTC CTGTGCGCGG CGCCTGCTTT CGCCAAACCC 
GTTCTCACCG TCTATACCTA CGACTCGTTC GCCGCCGACT GGGGGCCAGG CCCGGCGGTG
AAAAAAGCGT TTGAAGCCGA TTGCAACTGC GAGCTGAAAC TGGTGGCGCT GGAGGATGGC
GTTTCGCTGC TCAACCGCCT GCGGATGGAG GGGAAGAACA GCAAAGCCGA TGTGGTGTTG
GGGCTGGACA ATAATCTGCT GGAAGCGGCC ACGCAAACTA AACTCTTTGC CAAAAGCGGC
GTGGCGAATG AGGCGGTCAA GGTGCCCGGC GGCTGGAAAA ACGACACATT TGTGCCGTTC
GATTACGGCT ATTTCGCCTT TGTCTACGAT AAAAGCAAGC TGAAAAATCC GCCGAAAAGC
CTGAAAGAAC TGGTCGAGAG CGATCAAAAA TGGCGGGTAA TTTATCAGGA CCCGCGTACC
AGTACGCCAG GGCTGGGGCT GTTACTGTGG ATGCGCAAAG TCTATGGCGA TAACGCGCCG
CAGGCCTGGC AAAAACTGGC GGCCAAAACG GTGACGGTGA CGAAAGGCTG GAGCGAGGCC
TACGGCTTAT TTCTGAAAGG TGAAAGCGAT TTGGTGCTCA GTTACACCAC CTCTCCGGCG
TATCACATTA TTGAAGAGAA GAAGGACAAT TACGCCGCCG CGAACTTCAG CGAAGGCCAT
TACTTACAGG TAGAAGTCGC GGCGCGTACC GTCGCCAGTA AGCAGCCGGA ACTGGCGGAG
AAATTCCTCA AATTTATGGT TTCTCCGGCG TTTCAGAACG CCATACCCAC CGGCAACTGG
ATGTACCCGG TAGCGGACGT CGCCTTACCA GCAGGGTTTG AATCATTGGC CAAACCCGCC
ACAACGCTGG AATTCACGCC GCAACAAGTG GCAGCACAAC GCCAGGCATG GATTAGCGAA
TGGCAACGCG CCGTCAGCCG TTAA
 
Protein sequence
MLKKYLPLLL LCAAPAFAKP VLTVYTYDSF AADWGPGPAV KKAFEADCNC ELKLVALEDG 
VSLLNRLRME GKNSKADVVL GLDNNLLEAA TQTKLFAKSG VANEAVKVPG GWKNDTFVPF
DYGYFAFVYD KSKLKNPPKS LKELVESDQK WRVIYQDPRT STPGLGLLLW MRKVYGDNAP
QAWQKLAAKT VTVTKGWSEA YGLFLKGESD LVLSYTTSPA YHIIEEKKDN YAAANFSEGH
YLQVEVAART VASKQPELAE KFLKFMVSPA FQNAIPTGNW MYPVADVALP AGFESLAKPA
TTLEFTPQQV AAQRQAWISE WQRAVSR