Gene Smed_3107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3107 
SymboltbpA 
ID5323986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3252828 
End bp3253856 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content61% 
IMG OID640792057 
Productthiamine transporter substrate binding subunit 
Protein accessionYP_001328768 
Protein GI150398301 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4143] ABC-type thiamine transport system, periplasmic component 
TIGRFAM ID[TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily
[TIGR01276] thiamine ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCATTT CACTCCACGG CAGAACACTT GCCGGGTTCA TGATTGCAGC GGCCACCGTC 
ACGGGCGTTT CCGCAAGTGC TTTCGCCGCG GAAAAGACGC TGACGGTCTA CACTTACGAA
AGCTTCATCA CAGAATGGGG GCCGGGCGCG AAGGTCTCCG AAGCCTTCGA GAAGGTCTGC
GACTGCAAGG TCGATTATGT GGGCGTCGCC GACGGGGTCG AACTGCTGAC GCGGCTGAAG
CTCGAAGGTG AGGGATCCAA AGCCGACGTC GTGCTCGGTC TCGACACCAA TCTCGTTGCC
GAGGCCAAGG CGACCGGCTT CTTCGTTCCG CACGGCGTCG ATACCACTTC CGTCGATGTT
CCTGGTGGTT TCACCGACGA CACCTTCATC CCCTATGACT ACGGCCATTT CGCCGTGGTG
TACGATACCG AGATGCTGAA GAGCCCGCCG AAGAGCCTCA GGGATCTGGT GGAAGGCGAT
CCAACGCAGA AGATCGTGAT CGAGGACCCG CGCACTTCCA CCCCCGGCCT CGGCCTGCTG
CTTTGGGTGA AATCGGTCTA TGGCGATCGG GCCGGCGAGG CCTGGGCAAA GCTCAAGGCG
CGTGTGCTGA CGGTCACGCC GGGCTGGTCG GAGGCCTATG GCCTCTTCAC CAAGGGTGAG
GCGCCGATGG TTCTGTCCTA CACCACCTCG CCCGCATATC ACATGGTCGC GGAAGATACC
GAGCGCTATC AGGCGGCCCC GTTCACCGAG GGCCACTACA TCCAGATCGA AGTCGCCGCA
TTGACGAAGA ACGCGAAGGA CCCGGAGCTC GCCCGGAAGT TTCTGGACTT CATGATCGGT
CCGGAATTCC AGTCGATCAT CCCGACGACC AATTGGATGA TGCCGGTGAC GGCCACAAAG
GAACCGCTGC CGGAGGCCTT CGGAAAGCTC GTCGAACCCC GGAAGACCTT TCTCATCCCC
TCCGAGGAGG TTGCGGCCAA CCGCAGGGCC TGGATCGATG AGTGGCTGAC GGCGATGAGC
AGGAACTGA
 
Protein sequence
MSISLHGRTL AGFMIAAATV TGVSASAFAA EKTLTVYTYE SFITEWGPGA KVSEAFEKVC 
DCKVDYVGVA DGVELLTRLK LEGEGSKADV VLGLDTNLVA EAKATGFFVP HGVDTTSVDV
PGGFTDDTFI PYDYGHFAVV YDTEMLKSPP KSLRDLVEGD PTQKIVIEDP RTSTPGLGLL
LWVKSVYGDR AGEAWAKLKA RVLTVTPGWS EAYGLFTKGE APMVLSYTTS PAYHMVAEDT
ERYQAAPFTE GHYIQIEVAA LTKNAKDPEL ARKFLDFMIG PEFQSIIPTT NWMMPVTATK
EPLPEAFGKL VEPRKTFLIP SEEVAANRRA WIDEWLTAMS RN