Gene TM1040_2818 details


Gene Information

Locus tag: TM1040_2818
Symbol: tbpA
ID: 4076637
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2981594
End bp: 2982574
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 58%
IMG OID: 638008144
Product: thiamine transporter substrate binding subunit
Protein accession: YP_614812
Protein GI: 99082658
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4143] ABC-type thiamine transport system, periplasmic component
TIGRFAM ID: [TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily
            [TIGR01276] thiamine ABC transporter, periplasmic binding protein
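
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly), and that the protein length excludes the terminal stop codon. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 2981594
end_bp = 2982574
gene_length_bp = 981
protein_length_aa = 326

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons; the last codon is the
# stop codon and contributes no amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span_bp, remainder, expected_protein_length)  # 981 0 326
```

Under those assumptions the three fields agree: the span is 981 bp, which is 327 codons, giving 326 residues once the stop codon is dropped.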


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0458711
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.879524
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATCCC TCATCTTTGC GAGCGCCACT TGTTTAGCTA CTGCAGTTGC CGCTGCGGAT 
ACGCCGGAAT TGGTGGTCTA TACTTACGAC AGCTTTGTCT CGGAATGGGG ACCCGGACCG
GCCGTCGAAG AGGCTTTTGA AGCGGTCTGC GGATGCGATC TGAAATTCGT CGGCGCGGGC
GATGGCGCTG CGCTGCTCGC GCGGATCAAA CTGGAAGGCG CTCGGTCTGA CGCGGATGTG
GTCTTAGGGC TCGACACCAA CCTTACCGCG GCAGCCAAGG AAACCGGATT GTTTGCGCCA
GTGTCGGTTG AGGCCGATTA CGCACTGCCA ATCACCTGGA GCGACACGCA TTTTGCCCCC
TATGACTGGG GATATTTTGC ATTTGTTCAC AACGCAGATG TTCCGGCACC TTCGAACTTT
GAAGCCTTGG CTGACAGTGA TCTGAAAATC GTGATCCAGG ATCCAAGGTC CTCGACGCCG
GGACTGGGGC TCTTGATGTG GGTAAAGGCC GCATATGGGG AGGACGCGCC TGCCCTCTGG
GAAGGTCTCA GCGACAATAT CGTCACCGTC ACCAAAGGCT GGTCCGAAGC ATACGGACTG
TTCCTCGAAG GCGAGGCAGA TATGGTGCTC TCCTACACCA CGTCGCCCGC CTATCATCTG
ATCGCCGAAG AGGACGACAG CAAGTCGGCT GCACTATTCG ATGAAGGTCA CTACATGCAG
GTCGAGGTCG CGGGCAAGCT CGCGGCGAGC GATGAGAGCG CATTGGCGGA TCAGTTCCTC
GAGTTCATGG TCTCTGATGC CTTCCAGTCG ATCATCCCAA CCACAAACTG GATGTACCCC
GCCGTCACGC CTGATTCAGG CTTGCCACAG GGGTTTGAAA CCCTGATCAG CCCGGAGAAA
TCACTGCTTC TGCCCGAGGA CGAAGCCGCT GCGCTGCGCG CCGAGGCGTT GGAAGAATGG
CGCGCAGCGC TCAGCCGATA A
 
Protein sequence
MKSLIFASAT CLATAVAAAD TPELVVYTYD SFVSEWGPGP AVEEAFEAVC GCDLKFVGAG 
DGAALLARIK LEGARSDADV VLGLDTNLTA AAKETGLFAP VSVEADYALP ITWSDTHFAP
YDWGYFAFVH NADVPAPSNF EALADSDLKI VIQDPRSSTP GLGLLMWVKA AYGEDAPALW
EGLSDNIVTV TKGWSEAYGL FLEGEADMVL SYTTSPAYHL IAEEDDSKSA ALFDEGHYMQ
VEVAGKLAAS DESALADQFL EFMVSDAFQS IIPTTNWMYP AVTPDSGLPQ GFETLISPEK
SLLLPEDEAA ALRAEALEEW RAALSR
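
The gene and protein sequences above are related by translation table 11. As a rough sketch of how the two blocks can be compared, the Python below strips the 10-character grouping, computes a GC fraction, and translates codons until the first stop. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is the standard one, which is assumed here to be sufficient because table 11 uses the same amino acid assignments and differs only in its permitted start codons.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Assumption: table 11 shares these amino acid assignments with the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq_block.split()).upper()


def gc_fraction(seq_block: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq_block)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq_block: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq_block)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence listed above.
first_bases = "ATGAAATCCC TCATCTTTGC GAGCGCCACT"
last_bases = "CGCGCAGCGC TCAGCCGATA A"

print(translate(first_bases))  # MKSLIFASAT
print(translate(last_bases))   # RAALSR
```

The two fragments reproduce the first ten and last six residues of the listed protein sequence. Passing the full gene sequence block to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.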