Gene RSP_1391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_1391 
SymboltbpA 
ID3720797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007493 
Strand
Start bp3168984 
End bp3170012 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content69% 
IMG OID640072617 
Productthiamine transporter substrate binding subunit 
Protein accessionYP_354471 
Protein GI77464967 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4143] ABC-type thiamine transport system, periplasmic component 
TIGRFAM ID[TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily
[TIGR01276] thiamine ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.994265 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCCTC ATCCCGTTCC GTTCCGCGTC GAAGAGGAGG ATGAGATGAG ACTTCCGATC 
GTTGCTTCCT GTGTTGCGCT GGCTGCGGGA ACGGCCGCGG CCGAGACGCC CGTGCTGACG
GTGCTGACCT ACGACAGCTT CACCTCGGAA TGGGGTCCTG GCCCCGCGGT CGAGAAAGCC
TTCGAGGAGA CCTGCGCCTG CGACCTGCGC TTCGTCGCCG CGGGCGACGG GGCGGCGCTT
CTGGCGCGGC TCCAGCTCGA GGGCGCGCGG AGCGAGGCCG ATGTGGTTCT GGGGCTCGAC
ACGAACCTCA CGGCGGCGGC GGCCGCGACC GGCCTCTTCG CTCCGCACGG CGTGAGCACG
CCGCTCGATC TGCCGGTGGC CTGGGAGGAT CCGGTGTTCC TGCCCTTCGA CTGGGGCTGG
TTCGCCTTCG TCCATGACCG CAGGATGGAA GATGTGCCGG CCTCGTTCGA GGAGCTGGCG
GCGTCGGACT CCAAGATCGT CATCCAGGAT CCGCGCAGCT CGACCCCGGG TCTGGGCCTT
CTGATGTGGG TGGAGGCGGC CTACGGCGAC CGCGCGCCCG AGATCTGGGA GGGGCTCGCC
GACAATATCG TGACGGTGAC GCCCGGCTGG TCCGAGGCCT ACGGCCTCTT CATGGAGGGC
GAGGCCGACA TGGTGCTGAG CTACACCACC TCGCCCGCCT ACCATCTGAT CGCTGAGGAG
GACGATACCA AGACTGCGGC CGCCTTCCGC GAGGGGCACT ATCTGCAGGT CGAGGTGGCG
GGCAAGCTGG CCGCGACCGA CCAGCCGGAG CTGGCCGACC GCTTCATGGC CTTCCTGCTG
GAGGAGCCGG TGCAGTCGGT GCTGCCCACG ACGAACTGGA TGTATCCGGC GAAGCTGCCC
GCCGCGGGCC TGCCCGAGGG GTTCGAGACG CTGGTGCAGC CCGAGACCTC GCTTCTGCTG
TCGGCTGACG AGGCGCTCGC CCTCCGGCCC GAGGCTCTGG CCGAATGGCA AGACGCGCTG
GCGCGCTGA
 
Protein sequence
MAPHPVPFRV EEEDEMRLPI VASCVALAAG TAAAETPVLT VLTYDSFTSE WGPGPAVEKA 
FEETCACDLR FVAAGDGAAL LARLQLEGAR SEADVVLGLD TNLTAAAAAT GLFAPHGVST
PLDLPVAWED PVFLPFDWGW FAFVHDRRME DVPASFEELA ASDSKIVIQD PRSSTPGLGL
LMWVEAAYGD RAPEIWEGLA DNIVTVTPGW SEAYGLFMEG EADMVLSYTT SPAYHLIAEE
DDTKTAAAFR EGHYLQVEVA GKLAATDQPE LADRFMAFLL EEPVQSVLPT TNWMYPAKLP
AAGLPEGFET LVQPETSLLL SADEALALRP EALAEWQDAL AR