Gene Rsph17029_1546 details

Gene Information

Locus tag: Rsph17029_1546
Symbol:
ID: 4897004
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 1624722
End bp: 1626866
Gene Length: 2145 bp
Protein Length: 714 aa
Translation table: 11
GC content: 69%
IMG OID: 640112136
Product: organic solvent tolerance protein
Protein accession: YP_001043428
Protein GI: 126462314
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1452] Organic solvent tolerance protein OstA
TIGRFAM ID:
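
The coordinate and length fields above are consistent with each other, which can be checked with a minimal sketch. It assumes (the record does not state this) that the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names `gene_length` and `expected_protein_length` are hypothetical helpers introduced for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length(1624722, 1626866)
print(length)                           # 2145
print(expected_protein_length(length))  # 714
```

Both results match the listed Gene Length (2145 bp) and Protein Length (714 aa).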


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTTTT TGCGGACTCT GCTTCTGTCT GCCACCATCC TCGCGCCGCT GCCCGCGGCC 
GCGCAGGAGA AGGCCACGCT CGTAGCCGAC AGCGTGTCGA TTGCGAACGA CACCACACTG
GTGGCCGAGG GCCATGTCGA GGTGCTCTAT CAGGGCACGC GCCTCAGTGC GACGCGCGTG
CTCTACGATC AGGCGGCGGA CCGGCTGACT ATCGACGGGC CCATCGTGCT CGACAACGGC
GAGGGCCAGA TCGTGCTGGC CGATCAGGCG GCGCTGTCGG GCGATCTGGC CAACGGGGTC
CTGACCAGCG CGCGGCTGGT GCTCGACCAG CAGCTTCAGC TGGCCTCCTC CGAGCTTGCG
CGGGTGGGCA ACCGCTACAC CGCGCTCGGC CGCACCGTCG CCTCCTCCTG CCAGATCTGC
GCGGACAATC CGGTGCCGCT GTGGGAAGTG CGCGCCAGCC GCGTGGTCCA CGACACGCAG
GAGCGGCAGA TCTATTTCGA CAATGCCTCG CTGCGCATCG CTGGGGTGCC GGTGGCCTAC
CTCCCGCGGC TGCGCATCGC GGATGCGACG CTCGACCGGG CCACGGGCTT CCTGCGTCCC
GAGCTGCGCA CCACGAGCCA GCTCGGCACC GGCGTGCTCG CGCCCTACTT CATCAAGCTC
GGCGACAGCC GCGACCTGAC GCTCCTGCCC CAGATCACCA CCAAGGGCGG CCGCTCGCTC
GGCGCGCGCT ACCGCGAGGC CTTCTCCTTC GGGGCCATCG AGGCGGAAGG TGCGGTCTCG
CGCGACGAGA TCCTGCCCGA CGACCGGCGG GGCTATCTCT TCGCGCGCGG CGCCTTCGAC
CTGCCGCGCG ATTTCACGCT GGCCTTCCAG ATCGAGGAGG TGAGCGACCG GGCCTATTAT
CTCGACTATG GCCTCGGCGA GAAGGACCGT CTCGCCTCCG GCTTTCAGGT CGATAGGGCG
CGGCGGAACG AGTACATCTC GGGGCGCATC CTGAAGTTCC GCACGCTGCG CGAGGATGAG
GACAACGATA CGATCCCCTC CCTCGTGGCG GACGCGACCT TCCACCGCCG CTTCGGCTTC
GGACCGCTCG GCGGCGAGGG CGGGATCCGC TTCTCGACCC ACAGCCACCG GCGCGACTCG
ACCGATCCGC TCGACAGCGA CCTCGACGCC GATACGATCG CCGACGGACG CGACATGTCG
CGGGCCTCGG TGCTGGTGGA CTGGCGCCGC AACTGGATGC TCGGCGGCGG GATCCTCGGC
TCGGCGCTGG CCGAGATCGG CGCCGACGCC TACGACATCT CGCAGGACGC CGCCTGGGAA
GGCTCGGACA CGCGGATCCA TGGCGCCGGC GGCATCGAGC TGCGCTGGCC CTGGGTGCGC
AGCAGCGCGA GCGGCGCCAG CGACGTGATC CAGCCGGTGC TGCAGGCGGT CTGGGCCGAT
ACATCCGGCG ACCGCATCCC GAACGAGGAT TCGGTGCTGG TGGAATTCGA CGAAGGCAGC
CTCTTCTCGC TCGACCGATT CCCCGGCTCG GACGCGGTCG AGGACGGCGG GCGCGCGGCC
ATCGGCGTGA ACTGGACGCG CTACGATCCG GCAGGCTGGT CGCTGGGCGC CACCCTCGGT
CAGGTGATCT GGCAGGAGGA GGATCCCGGC TTCTCCGAGG CCTCCGGCTT CGACGGCAAT
CAGTCGGACT GGCTGGCCGC GCTGCAGCTG TCGCTGCAGA ACGGGCTCAC CCTCACCCAG
CGCACCATCG TCGACAAGGA CTTCGGCGTG ACCAAGGCCG AATGGCAGAT CGACCTCGAC
CGCGACCGCT ACGGCATCAC CTCGCGCTAT GTCCGCATCC GCGAGAGCAC CGAGGAGGCG
CGGCCGGATC CGGTCGAGGA ACTGACGCTC GACACGCGCT ACCGGCTGAC CGACGCCTGG
ACCGCAAACG CCGAGGGCCG CTACGATTTC GAGGCCGACC GCACGGCGCG GGCCGGGGTC
GGGCTCGAGT TCCGGAACGA GTGTCTGAAG GTGGATCTTT CCCTCTCGCG TCGGTTCACT
TCCTCGACTA GTGTGACGCC GAGCACGGAT TTCGGCCTGT CGGTCGATCT CATCGGCTTC
GGCAGCGGAG CGACCCCTGG CCCGTCGCGG GTCTGCCGCA GGTGA
 
Protein sequence
MRFLRTLLLS ATILAPLPAA AQEKATLVAD SVSIANDTTL VAEGHVEVLY QGTRLSATRV 
LYDQAADRLT IDGPIVLDNG EGQIVLADQA ALSGDLANGV LTSARLVLDQ QLQLASSELA
RVGNRYTALG RTVASSCQIC ADNPVPLWEV RASRVVHDTQ ERQIYFDNAS LRIAGVPVAY
LPRLRIADAT LDRATGFLRP ELRTTSQLGT GVLAPYFIKL GDSRDLTLLP QITTKGGRSL
GARYREAFSF GAIEAEGAVS RDEILPDDRR GYLFARGAFD LPRDFTLAFQ IEEVSDRAYY
LDYGLGEKDR LASGFQVDRA RRNEYISGRI LKFRTLREDE DNDTIPSLVA DATFHRRFGF
GPLGGEGGIR FSTHSHRRDS TDPLDSDLDA DTIADGRDMS RASVLVDWRR NWMLGGGILG
SALAEIGADA YDISQDAAWE GSDTRIHGAG GIELRWPWVR SSASGASDVI QPVLQAVWAD
TSGDRIPNED SVLVEFDEGS LFSLDRFPGS DAVEDGGRAA IGVNWTRYDP AGWSLGATLG
QVIWQEEDPG FSEASGFDGN QSDWLAALQL SLQNGLTLTQ RTIVDKDFGV TKAEWQIDLD
RDRYGITSRY VRIRESTEEA RPDPVEELTL DTRYRLTDAW TANAEGRYDF EADRTARAGV
GLEFRNECLK VDLSLSRRFT SSTSVTPSTD FGLSVDLIGF GSGATPGPSR VCRR