Gene Rsph17029_1870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1870 
Symbol 
ID4897724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1978201 
End bp1979799 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content72% 
IMG OID640112464 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionYP_001043746 
Protein GI126462632 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCGGTC CTGAGGCGCG CGCCACGGGC CTGCGCACCT GGGCGGCCTT CACCGCCATG 
TGCCTCGGCA TGTTCATGGC GATCCTCGAC GTCCAGATCG TCGCCACCTC GCTGCCCGAG
ATGCAGCGCG CGCTCGGGAT CTCTCCCGAC CGGATGAGCT GGGTGCAGAC CGCCTATCTC
ATCGCCGAAG TGATCGCGAT CCCGCTCACG GGGGCGCTCA TGCGCATCCT GACCATGCGC
GGGCTGTTCG CGCTGACGAC CTCGCTCTTC ACCCTCGCCT CCATCGGCTG CGCGGCCAGC
GAGAGCTTCG GCCCGCTGGT GGCCTGGCGG GTGCTGCAGG GCTTTGCCGG CGGCACGCTG
ATCCCCGCAG TCTTCGCGGC CGTGTTCCTG CTGTTTCCGG TCCACCGGCA GGGTCTCGCC
ACCACGCTTG CGGGCGTGCT CGCCGTGCTT GCGCCCACCG TCGGGCCGGT CGTCGGCGGC
TGGATTACCG AAAGCTGGTC GTGGCACTGG CTCTTCCTCA TCAACGTGGC GCCGGGCGTG
CTGGCGGTGG GGATCGGCGC GACCCTCCTG CCGCGCGAGC GGCTGCGGCT CGCCGAGGCG
CGGCAGCTCG ATCTCGCAGC CCTCGGGCTC CTCGCGCTGT CTCTCGCCGC GCTCGAGATC
GCGCTGAAGG AGGCGCCGGG GCGCGGCTGG ACGAGCGGCC TCGTCCTCGC TCTTCTCGCC
CTCTGGGCCG CCGCGGGCGC GGGCTTCGTG CGCCGCTGCC TGAGCGGCCC CCGCCCCGTG
GTGGAGTTGC GCGCCTTCGC CGACCGGCGC TTCGCGCTCG CCTGCGTGCT GAGCTTCGTG
CTGGGGATCG GTCTCTTCGG GTCGGTCTAT CTGATGCCGG TCTTTCTCTC CTTCGTGCGC
GGGCACGGGC CGCTCGAGAT CGGGACCATT ATGCTGGTGA CGGGCGTGGC GCAGCTCCTG
ACCGCGCCGC TCGCGGTGGC GGCCGAGCAG CGGCTGGGGG CGCGACTGCT GACGGGCTTC
GGCTTCGCGC TCTTCGCGGC GGGCCTTGCG CTCAGCTCCT TCCAGACGCC GCGCACCGAT
CATGACGAGA TGTTCTGGCC GCAGGTGGTG CGCGGGGTGG CGATCATGTT CTGCCTGCTG
CCGCCGACCC GTCTCGCGCT CGGCCATCTC TCCGAGGCGC GCGTGGCGGA TGCGAGCGGG
CTCTTCAACC TGATGCGCAA CCTCGGCGGC GCCATCGGCC TCGCGCTGAT CGACACGGTG
ATCTTCTCGC GCAGCGCGGG CCATGGCGAG AGCCTGCTCG CGCGGCTCAC CGCCCGCGAT
CTCGAGGCCG CGCGCTTCGT GGGCGTGCCC GAGGCGATGC TCGCGAGCCT GCCGCCGGGG
CCGGTGCCGC CTCAGGCGCA GGCGATGCTG GCGCCGCTTC TGGAAAAGGC TGCCCTCACT
CAGGCCATCA ACGAGGCTTG GGGGATGATC GCGCTTCTGA CGCTGGCGGC GCTTCTCTGC
GTGCCCTTCG CCCGGCCGTC GCGCGGACAG GATCGGGCAG CACCCGTCAC GCTGCAGAGA
GCGCCGCCAC GACGCTCTCC CACCCCGCGC CGCCGGTGA
 
Protein sequence
MPGPEARATG LRTWAAFTAM CLGMFMAILD VQIVATSLPE MQRALGISPD RMSWVQTAYL 
IAEVIAIPLT GALMRILTMR GLFALTTSLF TLASIGCAAS ESFGPLVAWR VLQGFAGGTL
IPAVFAAVFL LFPVHRQGLA TTLAGVLAVL APTVGPVVGG WITESWSWHW LFLINVAPGV
LAVGIGATLL PRERLRLAEA RQLDLAALGL LALSLAALEI ALKEAPGRGW TSGLVLALLA
LWAAAGAGFV RRCLSGPRPV VELRAFADRR FALACVLSFV LGIGLFGSVY LMPVFLSFVR
GHGPLEIGTI MLVTGVAQLL TAPLAVAAEQ RLGARLLTGF GFALFAAGLA LSSFQTPRTD
HDEMFWPQVV RGVAIMFCLL PPTRLALGHL SEARVADASG LFNLMRNLGG AIGLALIDTV
IFSRSAGHGE SLLARLTARD LEAARFVGVP EAMLASLPPG PVPPQAQAML APLLEKAALT
QAINEAWGMI ALLTLAALLC VPFARPSRGQ DRAAPVTLQR APPRRSPTPR RR