Gene RPD_3417 details


Gene Information

Locus tag: RPD_3417
Symbol:
ID: 4023930
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3801254
End bp: 3802297
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 66%
IMG OID: 637963622
Product: bile acid:sodium symporter
Protein accession: YP_570542
Protein GI: 91977883
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0798] Arsenite efflux pump ACR3 and related permeases
TIGRFAM ID: [TIGR00832] arsenical-resistance protein
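
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither is stated in the record, so treat both as assumptions. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(3801254, 3802297)
print(length)                           # 1044
print(expected_protein_length(length))  # 347
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.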


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCACCT TCGAACGCTA TCTGACGCTG TGGGTCGCGC TGTGCATCGC GGCCGGCATT 
GCGCTCGGCC ATGTCGTGCC CGGCCTGTTC CACGCCGTTG GCGCAGCCGA GGTCGCCAAG
GTCAATCTGC CGGTCGCGGC GCTGATCTGG CTGATGATCA TCCCGATGCT GGTGAAGATC
GATTTCGCCG CGCTGGGGCA GGTGCGCGAG CATTGGCGCG GCATCGGCGT GACGCTGTTC
ATCAACTGGG CGGTGAAGCC GTTCTCGATG GCGGCGCTGG CCTGGCTGTT CGTCGGCTAT
CTGTTCAGGC CGTATCTGCC GGCCGATCAG ATCAACTCCT ACATCGCCGG GCTGATCATC
CTCGCTGCGG CGCCGTGCAC GGCGATGGTG TTCGTGTGGT CGAACCTGAC CAAAGGCGAG
CCGCATTTTA CTCTGAGCCA GGTTGCGCTC AACGACACCA TCATGGTGTT CGCGTTCGCG
CCGATCGTCG GCCTGCTGCT CGGGCTGTCG GCGATCACCG TGCCGTGGGA CACGCTGGTG
CTGTCAGTGG TGCTGTACAT CGTGGTGCCG GTGGTGATCG CGCAAGCGTT GCGGTGGCGC
GTGCTGGCGA GCGGCGGCGA GGCGGCGTTG CAGCGGCTGC TCGGCCGGCT GCAGCCGCTG
TCGCTGCTCG CCTTGCTGGC GACGCTGGTG CTGCTGTTCG GCTTTCAGGG CGCGCAGATC
ATCCGGCAGC CGCTGGTGAT CGCGCTGCTC GCGGTGCCGA TCCTGATCCA GGTCTATTTC
AACGCCGGTC TCGCGTATCT GCTCAACAGG GTCAGCGGCG AGCAGCATTG CGTCGCCGGA
CCCTCGGCGC TGATCGGCGC CAGCAATTTC TTCGAACTTG CGGTCGCCGC CGCGATCAGC
CTGTTCGGCT TCGAATCCGG CGCCGCGCTC GCCACCGTGG TCGGCGTGCT GATCGAGGTG
CCGGTGATGC TGACGGTGGT GTGGATCGTC AACCGCTCGA AGGGGTGGTA CGAAGATCAG
CCGAACGTGA CGCGCGAGGC TTAG
 
Protein sequence
MSTFERYLTL WVALCIAAGI ALGHVVPGLF HAVGAAEVAK VNLPVAALIW LMIIPMLVKI 
DFAALGQVRE HWRGIGVTLF INWAVKPFSM AALAWLFVGY LFRPYLPADQ INSYIAGLII
LAAAPCTAMV FVWSNLTKGE PHFTLSQVAL NDTIMVFAFA PIVGLLLGLS AITVPWDTLV
LSVVLYIVVP VVIAQALRWR VLASGGEAAL QRLLGRLQPL SLLALLATLV LLFGFQGAQI
IRQPLVIALL AVPILIQVYF NAGLAYLLNR VSGEQHCVAG PSALIGASNF FELAVAAAIS
LFGFESGAAL ATVVGVLIEV PVMLTVVWIV NRSKGWYEDQ PNVTREA
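
The sequence blocks above are printed in groups of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the method used to produce this record: it strips whitespace, computes a GC fraction, and translates DNA with the codon assignments of translation table 11. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Table 11 also allows alternative start codons to be read as methionine at initiation; that case is ignored here because this gene starts with ATG.

```python
from itertools import product

# Codon assignments for NCBI translation table 11, in the conventional
# T, C, A, G base order (the amino-acid string matches the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N (not handled in this sketch).
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
first_bases = "ATGTCCACCT TCGAACGCTA TCTGACGCTG"
print(translate(first_bases))  # MSTFERYLTL
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked the same way.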