Gene RPB_1972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1972 
Symbol 
ID3909477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2238228 
End bp2239280 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content66% 
IMG OID637883866 
Productbile acid:sodium symporter 
Protein accessionYP_485591 
Protein GI86749095 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR00832] arsenical-resistance protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACCT TCGAACGCTA TCTGACGCTG TGGGTCGCGC TGTGCATCGC CGTCGGCATC 
GGGCTCGGCC ATGTCGTGCC CGGCCTGTTC CAGGCGGTCG CCGCCGCCGA GATCGCCAAG
GTCAATCTGC CGGTGGCGGT GCTGATCTGG CTGATGATCA TTCCGATGCT GGTGAAGATC
GATTTCGCCG CGCTGGCGCG GGTGCGCGAG CATTGGCGCG GCATCGGCGT GACGCTGTTC
ATTAACTGGG CGGTGAAGCC GTTCTCGATG GCGGCGCTGG CCTGGCTGTT CATCGGCTGG
CTGTTCAGGG ATCATCTGCC GGCCGATCAG ATCAATTCCT ACATCGCCGG GCTGATCATT
CTGGCCGCGG CGCCGTGCAC CGCGATGGTG TTCGTGTGGT CGAACCTGAT CAAAGGCGAG
CCGCATTTCA CGCTGAGCCA GGTAGCGCTG AACGACACCA TCATGGTGTT CGCCTTCGCG
CCGATCGTCG GCCTGCTGCT CGGCCTGTCG GCGATCACCG TGCCGTGGGA CACGCTGATG
ATCTCGGTGG CGCTGTACAT CGTGGTGCCG GTGATCATCG CGCAAATGCT GCGGCGGCGG
GTGCTGGCGG CCGGCGGCGA GGCGGGATTG CAGCGCTTGC TCGGCGCGGT TCAGCCGCTG
TCGCTGGTCG CCTTGCTGGC GACGCTGGTG CTGCTGTTCG GCTTCCAGGG CGAGCAGATC
ATCCGGCAGC CGCTGGTGAT CGCGCTGCTC GCGGTGCCGA TCCTGATCCA GGTGTATTTC
AACGCCGGGC TCGCTTATCT GCTCAATCGC CTGAGCGGCG AGCAGCATTG CGTCGCGGGT
CCCTCGGCGC TGATCGGCGC CAGCAACTTC TTCGAACTCG CGGTGGCCGC CGCGATCAGC
CTGTTCGGCT TCGAATCCGG CGCGGCGCTG GCCACCGTGG TCGGCGTGCT GATCGAGGTG
CCGGTGATGC TGACGGTGGT GTGGATCGTC AACCGCTCCA AGGGCTGGTA CGAGGGCGAG
GCGCGCGCCG CCGTCACGAC CCGGCCGGGT TAG
 
Protein sequence
MSTFERYLTL WVALCIAVGI GLGHVVPGLF QAVAAAEIAK VNLPVAVLIW LMIIPMLVKI 
DFAALARVRE HWRGIGVTLF INWAVKPFSM AALAWLFIGW LFRDHLPADQ INSYIAGLII
LAAAPCTAMV FVWSNLIKGE PHFTLSQVAL NDTIMVFAFA PIVGLLLGLS AITVPWDTLM
ISVALYIVVP VIIAQMLRRR VLAAGGEAGL QRLLGAVQPL SLVALLATLV LLFGFQGEQI
IRQPLVIALL AVPILIQVYF NAGLAYLLNR LSGEQHCVAG PSALIGASNF FELAVAAAIS
LFGFESGAAL ATVVGVLIEV PVMLTVVWIV NRSKGWYEGE ARAAVTTRPG