Gene RPD_3638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3638 
Symbol 
ID4024152 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4061600 
End bp4062580 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content63% 
IMG OID637963842 
Productbile acid:sodium symporter 
Protein accessionYP_570762 
Protein GI91978103 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.283342 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATAG ACGAATTGCG GGATGTTATG GAAAGCCGTC AGGTCGCCGT GTACTTCGTC 
GCCGTCATCC TTGGCGCGCT GGCAGGGACG CTTTTCAGCG GGACGGAAGC GCTTGAAAGG
GCCATCAATC CGGCTCTTGC CCTCATGCTG TTCGTGACGT TTCTTCAGGT CCCGGTCGGC
TCGCTGGGGC AGGCATTTCG CAACGGGCGC TTCTTCGCGG CTCTGCTCCT GACGAATTTC
GTCGCCGTGC CGGTCCTCGC TGCGGCTATC ATCCCGTTTG CTCCACCCGA CGTCCTTGTT
CGGATGGGCG TCCTTTTCGT TCTGTTATGC CCTTGCATCG ACTATGTCGT CACTTTCGCG
CACCTCGGAA GAGGCGATGC CCGGCTCCTG CTGGCCGCTA CACCCGTCTT GCTGGTCGTG
CAAATGCTGC TGCTGCCGCT GTGGCTTCGT CTCTTTCTGG GAGCGGACGC CGCCCAGTTC
GTACAGCCTG AGCCGTTCGT GCATGCTTTC GTCTGGCTCA TCGCGATCCC ACTGGGCCTC
GCGATGGCGT GCCAGCTTTG GGCAGCGCAA AACAAGGCTG GCACTCGCGC CGTAAAAACG
CTCGGTCTTC TGCCGGTGCC GGCGACGGCG GCGGTCCTAT TCATCGTGAT CGCGGCGGTG
CTGCCGCAGA TCGGCCCGGC ACAGGCGGCC GTACTCGGTG TCGCACCGCT TTACGTTGTT
TTCGCGGTGC TGGCCCCGCT GGCCGGCCTG GTCATCGCCC GTATCGCGGG CTTGGAGGCG
CCCGCCGGTC GCGCCGTAGC GTTCAGCGGT GCCACACGCA ATTCGCTCGT CGTCCTCCCC
CTTGCGCTCG CCGTGCCGGG TGCCATTCCG ATAATACCGG CTGTCATAGT AGCGCAGACT
CTGGTGGAGT TGACCGCCTC GCTCGTCTAC ATCCGGCTAA TGCCGCTATT CGGAAGCGAT
GGCGACGCAG CCGCGCATTA G
 
Protein sequence
MKIDELRDVM ESRQVAVYFV AVILGALAGT LFSGTEALER AINPALALML FVTFLQVPVG 
SLGQAFRNGR FFAALLLTNF VAVPVLAAAI IPFAPPDVLV RMGVLFVLLC PCIDYVVTFA
HLGRGDARLL LAATPVLLVV QMLLLPLWLR LFLGADAAQF VQPEPFVHAF VWLIAIPLGL
AMACQLWAAQ NKAGTRAVKT LGLLPVPATA AVLFIVIAAV LPQIGPAQAA VLGVAPLYVV
FAVLAPLAGL VIARIAGLEA PAGRAVAFSG ATRNSLVVLP LALAVPGAIP IIPAVIVAQT
LVELTASLVY IRLMPLFGSD GDAAAH