Gene RPD_1636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1636 
Symbol 
ID4022116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1834627 
End bp1835778 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content66% 
IMG OID637961831 
Producthypothetical protein 
Protein accessionYP_568774 
Protein GI91976115 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0767] ABC-type transport system involved in resistance to organic solvents, permease component 
TIGRFAM ID[TIGR00056] conserved hypothetical integral membrane protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACAGC AGCTGGCATT GGACGGCGGT CCGACGCTCG AACATATCGC GCGAGGCGAG 
GGGCTGGCCC TGTGCGCCGC AGGTTCCTGG ACCGCGCGGT TCGCGCCGTC GCTCGAGCGG
ATCGTCGCCG ACGCCGAGAA ATTGACCGGG ACGCGGCCGA ACATCTTCAT CGATGTGTCC
GAAGTCGCAC GGCTCGACAC CTTCGGCGCC TGGCTGATCG AGCGGCTGCG CCGAAATCTC
ACCCAGGATG GCGTCGAGGC CCGGATCGCG GGACTGTCGG CGAATTATGC TAGCCTGGTC
GACGAGGTCC GCCAGGTCCA ACCGGACGTC CCGGCGGTCG CATCGGGCGG GGCATTGCGG
GCGCCGATCG AGAAGCTCGG CCGGACCATG TATGTCTTCG CGGACGACAT CGTCGCGTTG
ATCAGCATGT TGGGCGCGGT GCTGGCCGGC GTGATGCGGG CAATCCTGCA TCCGACGACG
TTCCGTCTGA CCTCAACGGT GCATCATCTG GAGCAGGTGT GCTGGCGCGC GGTGCCGATC
GTCGTGCTGA TCACTTTCCT GATCGGCTGC ATCATCGCGC AACAGGGCAT CTTCCATTTC
CGCCGGTTCG GCGCCGACGT GTTCGTGGTC GACATGCTCG GCGTGCTGGT GCTGCGCGAA
ATCGGCGTGC TGCTGGTGGC GATCATGGTC GCCGGCCGCT CGGGCAGCGC CTACACCGCC
GAACTCGGCT CGATGAAGAT GCGCGAGGAG ATCGACGCGT TGCGCACCAT GGGGTTCGAC
CCGATCGACG TGCTGATCGT GCCGCGGCTG ATCGCGCTGC TGCTGGCGAT GCCGATCCTG
ACGTTTCTGG GCGCGATGTC GGCGCTGTAT GGCGGCGGGC TGGTGGCGTG GCTGTATGGC
GGGGTCGATC CGGAGGCGTT CTTGCTGCGG CTGCGCGACG CGATCTCGAT CAATCACTTC
ACCGTCGGCA TGATCAAGGC GCCGGTGATG GCGCTGGTGA TCGGCATCGT CGCCTGTGTC
GAAGGGCTGG CGGTGAAGGG CAGCGCCGAG TCGCTCGGCA GTCACACAAC CGCCTCGGTG
GTGAAGGGGA TCTTCTTCGT GATCGTGATG GACGGCGTGT TCGCGATCTT CTTCGCCTCG
ATCGGGATTT GA
 
Protein sequence
MAQQLALDGG PTLEHIARGE GLALCAAGSW TARFAPSLER IVADAEKLTG TRPNIFIDVS 
EVARLDTFGA WLIERLRRNL TQDGVEARIA GLSANYASLV DEVRQVQPDV PAVASGGALR
APIEKLGRTM YVFADDIVAL ISMLGAVLAG VMRAILHPTT FRLTSTVHHL EQVCWRAVPI
VVLITFLIGC IIAQQGIFHF RRFGADVFVV DMLGVLVLRE IGVLLVAIMV AGRSGSAYTA
ELGSMKMREE IDALRTMGFD PIDVLIVPRL IALLLAMPIL TFLGAMSALY GGGLVAWLYG
GVDPEAFLLR LRDAISINHF TVGMIKAPVM ALVIGIVACV EGLAVKGSAE SLGSHTTASV
VKGIFFVIVM DGVFAIFFAS IGI