Gene RPD_3272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3272 
Symbol 
ID4023781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3627009 
End bp3628136 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content69% 
IMG OID637963475 
Productrhomboid-like protein 
Protein accessionYP_570397 
Protein GI91977738 
COG category[R] General function prediction only 
COG ID[COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00126234 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCGGTTTG CGGCCGCCGC TGCGGTGCCT AGAGTGGGTC CGTTCTCGGC GCGTAGGCCT 
TTTCCAGGCT CCTGCGCGAT CTTTACGGGG TGGCTGCAAA ACAGACCGGC GCGCGCGCGG
ATCATCGCGG CGGACAGCAC CGGCAGGCTT GAAGGGGAGG GCTTTCTCAG AGTCCTCCCC
TTTTTTGTGG GTGCCAGGGC TCGGCAGGGG ACGGCGATCA TCGGGCTCCG TTGGGCGTCG
CCGGGGGCGC CGGCGGACCG GCTTGCGCCG TTCAGCCAAG CCGCCTATGG GATCGGATCG
CCTCCGGGGC CGTGCGCCGC GGTCGGGCCG CTCGCAGCAG GTCGTAAAAC GTTGGATTCC
TCCCCCGAAA CGCAGCCGCT GCCACCGCCC CCGCCGGAGC CGCCGCGCGA ACCGATCCTG
AACCTTCCGG CCGCGCTCGC CGCCTATGTC GCGCTGCTGG CGGTGATCCA TCTGCGCGTG
TTGCTGCCGC CGGACATCGA ATATTGGACC ATCGAAGTGT TCGGCTTCAT TCCGAAGCGC
TATGACGCGA CCCTGCTGGC GACGCCGTTC GCCGGGGGCA GCGGCGCCAA GGTCTGGAGT
TTCGTGACCT ATTCGCTGCT CCACGCCAAT CTCAGCCACA TCATCTTCAA CGTGCTGTGG
CTGCTGCCGT TCGGCAGCGC AGTGGCGCGG CGGTTCGGCG CGGCGCGGTT CTTCCTGTTC
ATGGCGGTCA CCGCGGTCGG CGGCGCGCTC GCCCATCTCG TCACCCACGA GCACGAGATC
GCGCCGATGA TCGGCGCTTC GGCCTCGGTG TCCGGCGCGA TGGCGGCGGC GATCCGGTTT
GCGTTTGCGC GCGGCAGTTT CCTGTCGCTG CGCAGCGGCG ACGCCGATGC GGCGGCGCGG
GTGCCGGCGC AGCCCTTGAT CCGCGCGCTG CGCGATCCGC GCGTGCTCGC CTTCCTCGCG
ATCTGGTTCG GCATCAACAT CATCTTCGGC GTCGGCTCGA TCGCGGTCGG CAACGAAGGC
GCGAGCGTCG CCTGGCAGGC GCATATCGGC GGCTTCTTCG CGGGCCTGCT GCTGTTCTCG
TTGTTCGACC CGGTGCCGCG ATCGGCGCAG ACCTCCGCTC ACAACTAA
 
Protein sequence
MRFAAAAAVP RVGPFSARRP FPGSCAIFTG WLQNRPARAR IIAADSTGRL EGEGFLRVLP 
FFVGARARQG TAIIGLRWAS PGAPADRLAP FSQAAYGIGS PPGPCAAVGP LAAGRKTLDS
SPETQPLPPP PPEPPREPIL NLPAALAAYV ALLAVIHLRV LLPPDIEYWT IEVFGFIPKR
YDATLLATPF AGGSGAKVWS FVTYSLLHAN LSHIIFNVLW LLPFGSAVAR RFGAARFFLF
MAVTAVGGAL AHLVTHEHEI APMIGASASV SGAMAAAIRF AFARGSFLSL RSGDADAAAR
VPAQPLIRAL RDPRVLAFLA IWFGINIIFG VGSIAVGNEG ASVAWQAHIG GFFAGLLLFS
LFDPVPRSAQ TSAHN