Gene RPD_3691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3691 
Symbol 
ID4024207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4118230 
End bp4119309 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content69% 
IMG OID637963895 
Productpeptidase M48, Ste24p 
Protein accessionYP_570813 
Protein GI91978154 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAGTG ACGTTTCCGC CCCGCCGCCG GCGGTCTTCT TCGACGGCGC GTCGAGCCGG 
CGCCGGCCGG TGACGCTGGC GTTCTCCGAT CGGCTCGAGA TCCTGCAGGA CGGCCGCACG
CTGGCGGCGT GGCCGTTCGC AGATATTCGC CGCGCCGACG GCGCTCCCGG CCTGTTGCGG
CTCGGCTGCG TTTCCGCACC GGCGCTGGCC CGGCTGGAGG TCCCCGATCC CGCCATTGCG
CAACAGCTTG CCGCGCGCTG CAGCTATCTC GATGCCGACG TTCCCCAACG TCACGGCGTC
CGCGCCATCG TCGGCTGGTC GTTGGCCGCG ATCGTATCGC TGGTTCTGGT GTCGGTTTAC
GGCATGCCGC TGATCGCCGA TCGCCTGGCG CCGCTGTTGC CGCAGGCGTT CGAGCGCCGC
GTCGGCGACG TCGCCGACCG GCAGATCAGG ACGTTGTTCG GCGACAAGGT CTGCGACCGG
CCGGCCGGGC AGGCGGCGTT CGCCGTGTTG GTCGAGAAGC TGCGCGCCGC CGGCAGCATC
GGCGAAACGG TGCAGCCGGC GGTGCTGTCG AGCGAGATCT CCAACGCCAT CGCGCTGCCG
GGCGGCCGTG TCTATCTGTT CAGCGCGCTG CTCGACAAGG CGGACAATCC CGACGAGATC
GCCGGCGTGC TCGCGCATGA ATTCGGCCAT GTCGCGCGCC GCGACAACAT GCGGCATCTG
ATCCGCGAAG GCGGCAGTTC GTTCCTGATC GGTCTGTTGT TCGGCGACGT CACCGGGTCG
GGCGCGCTGA TCTTCGCCTC GCGCACGCTG CTCAATTCGT CCTACTCGCG CGAAGCCGAA
CACGACGCCG ACAGCTTCGC CATCGGCGTG ATGCACGGCC TCGGCCGGCC GGTGAAGCCG
ATGGGCGAGC TGCTGTTCCG GGTCACCGGC AAGCAGCGCG ATTCGAGCAT CAGCATTCTG
GCCAGTCATC CACTGACCGA GGACCGGCTG GCGCGGATGA GCGCCGAAGC TGCGATGTCG
CCCGGCGCGC CGCTGCTCTC TGCCGAGCAG TGGCAGGCCC TGAAGGCGAT CTGCAAGTAG
 
Protein sequence
MMSDVSAPPP AVFFDGASSR RRPVTLAFSD RLEILQDGRT LAAWPFADIR RADGAPGLLR 
LGCVSAPALA RLEVPDPAIA QQLAARCSYL DADVPQRHGV RAIVGWSLAA IVSLVLVSVY
GMPLIADRLA PLLPQAFERR VGDVADRQIR TLFGDKVCDR PAGQAAFAVL VEKLRAAGSI
GETVQPAVLS SEISNAIALP GGRVYLFSAL LDKADNPDEI AGVLAHEFGH VARRDNMRHL
IREGGSSFLI GLLFGDVTGS GALIFASRTL LNSSYSREAE HDADSFAIGV MHGLGRPVKP
MGELLFRVTG KQRDSSISIL ASHPLTEDRL ARMSAEAAMS PGAPLLSAEQ WQALKAICK