Gene RPD_3217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3217 
Symbol 
ID4023724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3569305 
End bp3570468 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content70% 
IMG OID637963419 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_570343 
Protein GI91977684 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.557937 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCTCT CCGTCCCGCC GACCGGCCCC ACGCCTGATC CGCAAATCCG CCGGGCGCAG 
CGCACCGACC GCCTGCTTCG CATCGCCGTG ATCTGGCTGC TGCTGCTGGC GACCGCCTGG
GTCGCGCAGC CTTATCTCGC CGCATTGTGG TTCTCGGTGT CCGGCCCGCG CACCGTCACC
GCGCGCGGCG ATCTCGCGCC GGCCGAGACC TCGACCATCG AGCTGTTCAA GCGGGTGTCG
CCTTCCGTGG TGCACGTCTA TGCCCAGTCG AGCCGACGAA CGCCGTCCTT GTTGGAGGCA
CAGCAGGGCG GGGTGCAGTC CGGCTCCGGC GTGATCTGGG ACGCCGCCGG TCACGTCATT
ACCAACAACC ACGTCATCCA GGGCGCGAGC GCGCTCGGCG CGCGGCTGTC GACCGGCGAG
TTCGTCACCG CCCGCGTGAT CGGCACCGCG CCGAACTACG ACCTCGCGGT GCTGCAGCTC
GAGCGACCGC GTGCGGCGCT GCGTCCGATC GCTATCGGGA GCTCGTCGGA TCTGCAGGTC
GGGCAGGCGG CTTTTGCGAT CGGCAGTCCT TATGGCCTGG AGCAGACGCT CACCACCGGC
ATCGTCAGCG CGTTGCAGCG TCGGCTCCCG ACCGCGGCGG CGCATGAAAT CAGCGGCGTG
ATCCAGACCG ACGCGGCGAT CAACCCGGGC AATTCAGGCG GTCCATTGCT CGACAGCGCC
GGCCGGCTGA TCGGGTTGAA CACCGCGATC ATCTCGGGCT CGGGCGCGTC GGCCGGCATC
GGCTTCGCCA TCCCCGTGGA TGCCGTCAAC CGCATCGCGA CCTCGCTGAT CCGCACCGGC
ACCGTGCCGG TTCCTGGAAT CGGCATCATC GCGGCCGACG AGAACGAGGC GGCGCGGCTC
GGGATCGACG GCGTCGTCGT GGTGCGGACG CTGCCGGACT CTCCGGCGGC GCGCGCCGGA
CTGACCGGCG CCAGCGAGAC CGGCATGGTG GAGGACGTCA TCATCGGCGC CAACGGCCAG
GAGATCCACA GCATGTCGGA TCTCGCCGCG GCGCTCGAAG GCATCGGCAT CGGCAGCGAT
GTCAAATTGC AGGTGATCCG CGACGGCCGC GCGCGGACGG TCAACGTGCA GGTGACCGAC
ATCTCTCGGC TGAAGCGGAA CTAG
 
Protein sequence
MPLSVPPTGP TPDPQIRRAQ RTDRLLRIAV IWLLLLATAW VAQPYLAALW FSVSGPRTVT 
ARGDLAPAET STIELFKRVS PSVVHVYAQS SRRTPSLLEA QQGGVQSGSG VIWDAAGHVI
TNNHVIQGAS ALGARLSTGE FVTARVIGTA PNYDLAVLQL ERPRAALRPI AIGSSSDLQV
GQAAFAIGSP YGLEQTLTTG IVSALQRRLP TAAAHEISGV IQTDAAINPG NSGGPLLDSA
GRLIGLNTAI ISGSGASAGI GFAIPVDAVN RIATSLIRTG TVPVPGIGII AADENEAARL
GIDGVVVVRT LPDSPAARAG LTGASETGMV EDVIIGANGQ EIHSMSDLAA ALEGIGIGSD
VKLQVIRDGR ARTVNVQVTD ISRLKRN