Gene RPC_1127 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_1127 
Symbol 
ID3969522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp1226719 
End bp1227819 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content65% 
IMG OID637924238 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_531010 
Protein GI90422640 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0664983 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGATC GTTTCTCCCG CGTTGCGACC GTCATTCTGA TCGCGCTGCT CGCCGCGCTG 
GTCGGGCAGC CCTATATCGA CCGGCTGCTG TTCGCGGCAA CATCGCCCAG GGCGGTCGCT
GCACGAAGCT ATCTGGCGGA ATCCGAGCGG GCGACCATCA ACCTGTTCGA GCGGGTCTCG
CCCTCGGTGG TTCAGGTGGT CGGCTCAGCC GCCGGCAGCG GCCCAACCGA CTTCGAAGGC
GAGCAGCCTC GGGAGCAGAG CGGCACCGGC ATGATCTGGG ACGCCGCAGG TCACGTGGTG
ACCAACAACC ACGTGGTGAA CGGGACCGCT CACGTCGCCG TTCGTCTCGC CAGCGGCGAT
GTCGTTCCCG GCACGATCGT CGGCACCGCT CCGAATTACG ATTTGGCGGT GGTTCGGCTG
CAGAACCCTC GCCGTCTGCC TGCGCCGATT ACGGTGGGCA GCTCGGCCGA TTTGAAAGTC
GGACAGGCCG CGTTCGTGAT CGGCAACCCG TTCGGTCTCG ACCAATCGTT GTCGACCGGC
GTAATCAGCG CCTTGAAGCG GCGCTTGCCG ACCGGTTCAG GGCGGGAAAT CGGCAACGTC
GTCCAGACCG ACGCCGCCGT TAATCCTGGA AACTCCGGAG GTCCGCTACT GGATTCCGCG
GGACGACTGA TCGGCGTGAC CACCGCGATT ATCTCGCCCT CGGGCTCGAA CGCCGGGATC
GGCTTTGCGA TTCCTGTGGA TACGGTGAAT CGGGTGGTCC CCGAACTGAT CAAATACGGA
CGGGTGCCGA CGCCCGGGAT CGGCATCGTC GCCGCCAACG AAGCGGTCGC GACCCGGCTC
GGAATCGAAG GCGTCATCAT TGTCCGTGCG CTGCCGGGAT CGCCCGCCGC CAAATCCGGA
CTGCGCGGCA TCGATCAGGC GGCCGGCGAA ATCGGCGACG TGATCGTCAG CGCCAACGGC
CAACCGACGA GACGCCTGTC GGATCTCACC GACCAGTTAG AGGCGGTCGG AGTCGGACAG
GAGATCGAGC TATCGATCAG GCGCAACAAC CGGTCGAGCA CGGTTCGCGT CAGGGTGCAG
GACATCAGTC AGCCTTCTTG A
 
Protein sequence
MRDRFSRVAT VILIALLAAL VGQPYIDRLL FAATSPRAVA ARSYLAESER ATINLFERVS 
PSVVQVVGSA AGSGPTDFEG EQPREQSGTG MIWDAAGHVV TNNHVVNGTA HVAVRLASGD
VVPGTIVGTA PNYDLAVVRL QNPRRLPAPI TVGSSADLKV GQAAFVIGNP FGLDQSLSTG
VISALKRRLP TGSGREIGNV VQTDAAVNPG NSGGPLLDSA GRLIGVTTAI ISPSGSNAGI
GFAIPVDTVN RVVPELIKYG RVPTPGIGIV AANEAVATRL GIEGVIIVRA LPGSPAAKSG
LRGIDQAAGE IGDVIVSANG QPTRRLSDLT DQLEAVGVGQ EIELSIRRNN RSSTVRVRVQ
DISQPS