Gene RPD_2423 details


Gene Information

Locus tag: RPD_2423
Symbol:
ID: 4022914
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2701740
End bp: 2702906
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 63%
IMG OID: 637962616
Product: peptidase S1 and S6, chymotrypsin/Hap
Protein accession: YP_569554
Protein GI: 91976895
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
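
The Start bp, End bp, Gene Length, and Protein Length fields above agree with one another. The following is a minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of the original record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2701740, 2702906)
print(length, protein_length_aa(length))  # prints: 1167 388
```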


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00407497
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGGAGCTC TGATCGCCGT CCTCTCACTT GCATTGATGC CGGCGGCTCG CTCCGAAGAC 
GGTCTGAACA TCGACAAATC ATTCGGATCG GTCTCGGGTT GGGACGTTGG GTTCAGCAAG
AACGTCGGTG GCTGTCTGGC CGCAGCGACC TATCGCGACC GGACGACGGT GTGGTTCGGC
TTTGCCGGAG ACAAGCCGAG CGCCTACATC GCCTTCACCA ACCCGCGCTG GGCGTCCGTC
GAGGTCGATG GCCAGTACGA CCTGCAACTG GTCATGCGCC GCACGCGGTG GAACGGTCAG
TTCGTCGGCT TCACGCGCGG TAACGAGAGG GGCGTTTTCT CTGCCGGTCT CAAGACCGAG
TTCATGGTCG AACTGGCCGA GTCCGGCGGT GTGGGTGTGT TCCTGAATCG AAATCGCATC
GCGGCGCTTT CCCTCGACGG CTCCCGGCGC GCCCTCGAAG CGGTGCTGTC TTGCCAGAAG
GCATTCATGA CGGCCCAGAG CGACACACGC GACGAAGGTT CGACCGGGGC CAAGCCAAAG
CGCAACGCCA GGAGTTCAGG CACGGGATTC TACGTCTCCG GGAACGGGCA CATCGTGACC
AACAACCACG TCATCGCCGA ATGCTCGGCG ATCAATGTGA TTCCCCCCGG CGGGGCGCCG
TTGCGCGCGA CTCTCGTGGC GAAGGACAAG ACCAACGATC TCGCGATTCT GAAGACGTCG
TCGTCGCCGC CCGCTGTTCC TGGACTCAGA ACCCAGATGC GGTTGGGCGA AGCCGTGTAC
GTGTTCGGCT TTCCTCTGAC TGGCATCCTG TCGACATCAG GAAACTTCAC GGCCGGCGCG
ATCACCGCGA CCACCGGCAT GGAAGACGAC ACCCGCCTCG CCCAGATCTC CGCTCCGGTT
CAACCGGGCA ACAGCGGCGG TCCGTTGCTC GACAAATACG GCAACGTTGT CGGCGTGATC
GTATCGAAGC TCAACGCCTT GAACATCGCC GCCGCGACCA AAGACATTCC GCAGAACGTC
AATTTCGCGA TCAAATCCGG CATCGCGACG AACTTCCTCG ACAGCAGCGG CGTGCTCCCC
AGCGGCACGG TGAGCACGCG CGAACTCCCG CCGGAGGCGA TCGCCGATCT GGCCAAATCC
TTCACGGTCC AGGTGCTCTG TAATTAG
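
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The sketch below shows one way a GC fraction such as the 63% reported for this gene could be computed. It is an illustration only: the helper names are hypothetical, and the source does not say how its GC content value was derived.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, used here only as sample input.
first_line = "ATGGGAGCTC TGATCGCCGT CCTCTCACTT GCATTGATGC CGGCGGCTCG CTCCGAAGAC"
print(f"{gc_fraction(first_line):.2%}")
```

Passing the full 1167-base sequence instead of the first line gives the gene-wide value.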
 
Protein sequence
MGALIAVLSL ALMPAARSED GLNIDKSFGS VSGWDVGFSK NVGGCLAAAT YRDRTTVWFG 
FAGDKPSAYI AFTNPRWASV EVDGQYDLQL VMRRTRWNGQ FVGFTRGNER GVFSAGLKTE
FMVELAESGG VGVFLNRNRI AALSLDGSRR ALEAVLSCQK AFMTAQSDTR DEGSTGAKPK
RNARSSGTGF YVSGNGHIVT NNHVIAECSA INVIPPGGAP LRATLVAKDK TNDLAILKTS
SSPPAVPGLR TQMRLGEAVY VFGFPLTGIL STSGNFTAGA ITATTGMEDD TRLAQISAPV
QPGNSGGPLL DKYGNVVGVI VSKLNALNIA AATKDIPQNV NFAIKSGIAT NFLDSSGVLP
SGTVSTRELP PEAIADLAKS FTVQVLCN
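
The record lists translation table 11, whose sense-codon assignments match the standard genetic code (the two differ in which codons may act as start codons). The following is a minimal sketch of how the gene sequence maps to the protein sequence, checked against the first and last codons shown above. The names `CODON_TABLE` and `translate` are illustrative, and the function handles only unambiguous A, C, G, T input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, then third base in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT symbols
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


print(translate("ATGGGAGCTC TGATCGCCGT CCTCTCACTT"))  # prints: MGALIAVLSL
print(translate("TGTAATTAG"))  # prints: CN
```

The two outputs match the start (MGALIAVLSL) and end (CN) of the protein sequence listed above.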