Gene RPD_2468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2468 
Symbol 
ID4022959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2758068 
End bp2759957 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content61% 
IMG OID637962661 
Productpeptidase C14, caspase catalytic subunit p20 
Protein accessionYP_569599 
Protein GI91976940 
COG category[R] General function prediction only 
COG ID[COG4249] Uncharacterized protein containing caspase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0996556 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.406351 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGGCGC GCGGCAATTG CTATAGCAAA GTTCTTTCGC GTTCGATTTC AACTCTCGCA 
GCAATCGCCT CGGTCCTGCT GCTTGCCAAT CCGGCTTTCT CGCAAGGCCA GGCGCTGGAA
CGCTGTCGCG AAACCGTCGG CAGGCCGATC GTCATGCCTT GCATGAAGGC AGGCGGAACG
CTGGAGTCCT GCCGTGCGCT GGCAACGCCG AAGGTCAGGG CCTGCGTCCA GGCCGCCATG
GGCGGTGGTA TGGGCGGTGG TCCCGGCGGT GCCGGCAGGC AGGGCCCCGG TGGCTTGAAC
CAGCCGCGTT CGGCCGCGGG GCCGGCGGTC GGCAAGGCGC GATCGCTGAT CGAGCAGGGC
AAATATGCCG GAGCGATCGT CGAACTCGAT CGGGCCGTGA AGCAGGATCC GAAATTTCCC
TTTGCCTACG CTTGGCGCGG TGTCGCCAAA ATGCGGATTG GAAAATTCGA CGAAGCCATG
TCTGACTTCA ACGAGGCGTT GAAGCTGGAT CCGCGCAATG CGTTCGCGCT GGGTCAACGT
GGCAACGCGT TCTTTGCGCT GCGTGACAAC GGCAAGGGAT TGGCGGACGT CAATGCGGCC
CTCGAGATCG ACAATACCGC AGCCGCCCCC TACGCCTTCA GGGGAATGAT CTATTCGGAC
ATGGGCGACC AAGACAAGGC ATTGGCCGAC CTGACCCGAG CCGTAAAGCT GAATCCGAAC
CTGCCCCCCG CTCATGGAGG ATTGGGATCA GTTTACAGCA AGCTGCAGGA GTTTGAAAAA
TCGCTGGCGG CTTACAACAG GGCGCTCGAG CTGGCGCCAA ACACCGCTGC CTATCTCAGC
GGCAGAGGCT ATGTCCATTT CAGTCTCGGC GAGTATGACC GCGCGATCAC CGATATTTCG
CAGGCCATCG CGATCAACTC CAGATTTGCC CGCCCCTACA TCAACCGCGG CCGCGCCTAT
ATCGCGACCA ACAATCTGTC GGCGGCAATC AAGGATTTCG ACGAGGCACT CAAGATCGAG
CCGAAGAACA TAACCGCTCT GCTCCAGCGC GCGCAGGCGT TCGAACGTTC GCGCGATTTC
GCCAAGGCGC AAGCCGACCT TCAGGACGCG TTGAAACTGG TTCCTTCGCA CCCCGTCGCC
GTCGCGGGCA TCGAGCGGAT CGACGCCAAA ATGGGCGCCA CCCGTCCGGG CACGGAGCGA
ACCGGTGGAC GTATCGCTCT GGTGATCGGC AATTCGAAAT ATGAGGCATT CGACACGCTC
GCAAATCCCA AACGGGATGC TGCTGCCATT GCGGACGCGC TCGGGAGATC CGGTTTCCAG
AACGTCAAGC TGCTGACAGA CGTGAACCGG GACTCGCTTT TGGCTGCACT TAAAACCTTC
ACCGACGAGA GTAAAAACGC CAACTGGTCG GTCGTTTATT TTGCCGGTCA CGGCATTGAA
CTCGACGGCA GCAATTATCT CGTTCCGGTC GACGCGAAAT TCGAGAACGA TGCCAATATT
CCGAACGAAG GCGTTGCCCT CGATCAGGTC CTCAACGCAG TCGGCGCAGC CGACAAGATG
CGTCTCGTGA TCCTGGACGC CTGCCGCGAG AATCCCTTCG CCGCAGAGAA GAAATCACTC
TCGGTCGGCA GAGGCCTCGC GCGTATCGAA CCCGAAAGCG GCACGCTGGT GGCGTTTGCC
ACCAAGCACG GCCACTACGC GACCGACGGC TCCGGCGACA ACAGCCCGTT CGCGACGTCG
CTGGTCCGGC TCATGGACGC GCCGGGTCTC GAGATCAACC AGCTATTCCG GATGGTGCAT
GACGACGTCT ACGCCAGCAC CGCGAAGAAG CAAGAGCCGT TCACCTACGG GCAGTTGTCG
GCTCAGGGTT TCTATTTCAA GGCAAGATAG
 
Protein sequence
MWARGNCYSK VLSRSISTLA AIASVLLLAN PAFSQGQALE RCRETVGRPI VMPCMKAGGT 
LESCRALATP KVRACVQAAM GGGMGGGPGG AGRQGPGGLN QPRSAAGPAV GKARSLIEQG
KYAGAIVELD RAVKQDPKFP FAYAWRGVAK MRIGKFDEAM SDFNEALKLD PRNAFALGQR
GNAFFALRDN GKGLADVNAA LEIDNTAAAP YAFRGMIYSD MGDQDKALAD LTRAVKLNPN
LPPAHGGLGS VYSKLQEFEK SLAAYNRALE LAPNTAAYLS GRGYVHFSLG EYDRAITDIS
QAIAINSRFA RPYINRGRAY IATNNLSAAI KDFDEALKIE PKNITALLQR AQAFERSRDF
AKAQADLQDA LKLVPSHPVA VAGIERIDAK MGATRPGTER TGGRIALVIG NSKYEAFDTL
ANPKRDAAAI ADALGRSGFQ NVKLLTDVNR DSLLAALKTF TDESKNANWS VVYFAGHGIE
LDGSNYLVPV DAKFENDANI PNEGVALDQV LNAVGAADKM RLVILDACRE NPFAAEKKSL
SVGRGLARIE PESGTLVAFA TKHGHYATDG SGDNSPFATS LVRLMDAPGL EINQLFRMVH
DDVYASTAKK QEPFTYGQLS AQGFYFKAR