Gene RPD_1009 details


Gene Information

Locus tag: RPD_1009
Symbol:
ID: 4021484
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1144956
End bp: 1146542
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 63%
IMG OID: 637961200
Product: peptidase C14, caspase catalytic subunit p20
Protein accession: YP_568148
Protein GI: 91975489
COG category: [R] General function prediction only
COG ID: [COG4249] Uncharacterized protein containing caspase domain
TIGRFAM ID:
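
The Start bp, End bp, Gene Length and Protein Length values listed above are related by simple arithmetic. The sketch below shows that relationship, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon (both assumptions are consistent with the listed numbers but are not stated in the record). The function names are hypothetical, chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1144956, 1146542)
print(length)                     # 1587
print(protein_length_aa(length))  # 528
```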


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.781757
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.111483
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATATCC GCATTGCGTT GTTCGTACTC CTCACTTTGC TGATGGCGCC GGCAACCCGA 
TCGCAAGCCG ACGATCTGCG CGGCGGGCGC GTCGCGCTGG TGATCGGCAA CGCCAAATAT
CCCGACGCCG AGAAGCCTTT GACTCAACCG GTGAACGATG CGCGCGATCT GGCCGACGAG
TTGAAGCGCG ACGGCTTCGA CGTCGACGCC GGCGAGAACC TCAACGGAGA CGCGATGCGG
CGCGCCTTCG ATCGTCTGTA CAATCGTGTC AAGCCGGGAT CGGTGGCGTT GGTGTTCTTC
AGCGGCTTCG GCATCCAGTC CGGACGGCAG AGCTACATGA TTCCGGTCGA TGCCCAGATC
TGGACCGAGC CGGACGTCCG CCGCGACGGC ATCAGCCTCG AGACCGTGCT CGGGGAACTG
AACAGCCGCG GAGCAACCGT CAAGATCGCG TTGATCGACG CCTCGCGGCG CAATCCGTTC
GAGCGCCGGT TTCGTAGCTT CTCGGCGGGC CTCGCCCCGA TCATCGCACC CGGCGGCTCG
CTGGTGATGT ACTCCGCCGC GTTGAGTTCG GTGGTCAGTG ACAATGGCGG CGACCACAGC
CTGTTCGTCA GCGAACTGCT CAAGGAAATC CGCGTTCCCG GCCTCGGGGC CGAGGAAGCA
CTCAATCGCA CCCGGGTCGG CGTCACGCGG GCGTCGCGCA GCGAACAGGT GCCGTGGATT
TCGTCGTCGC TGGCGGAAGA CTTCTCCTTC GTGCCCAGTT CCAAGACCGC TGCGGCCGAT
ACCTCCAAGG CAAATCCCAT TGTCGATCGC ACGCCGCCCG AACCCGCGGC GAAATCCAAC
CCTGCTCCGG TCGAAACCAA GCCCGCCGTG GCGGTTAGTC CGCCCGTGGC GAAGCCTTCC
GCCACCGTCA CCAAGTCTGC TCCGGCGGCC GCCGCCGAAC CTGACAAGCC CGCTGAACTG
GCCAAGCAGA TCGAACTTCC GAAGCCGATC GATGTGCCGA AGGAGCTCGC CAAGGAACTC
GGCTCGATCA CAGCCGAACC GGTCTCCGGT GACGCCGGGC AGGACAACGA GAAGGTGAAG
CTGTCGCTGC GGGACGATCC GACCGTCCAG AGCCTGAGCA ATCGCATCGA GGAAAACCCG
GCCGACGCCA ATGCGCTGTA CCGGCGCGGT CAGGTCTACG CCAGCAAGGG CGCCTACTGG
TCAGCGATCA AGGATTTCGA CGACACGATC CGGCTCAACC CGAGGGATGT CGAGGCCTAC
AACAATCGCT GCTTCGTGCG CACCATCGTC AATGAACTGA CCGCAGCGCT CAAGGACTGC
AATGAAGCAC TGCGTTTGCG CCCCAATTTC GTCGATGCGC TCGACAGCCG GGGGCTGCTC
AATCTCAAGA ACGGCCAGAA CAAGAATGCG ATCGCGGATT TCGACGCGGC GCTGAAGATC
AATCCGCGGC TCACTTCGTC GCTGTACGGA CGGGGACTCG CCCGTCTGCG CTCCGGGATG
AAGTCCGAAG GGGAAATCGA CATCACAACC GCCAAGGGAT TGGACCCGAA TATCGTGAAG
GAGTTCGCTG GCTACGGGGT GCGCTGA
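
The gene sequence above is laid out in blocks of 10 bases separated by spaces and line breaks. A minimal sketch of how such text could be joined into one string and its GC percentage computed is given below; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the short input is just the first two blocks of the sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


first_blocks = clean_sequence("ATGAATATCC GCATTGCGTT \n")
print(first_blocks)       # ATGAATATCCGCATTGCGTT
print(len(first_blocks))  # 20
```

Applied to the full gene sequence, the result of `gc_percent` can be compared with the 63% GC content reported in the Gene Information section.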
 
Protein sequence
MNIRIALFVL LTLLMAPATR SQADDLRGGR VALVIGNAKY PDAEKPLTQP VNDARDLADE 
LKRDGFDVDA GENLNGDAMR RAFDRLYNRV KPGSVALVFF SGFGIQSGRQ SYMIPVDAQI
WTEPDVRRDG ISLETVLGEL NSRGATVKIA LIDASRRNPF ERRFRSFSAG LAPIIAPGGS
LVMYSAALSS VVSDNGGDHS LFVSELLKEI RVPGLGAEEA LNRTRVGVTR ASRSEQVPWI
SSSLAEDFSF VPSSKTAAAD TSKANPIVDR TPPEPAAKSN PAPVETKPAV AVSPPVAKPS
ATVTKSAPAA AAEPDKPAEL AKQIELPKPI DVPKELAKEL GSITAEPVSG DAGQDNEKVK
LSLRDDPTVQ SLSNRIEENP ADANALYRRG QVYASKGAYW SAIKDFDDTI RLNPRDVEAY
NNRCFVRTIV NELTAALKDC NEALRLRPNF VDALDSRGLL NLKNGQNKNA IADFDAALKI
NPRLTSSLYG RGLARLRSGM KSEGEIDITT AKGLDPNIVK EFAGYGVR
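
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last stretches of the gene, using the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs mainly in which codons may act as start codons). The `translate` helper is hypothetical and stops at the first stop codon; it is an illustration, not the method used to produce the record.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGAATATCCGCATTGCGTTGTTCGTACTC"))  # MNIRIALFVL
print(translate("GAGTTCGCTGGCTACGGGGTGCGCTGA"))     # EFAGYGVR
```

The two outputs match the first ten and last eight residues of the protein sequence listed above.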