Gene RPB_2982 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2982 
Symbol 
ID3910781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3394162 
End bp3395670 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content68% 
IMG OID637884888 
Productpeptidase C14, caspase catalytic subunit p20 
Protein accessionYP_486595 
Protein GI86750099 
COG category[R] General function prediction only 
COG ID[COG4249] Uncharacterized protein containing caspase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0310724 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATCGC TGGCGCGGTG GGGACTATCG GCGATCGGGA TGATCGCCGC GGTGCTGATC 
GGCAGCGCGC CCGCCGAGGC CGCCAAGCGC GTCGCGCTGG TGGTCGGCAA CAACGATTAT
CGCAACGTTC CGAAGCTGCA GAAAGCGGTG AACGACGCCC GCACCATGGG CGACGCGCTG
CGCACGCTCG GCTTTCAGGT GATGGTGGCG GAGAATCAGA ACCGCACGGC ATTCAGCCAG
AGCCTGCTGG CGTTCGATCA GGCGATCGAG CCCGGCGACA CAGCGTTCTT CTTCTACGCC
GGCCACGGTT TCGAAATTTC CGGCCAGAAC TTCCTGCTGC CGACCGATGT GCCGGCGGCG
ACCGAAGGCC AGGAGGAGCT GGTCCGCGAC GCCTCGGTGC TCGCCGACCG CATCGTCGAG
CGGCTGCAGA ATCGCGGCGC GCGCACTGCG ATCCTGGTGT TCGACGCCTG CCGCAACAAT
CCGTTCGAGC GCCGCGGCGT CCGCGCGCTG AGTGGCCGCG GCGGGCTGGC GCCGATGACG
ACGCTGCCCG AGGGCGTGTT CTCGGTGTTC TCGGCCGGGC CGCGGCAGAC CGCGCTCGAC
CGGCTGTCGG ACACCGACGC CGATCCGAAT TCGGTGTTCA CCCGGGTGTT CGCCAAGCAA
CTGCTCGATC CCGGCGAGAA CCTGGTGCAG GTCGCGCAGC GGACGCGCCG CGCCGTCAGC
GAGATGGCCG ACACCGTCGG CCATCGGCAG GTGCCGGTGT ATTTCGACCA GATGGTCGAC
GACGTGTTTC TGAACGGCGC GGCGAAGCCC GCCGCAGCCG CGGCCGCGCC GGTAGCGTCC
GCCGCGCCGC CGCAGAAGGT CGCCGCGCTG CCGCCGGTGG CGCCGCTGAA GCCGCCGACC
TCGGAGAGTC TCAACGCGCC GATCGCCAGC TTCTCCCGCC ACAATGGCGG CTGGACCGTG
GTGTTCTCGA TCGCTGATCC GACGCTCGGC ATCTCCTGGC GGCTCGGCGA CAGCGGCGAG
TTCCGCGAGA CCGGCTTCAT GGACACGCTC GATCCGCGCA CCCGCAAGCG GATGCCGAAC
CCGTCGATCC AGCTCCCCGC CGATGCCCAG GCCTCGACCA TCGAGGTGCG CTATGTCGAT
GCGCTCGGCG AGACGCAGGG GCCGTTTCCG ATCCGGTTCG AGCCGGAGGC TGCATTGCTG
CGCGACCAGC GCAAGATCCT CGACATGACC GCGACGAGCT GGCTGTCGTT TCGCGACTAC
AACGGCCTGC TGGTCTACTA CACGCATCTG ATGTCGTATC GCTGCGCGAT CCGCGAGGTG
AGGATCGGCA TCGACAGCGC GGTGCCCGAC AAGGTGCTGA AACTGCCCGC CTGCGACCTG
CGCGACCCGG TCGCGATCCC GAGCAGCGCC ACGCCCTATC TGAAATTGTC GCCCGGCGTG
AAGTCGGTGT CGGTCGAACT GACCTATCGC GACGGCTCGG TGTCGGAGAT CAAGACGTTC
CGGCGGTGA
 
Protein sequence
MRSLARWGLS AIGMIAAVLI GSAPAEAAKR VALVVGNNDY RNVPKLQKAV NDARTMGDAL 
RTLGFQVMVA ENQNRTAFSQ SLLAFDQAIE PGDTAFFFYA GHGFEISGQN FLLPTDVPAA
TEGQEELVRD ASVLADRIVE RLQNRGARTA ILVFDACRNN PFERRGVRAL SGRGGLAPMT
TLPEGVFSVF SAGPRQTALD RLSDTDADPN SVFTRVFAKQ LLDPGENLVQ VAQRTRRAVS
EMADTVGHRQ VPVYFDQMVD DVFLNGAAKP AAAAAAPVAS AAPPQKVAAL PPVAPLKPPT
SESLNAPIAS FSRHNGGWTV VFSIADPTLG ISWRLGDSGE FRETGFMDTL DPRTRKRMPN
PSIQLPADAQ ASTIEVRYVD ALGETQGPFP IRFEPEAALL RDQRKILDMT ATSWLSFRDY
NGLLVYYTHL MSYRCAIREV RIGIDSAVPD KVLKLPACDL RDPVAIPSSA TPYLKLSPGV
KSVSVELTYR DGSVSEIKTF RR