Gene RPB_3501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3501 
Symbol 
ID3911303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4005281 
End bp4006618 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content69% 
IMG OID637885403 
Productpeptidase C14, caspase catalytic subunit p20 
Protein accessionYP_487107 
Protein GI86750611 
COG category[R] General function prediction only 
COG ID[COG4249] Uncharacterized protein containing caspase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCGTT GGGCAATCGG AATTGCGCTG GCGGTGGTGG CTGCGCTCTG GCTGTCGCCG 
GCGCGCGCCG AAAAGCGCGT CGCGCTGGTG ATCGGCAATT CCGGCTATCA GAACGTGTCG
CGGCTCGACA ATCCGAAGAA CGACGCGGTG CTGATGGCGG AGACGCTGAG TGTGATCGGC
TTCACGCTGG TCGGCGGCCG CGCCCAGCTC GATCTCGACA AGCCGTCGCT GGATTCCGCG
GTGCAGAATT TCGGCCGACA GATCCAGGGC GCCGACGTCG CGCTATTCTA CTTTGCCGGC
CACGGCGTGC AGATCAGCGG CGCCAATTAT CTGGTGCCGG TCAACGCCAA TCCGACCCGC
GAGGCCGACG TCGATTTCCA GATGGTCGAC GTCAACGTCG TGCTGCGGCA GATGCAGGCG
GCCGGAACGC GGCTGAAGAT CGTCATTCTC GACGCCTGCC GCAACAACCC GTTCGGCGCG
CGAGGACTGC GATCCTCCGA AGGTGGCCTC GCGCAGATGC GCGCGCCCGA CGGCACGCTG
ATTTCCTACG CGACCCAGCC CGGCAGCGTC GCGCTCGACG GCGGCGATGG CCACAGCCCG
TATACGCGCG CGCTGGCGGC GACGGTGAAG CGGGCCGGGC TCGATCTGTT CCAGACCTTC
AATCAGGTCG GCCTCGCGGT GATGCGTGCG ACCGGCGGCG CTCAGCAGCC CTGGGTGTCG
TCGTCGCCGA TCGACGGCAC GTTCTACTTC GTCGCGCCGG CGCTGCCGCT GCCGCCCGCG
TCGCCGTCGC CGATGCAGGA GGCGCGACTG AGCGAGACCC CGCGTCGCGA TCCCGATCGC
GCGCCGCTGA CCGATGCCGG TGCGCTGCGC GAACTGCGCG ACCGGCTGTA CGAACGCAAT
TTCGATCCCG ACGTGCCGGA CGACAAAGCC GGCTTGCGCA CGGCGATCGC CAAGTTCCAG
GAGAAGGCGG CGCTGCCGCA GACCGGGGAG GCGACCGAAG GCGTGCTCGC GCGGCTGCGG
CAGACCGACG ATCTGAAGCC GTGGGGATCG ATCGTGTACG ACCCGGACAA CGAGAAGTGG
GGAATGTCCT GGAACCACGC GTCGCGGAAA GCCGCGGTGT CGGACGCCGG CGCGAAATGC
AGCGGCGCGC CGTGCAAGGT CGAACTCAGC TTCTACGGCC AGCGCTGCGG CGCCTTCGCG
GTGTCGGCGA TGGCGTGGTC GCTGGTCGAC CGCGACAGCG TCCAGGCGGC GAAGGATGCC
GCGCTCAGCG CCTGCGGCAA GTCCGGCAAG CCGTGCCGCG TGATCGGCGC GGTCTGCGCC
GACGGCTCCG GCCGCTGA
 
Protein sequence
MERWAIGIAL AVVAALWLSP ARAEKRVALV IGNSGYQNVS RLDNPKNDAV LMAETLSVIG 
FTLVGGRAQL DLDKPSLDSA VQNFGRQIQG ADVALFYFAG HGVQISGANY LVPVNANPTR
EADVDFQMVD VNVVLRQMQA AGTRLKIVIL DACRNNPFGA RGLRSSEGGL AQMRAPDGTL
ISYATQPGSV ALDGGDGHSP YTRALAATVK RAGLDLFQTF NQVGLAVMRA TGGAQQPWVS
SSPIDGTFYF VAPALPLPPA SPSPMQEARL SETPRRDPDR APLTDAGALR ELRDRLYERN
FDPDVPDDKA GLRTAIAKFQ EKAALPQTGE ATEGVLARLR QTDDLKPWGS IVYDPDNEKW
GMSWNHASRK AAVSDAGAKC SGAPCKVELS FYGQRCGAFA VSAMAWSLVD RDSVQAAKDA
ALSACGKSGK PCRVIGAVCA DGSGR