Gene Rsph17029_3142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3142 
Symbol 
ID4899057 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp162028 
End bp164943 
Gene Length2916 bp 
Protein Length971 aa 
Translation table11 
GC content74% 
IMG OID640113744 
Productpeptidase C14, caspase catalytic subunit p20 
Protein accessionYP_001045014 
Protein GI126463901 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily
[COG4249] Uncharacterized protein containing caspase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0120555 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.089148 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGGGG CAGAGCAGCG ATGGCCGGCG CGGGCGGCTC TCCTGCTCGT GCTGATCGCT 
GCGGCCGGGC TGGCGCGGGG GCAGGAGACA CCCTCCGGCC CGCCGCCGCG GGTGGCGATG
GTGGTCGGCA ATTCCGCCTA TCAGAACGTG CCGGCTCTGC CCAATCCCGC GGGCGACGCG
GAACTCGTGG CGGAGAAGCT GTGGGAATCG GGCTTCGAGG TCATCGAGAC GGTGGACGCC
GACCGCGAGA CCATGCTCGC CGATCTCGCG ACCTTCCGCA GCCGCCTGCG CGAGGGCAGC
GAGGCGCTCT TCTTCTACGC GGGCCACGGG GTGCAGATCG GCGGGCGGAA CTACCTGCTG
CCGGTCTCGG TCGCCCCTTC CTCGGTCGAA GACCTGAAGG CCCAGGCCAT CGATGCGCAG
CTCTTTCTTG ATGTGATGGC GCAGTCCGGC GCGCGGCTGA CGCTCGTGCT GCTCGACGCC
TGCCGCAACA ATCCGTTTCT CGACATGCCC TCCGACGGCG CCGAGGCCAT CGCCACCCGC
GCCCTCTCGA TCGGCGCCTC GCGCGCAGCG GTGGAGCGGG GGCTCCGCGA TCTGGCGGCG
GCCAGCCGGG GCGGTCTGGC CGAGATGTCG GCCGGCCGCG GCGAGGCGCT CATCAGCTTT
GCCACCGCGC CCGGACAGGT GGCCTTCGAC GGGCGCGGAC GGCACAGCCC CTATACGAGC
GCGCTCGCCG GCAGCATCGA CGAGCCGGGG CTCGAGCTCG TCGATCTCTT CCGCAGGGTG
CGGGGCGCGG TGCGCGAGGC GACCGGCGGG CAGCAGATCG CCTGGACGGC GAGCACGCTC
GAGAGCCGCT TCTATTTCAA GCCGCCGGCC GAAGATCTGC GCTCTGCCAC CACCGGGATG
GGCACCGGGT CGGACACGCT CGGCGCGTTG CCGCCACGGC GGGTGGTGGA CCGCACCTTC
TGGCGCGCGA TCCGCGACAC GGACCGGCTC GACGCCTTCA CCGCCTATGT CCGCACCATG
CCGGACGGCG CCTTCGTCGC CGAGGCGAAG GAGAAGATCC GCAGCCTCGG CGGCGATCCC
GAGGCGATCC CGGTCTCGGA CCTGTCGCTG CCGGAGCGGC CTCGCGGCTT CGTGCTGGCC
GATCCTGCGG CGCAGGCCGA TCTCGCGGCG ACGCTCGACC GGGCGCCCGC GTCGGTGCCC
ATCGGAACGG GGGCGGCCCG CATCGGCGCC GCACCCGGCA GGGCGGGCTG GGTTCATGTG
GCCGCGGCCC CGCGTCTCGG CGCGGTCTCG GCCGGCGGCA CCCGCCTCGA GGCGGGATCG
GTGCGCTGGA TGGAGGCGGA CCAACCGCTC GACTACCTCC CCGGCATCGG CTCGAACGGC
GGGCTCGACA GTCTCCGGGC CGAGGCCCTG CGCGAGGGCG GCACGACCGA ACCCCTTGAG
GCTGCGGTCG AAACCTATGT CGACGCCTGC GACATCCTTG CGGGCAACCC CTACGACAGC
CAGCGCGTCA CCGCCGGCAC GCGCCAGTTC ATCCTCGACC GCAACCATGA TGCCGCCATC
GCGGTCTGCG AGATCGCCGT CGCGCGCCAT CCGGAGGTGG TGCGGTTCTG GGCCGAACTC
GCGCGCGGCT ACCGCGCGGC GGGCCGCTAC GAGGAGGCGC TCCACTGGCA GCAGAAGGCG
GTCGATGCGG GCTATGCCTC GGCGATGGTC TATCTCGGGC AGATGTTCCT CGACGGCCAG
GCCGTGCCGC AGGACTTCGA CCGTGCGCGG GAGCTGTTCG AGGCGGCCGA TGCCCGCGGC
GAGACGGCGG CGCTGACGGC GCTGGCCTGG ATCCACCGGG CGGGGGTGGG TGTGCCGGAG
GATCCGGCCC GGGCGCTCGA CTTCTACCGG CAGGGGGCGG CGCGCGGCAA CGACTGGGCG
ATGACCAACA TCGGCGAGTT CTACCAGAAG GGCCTGAGCG TCGCCCGGGA TCCGGCCGAA
GCGGTGCGCT GGTATACGGC CGCGGCCAAG AGCGGCGAGC TGACCGCCCA AACGCGGCTC
GCGCGGATGT ATCAGACCGG CGACGGCATT GCGGTGGATG AGGCGCAGGC CCGCTTCTGG
TTCGAGACTG CCGCGGGCCG GGGCGTGCCG AATGCGCTGA CCCGTCTCGG CCTCATGTAT
GAGCAGGGTC AGGGGGCGGA CCGGGATCTC GAGGCAGCGG CGCGTCTTTA CGGCCGGGCG
GCTGCCGAGG GCGACGCCGA GGCCTGGCTG CGGCTCGGCC GGCTCGAAGC CTCGGACGCG
CCGCTGTTCG ACCGGCCCGA GCGCGCCCTG CCGCTGCTGG AGAAGGCGCT CGCGGCGCAG
GTGCCGGGCG CCGCGCGCGA GCTCGGGCGG CTCTATGAGA CGGGGCGCGG CGTGACGAAG
GATCTGGCCC GCGCCCGGAC GCTCTATGTG CAGGAGGCGG GCGCGAACCC CTGGGCCGCG
CGCGACGCCG GGCGCGCCTT CGCCTCGGAT GAGGGCGCGC CCGCCGATCC GGCGCAGGCG
GCCCGCTGGT ATCGGGCCGC GGCCGAGGGC GGGGTTCCGT GGGCGGCTCT CGATCTCGGG
CGGCTCTACG AGACCGGCCG CGGTGTGCCG CAGGACCGGA CCGAGGCGCT GGTCCTCTAT
GCCGCGGCCG CGCGTCCCGG AGGCGATGCC AGGGCGGCCG AGGCCGCCCG CCGCGCCGCC
GCCGGCTATT CGGCCGAGGA GACGATCCGG GCGGCGCAGC TGCTGCTCGG CCGCCTCGGC
GCCGAGGTGG GCACGCCCGA CGGCCGGGTC GGCCCCGCGA CGCGGGACGC CCTCGCCCGC
GTCTTTGCCG CGCAGGGGCG GGCCGCGCCC GGCACCCGCA TCGACTTCGA CCTGCTGGCC
GAGCTTTCGG CGATGGATGA GGAGAGACTG CCATGA
 
Protein sequence
MPGAEQRWPA RAALLLVLIA AAGLARGQET PSGPPPRVAM VVGNSAYQNV PALPNPAGDA 
ELVAEKLWES GFEVIETVDA DRETMLADLA TFRSRLREGS EALFFYAGHG VQIGGRNYLL
PVSVAPSSVE DLKAQAIDAQ LFLDVMAQSG ARLTLVLLDA CRNNPFLDMP SDGAEAIATR
ALSIGASRAA VERGLRDLAA ASRGGLAEMS AGRGEALISF ATAPGQVAFD GRGRHSPYTS
ALAGSIDEPG LELVDLFRRV RGAVREATGG QQIAWTASTL ESRFYFKPPA EDLRSATTGM
GTGSDTLGAL PPRRVVDRTF WRAIRDTDRL DAFTAYVRTM PDGAFVAEAK EKIRSLGGDP
EAIPVSDLSL PERPRGFVLA DPAAQADLAA TLDRAPASVP IGTGAARIGA APGRAGWVHV
AAAPRLGAVS AGGTRLEAGS VRWMEADQPL DYLPGIGSNG GLDSLRAEAL REGGTTEPLE
AAVETYVDAC DILAGNPYDS QRVTAGTRQF ILDRNHDAAI AVCEIAVARH PEVVRFWAEL
ARGYRAAGRY EEALHWQQKA VDAGYASAMV YLGQMFLDGQ AVPQDFDRAR ELFEAADARG
ETAALTALAW IHRAGVGVPE DPARALDFYR QGAARGNDWA MTNIGEFYQK GLSVARDPAE
AVRWYTAAAK SGELTAQTRL ARMYQTGDGI AVDEAQARFW FETAAGRGVP NALTRLGLMY
EQGQGADRDL EAAARLYGRA AAEGDAEAWL RLGRLEASDA PLFDRPERAL PLLEKALAAQ
VPGAARELGR LYETGRGVTK DLARARTLYV QEAGANPWAA RDAGRAFASD EGAPADPAQA
ARWYRAAAEG GVPWAALDLG RLYETGRGVP QDRTEALVLY AAAARPGGDA RAAEAARRAA
AGYSAEETIR AAQLLLGRLG AEVGTPDGRV GPATRDALAR VFAAQGRAAP GTRIDFDLLA
ELSAMDEERL P