Gene Rsph17029_3074 details

Gene Information

Locus tag: Rsph17029_3074
Symbol:
ID: 4898589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 89081
End bp: 90919
Gene Length: 1839 bp
Protein Length: 612 aa
Translation table: 11
GC content: 70%
IMG OID: 640113676
Product: peptidase C14, caspase catalytic subunit p20
Protein accession: YP_001044946
Protein GI: 126463833
COG category:
COG ID:
TIGRFAM ID:
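
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the helper names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts one terminal stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the gene length
    includes exactly one stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(89081, 90919)
print(length)                     # 1839
print(protein_length_aa(length))  # 612
```

Under those assumptions the values match the Gene Length (1839 bp) and Protein Length (612 aa) listed in the record.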


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0393435
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.650719
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGCCG ATCCCGGAAC GCGTCCTGAG GATCTCGCCC CCTATTTCCG CACGCGAGAC 
GCGGCCTCGC CGTTCGATCC ACAGATCGAG ATCGACGAGA CGCGGGTCAT GGTGCCCGAC
ACGGCCGAAG CGGCCAGCCG CTCGGCGCTC GTGCTCAACA GCGCCAACTG GATCGAGCGG
ATGAAGCGGC AGACGCGCTT CCACCGCCGC CTGTCCGAGG GCAACTACAC CGGCCCGATC
ATCGTCGAGG AGGGCGACAG CTGGTTCCAG TATCCGCTCC TGCTGCGCGA CGTGATCGAC
GTGCTGATGG ACCGCTATGC CATCTTCTCG CTGAGCGCCG GCGGCGACAC GCTGGAAAAC
ATGGCGACCG GGGCGCAGTA TCGGAAGGCG CTCGAGGAGA CGCAGGCCAG CATCCTGCTC
CTGTCAGGGG GCGGCAACGA CCTTGTGGCG GACGGGCGGC TGGCCGATCA TCTGCGCCCC
TTCGATCCGA ACCTCCGCCC CGCCGACTAT CTGCTCGGCA GCTTCGACCA GCTGATCGCC
CGGTCGATGG CGCTCTACGA CCGCATCTTC GCCGATGTGG CGCGCCGCTT TCCGAAGGTC
GATGTGATCT GCCACGGTTA CGACTACACC CTGCCCCGCA TCCGCGGAAA GTGGCTGGGC
CAGCCCATGC AGGCGCGCGG CATTGCGGAT CCGGCCCTGC AGGCCTCCAT CGCCGTGGTC
ATGGTCGACC GGCTGAATGC CGAGCTGGCC CGCCTCGCCC GCCGGCATCC GCGGGTGCAT
CACCTCGACC TGCGCGGCCG CGTGGAGCGC GATCACTGGA ACGACGAGCT CCATCCCGAC
GACAGCGGCT ATGCCGCGGT CGCCGCCGTC TTTGCCGAGC GGATCGAGGC GCTGACGCGC
CGCCCGCGCG GGGTCCCGCG CGGCGCAGGG ACAGGCGGCC CTGCGCCCGA GGCCGCTCCC
GCCGCCCCCG CCCTCATCCA GCCGCGCCCG ATGGCCCTCT CGCTTCATGT GGGGGTGAAC
AGGGTCGATC AGGCGCATTA CCGCAACTTC GTGCAGGATC TCGAATTCTG CGTGAACGAT
GCCGAGGCGA TGCGGGATCT GGCGGTCCAG CGCGGCTACG AGACCCGCCT GCTGACGGAC
GCTCAGGCCA CGCGCGAGGC GCTGCGCGGG GCGATGACCG ATGCGGCGCA GCAACTGGAA
CCGGGCGGGA TCTTCCTGAT GAGCTATGCC GGGCACGGTG CCCAGATCGG CGACTTCAAC
GGCGACGAGG GCGACGGCCC GGACCGGGAC CGGCTGGACG AGACGCTCTG CCTGCACGAC
GCCATGCTCG TCGATGACGA GCTCTACCAG CTGTGGGCCG CCTTCCGCGA AGGCGTGCGG
GTGGTGGCGG TCTTCGATTC CTGCCATTCG GGCAGCATCC TCCGGGCGAG CGCCAATCGC
CGCACCGACC GTGCCGGCCG CACGGGCCGC GTGCGCACCA TCTCGCTCGG GGCGAGCGTC
CAGATCTACC GCGCGAACCG CGCCTTCTAC GACGGGTTGC CCTCCTCGAT CCTGCCCTCC
GACAGCGCGG TCCTGACCAA GGAGCTCACC TATCCCGTCA GCGCCTCCAT CCTCCAGATC
TCGGCCTGCC AGTCGAACCA GACCGCCGAG GAGGCCTTCG GCAACGGGCT CTTCACCGAG
CGGCTGATGG CCACTCTGGC CGAGGGCAGC GGCCGGCTGG GCTACAGCGG CTTCACCGAC
CGGATTGCGG CGCGGATGCC GCCCGAGCAG ACGCCGAAGT TCTGGCGCGT GGGGCGGCCC
GATCCGGTCT TCGAGGCGCA GACCGTCTTC TCCGTCTGA
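
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be computed. The following is a minimal sketch of that cleanup; the function names are hypothetical, and the example uses only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


first_blocks = clean_sequence("ATGGCGGCCG ATCCCGGAAC")
print(len(first_blocks))          # 20
print(gc_fraction(first_blocks))  # 0.7
```

Passing the whole gene sequence through `clean_sequence` in the same way would allow the listed gene length and GC content to be checked against the printed bases.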
 
Protein sequence
MAADPGTRPE DLAPYFRTRD AASPFDPQIE IDETRVMVPD TAEAASRSAL VLNSANWIER 
MKRQTRFHRR LSEGNYTGPI IVEEGDSWFQ YPLLLRDVID VLMDRYAIFS LSAGGDTLEN
MATGAQYRKA LEETQASILL LSGGGNDLVA DGRLADHLRP FDPNLRPADY LLGSFDQLIA
RSMALYDRIF ADVARRFPKV DVICHGYDYT LPRIRGKWLG QPMQARGIAD PALQASIAVV
MVDRLNAELA RLARRHPRVH HLDLRGRVER DHWNDELHPD DSGYAAVAAV FAERIEALTR
RPRGVPRGAG TGGPAPEAAP AAPALIQPRP MALSLHVGVN RVDQAHYRNF VQDLEFCVND
AEAMRDLAVQ RGYETRLLTD AQATREALRG AMTDAAQQLE PGGIFLMSYA GHGAQIGDFN
GDEGDGPDRD RLDETLCLHD AMLVDDELYQ LWAAFREGVR VVAVFDSCHS GSILRASANR
RTDRAGRTGR VRTISLGASV QIYRANRAFY DGLPSSILPS DSAVLTKELT YPVSASILQI
SACQSNQTAE EAFGNGLFTE RLMATLAEGS GRLGYSGFTD RIAARMPPEQ TPKFWRVGRP
DPVFEAQTVF SV
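
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates this on the first and last codons only. It is not the tool used to generate the record: the function name is hypothetical, the codon table is built from the standard NCBI ordering (table 11 assigns the same amino acids to codons as the standard code), and translation simply stops at the first stop codon.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGGCGGCCG ATCCCGGAAC G"))  # MAADPGT
print(translate("TCCGTCTGA"))                # SV
```

The two results match the first seven residues (MAADPGT) and the last two residues (SV) of the protein sequence listed above.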