Gene Information |
Locus tag | Rsph17029_3142 |
Symbol | |
ID | 4899057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 162028 |
End bp | 164943 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640113744 |
Product | peptidase C14, caspase catalytic subunit p20 |
Protein accession | YP_001045014 |
Protein GI | 126463901 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily; [COG4249] Uncharacterized protein containing caspase domain |
TIGRFAM ID | |
|
|
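The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that does not appear in the protein. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 162028
end_bp = 164943
gene_length_bp = 2916
protein_length_aa = 971

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bp holds N / 3 codons; the last codon is assumed to be the stop codon.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # 2916 0 971
```

Under these assumptions the computed span and protein length agree with the Gene Length and Protein Length fields.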
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0120555 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.089148 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGGGG CAGAGCAGCG ATGGCCGGCG CGGGCGGCTC TCCTGCTCGT GCTGATCGCT GCGGCCGGGC TGGCGCGGGG GCAGGAGACA CCCTCCGGCC CGCCGCCGCG GGTGGCGATG GTGGTCGGCA ATTCCGCCTA TCAGAACGTG CCGGCTCTGC CCAATCCCGC GGGCGACGCG GAACTCGTGG CGGAGAAGCT GTGGGAATCG GGCTTCGAGG TCATCGAGAC GGTGGACGCC GACCGCGAGA CCATGCTCGC CGATCTCGCG ACCTTCCGCA GCCGCCTGCG CGAGGGCAGC GAGGCGCTCT TCTTCTACGC GGGCCACGGG GTGCAGATCG GCGGGCGGAA CTACCTGCTG CCGGTCTCGG TCGCCCCTTC CTCGGTCGAA GACCTGAAGG CCCAGGCCAT CGATGCGCAG CTCTTTCTTG ATGTGATGGC GCAGTCCGGC GCGCGGCTGA CGCTCGTGCT GCTCGACGCC TGCCGCAACA ATCCGTTTCT CGACATGCCC TCCGACGGCG CCGAGGCCAT CGCCACCCGC GCCCTCTCGA TCGGCGCCTC GCGCGCAGCG GTGGAGCGGG GGCTCCGCGA TCTGGCGGCG GCCAGCCGGG GCGGTCTGGC CGAGATGTCG GCCGGCCGCG GCGAGGCGCT CATCAGCTTT GCCACCGCGC CCGGACAGGT GGCCTTCGAC GGGCGCGGAC GGCACAGCCC CTATACGAGC GCGCTCGCCG GCAGCATCGA CGAGCCGGGG CTCGAGCTCG TCGATCTCTT CCGCAGGGTG CGGGGCGCGG TGCGCGAGGC GACCGGCGGG CAGCAGATCG CCTGGACGGC GAGCACGCTC GAGAGCCGCT TCTATTTCAA GCCGCCGGCC GAAGATCTGC GCTCTGCCAC CACCGGGATG GGCACCGGGT CGGACACGCT CGGCGCGTTG CCGCCACGGC GGGTGGTGGA CCGCACCTTC TGGCGCGCGA TCCGCGACAC GGACCGGCTC GACGCCTTCA CCGCCTATGT CCGCACCATG CCGGACGGCG CCTTCGTCGC CGAGGCGAAG GAGAAGATCC GCAGCCTCGG CGGCGATCCC GAGGCGATCC CGGTCTCGGA CCTGTCGCTG CCGGAGCGGC CTCGCGGCTT CGTGCTGGCC GATCCTGCGG CGCAGGCCGA TCTCGCGGCG ACGCTCGACC GGGCGCCCGC GTCGGTGCCC ATCGGAACGG GGGCGGCCCG CATCGGCGCC GCACCCGGCA GGGCGGGCTG GGTTCATGTG GCCGCGGCCC CGCGTCTCGG CGCGGTCTCG GCCGGCGGCA CCCGCCTCGA GGCGGGATCG GTGCGCTGGA TGGAGGCGGA CCAACCGCTC GACTACCTCC CCGGCATCGG CTCGAACGGC GGGCTCGACA GTCTCCGGGC CGAGGCCCTG CGCGAGGGCG GCACGACCGA ACCCCTTGAG GCTGCGGTCG AAACCTATGT CGACGCCTGC GACATCCTTG CGGGCAACCC CTACGACAGC CAGCGCGTCA CCGCCGGCAC GCGCCAGTTC ATCCTCGACC GCAACCATGA TGCCGCCATC GCGGTCTGCG AGATCGCCGT CGCGCGCCAT CCGGAGGTGG TGCGGTTCTG GGCCGAACTC GCGCGCGGCT ACCGCGCGGC GGGCCGCTAC GAGGAGGCGC TCCACTGGCA GCAGAAGGCG GTCGATGCGG GCTATGCCTC GGCGATGGTC TATCTCGGGC AGATGTTCCT CGACGGCCAG GCCGTGCCGC AGGACTTCGA CCGTGCGCGG GAGCTGTTCG AGGCGGCCGA TGCCCGCGGC 
GAGACGGCGG CGCTGACGGC GCTGGCCTGG ATCCACCGGG CGGGGGTGGG TGTGCCGGAG GATCCGGCCC GGGCGCTCGA CTTCTACCGG CAGGGGGCGG CGCGCGGCAA CGACTGGGCG ATGACCAACA TCGGCGAGTT CTACCAGAAG GGCCTGAGCG TCGCCCGGGA TCCGGCCGAA GCGGTGCGCT GGTATACGGC CGCGGCCAAG AGCGGCGAGC TGACCGCCCA AACGCGGCTC GCGCGGATGT ATCAGACCGG CGACGGCATT GCGGTGGATG AGGCGCAGGC CCGCTTCTGG TTCGAGACTG CCGCGGGCCG GGGCGTGCCG AATGCGCTGA CCCGTCTCGG CCTCATGTAT GAGCAGGGTC AGGGGGCGGA CCGGGATCTC GAGGCAGCGG CGCGTCTTTA CGGCCGGGCG GCTGCCGAGG GCGACGCCGA GGCCTGGCTG CGGCTCGGCC GGCTCGAAGC CTCGGACGCG CCGCTGTTCG ACCGGCCCGA GCGCGCCCTG CCGCTGCTGG AGAAGGCGCT CGCGGCGCAG GTGCCGGGCG CCGCGCGCGA GCTCGGGCGG CTCTATGAGA CGGGGCGCGG CGTGACGAAG GATCTGGCCC GCGCCCGGAC GCTCTATGTG CAGGAGGCGG GCGCGAACCC CTGGGCCGCG CGCGACGCCG GGCGCGCCTT CGCCTCGGAT GAGGGCGCGC CCGCCGATCC GGCGCAGGCG GCCCGCTGGT ATCGGGCCGC GGCCGAGGGC GGGGTTCCGT GGGCGGCTCT CGATCTCGGG CGGCTCTACG AGACCGGCCG CGGTGTGCCG CAGGACCGGA CCGAGGCGCT GGTCCTCTAT GCCGCGGCCG CGCGTCCCGG AGGCGATGCC AGGGCGGCCG AGGCCGCCCG CCGCGCCGCC GCCGGCTATT CGGCCGAGGA GACGATCCGG GCGGCGCAGC TGCTGCTCGG CCGCCTCGGC GCCGAGGTGG GCACGCCCGA CGGCCGGGTC GGCCCCGCGA CGCGGGACGC CCTCGCCCGC GTCTTTGCCG CGCAGGGGCG GGCCGCGCCC GGCACCCGCA TCGACTTCGA CCTGCTGGCC GAGCTTTCGG CGATGGATGA GGAGAGACTG CCATGA
|
Protein sequence | MPGAEQRWPA RAALLLVLIA AAGLARGQET PSGPPPRVAM VVGNSAYQNV PALPNPAGDA ELVAEKLWES GFEVIETVDA DRETMLADLA TFRSRLREGS EALFFYAGHG VQIGGRNYLL PVSVAPSSVE DLKAQAIDAQ LFLDVMAQSG ARLTLVLLDA CRNNPFLDMP SDGAEAIATR ALSIGASRAA VERGLRDLAA ASRGGLAEMS AGRGEALISF ATAPGQVAFD GRGRHSPYTS ALAGSIDEPG LELVDLFRRV RGAVREATGG QQIAWTASTL ESRFYFKPPA EDLRSATTGM GTGSDTLGAL PPRRVVDRTF WRAIRDTDRL DAFTAYVRTM PDGAFVAEAK EKIRSLGGDP EAIPVSDLSL PERPRGFVLA DPAAQADLAA TLDRAPASVP IGTGAARIGA APGRAGWVHV AAAPRLGAVS AGGTRLEAGS VRWMEADQPL DYLPGIGSNG GLDSLRAEAL REGGTTEPLE AAVETYVDAC DILAGNPYDS QRVTAGTRQF ILDRNHDAAI AVCEIAVARH PEVVRFWAEL ARGYRAAGRY EEALHWQQKA VDAGYASAMV YLGQMFLDGQ AVPQDFDRAR ELFEAADARG ETAALTALAW IHRAGVGVPE DPARALDFYR QGAARGNDWA MTNIGEFYQK GLSVARDPAE AVRWYTAAAK SGELTAQTRL ARMYQTGDGI AVDEAQARFW FETAAGRGVP NALTRLGLMY EQGQGADRDL EAAARLYGRA AAEGDAEAWL RLGRLEASDA PLFDRPERAL PLLEKALAAQ VPGAARELGR LYETGRGVTK DLARARTLYV QEAGANPWAA RDAGRAFASD EGAPADPAQA ARWYRAAAEG GVPWAALDLG RLYETGRGVP QDRTEALVLY AAAARPGGDA RAAEAARRAA AGYSAEETIR AAQLLLGRLG AEVGTPDGRV GPATRDALAR VFAAQGRAAP GTRIDFDLLA ELSAMDEERL P
|
| |
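The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own method, of how the GC content and the translation could be checked. The function names are hypothetical. The codon string encodes the standard amino-acid assignments, which translation table 11 shares; table 11 differs in which codons may act as start codons, and this sketch does not handle alternative starts (the gene here begins with ATG).

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from the first base; '*' marks a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        a, b, c = (BASES.index(base) for base in seq[i:i + 3])
        protein.append(AMINO_ACIDS[16 * a + 4 * b + c])
    return "".join(protein)


# First three blocks of the gene sequence shown above.
prefix = "ATGCCCGGGG CAGAGCAGCG ATGGCCGGCG"
print(translate(prefix))  # MPGAEQRWPA
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field in the table.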