Gene Information |
Locus tag | Rru_A0868 |
Symbol | |
ID | 3834345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | - |
Start bp | 1037228 |
End bp | 1040548 |
Gene Length | 3321 bp |
Protein Length | 1106 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637824956 |
Product | peptidase C14, caspase catalytic subunit p20 |
Protein accession | YP_425956 |
Protein GI | 83592204 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
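The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 1037228
END_BP = 1040548
GENE_LENGTH_BP = 3321
PROTEIN_LENGTH_AA = 1106


def span_length(start: int, end: int) -> int:
    """Length of a closed interval (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length = span_length(START_BP, END_BP)
print(length == GENE_LENGTH_BP)                               # True
print(expected_protein_length(length) == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions, 1040548 - 1037228 + 1 gives 3321 bp, and 3321 / 3 - 1 gives 1106 aa, matching the table.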
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGAGGG GGTCACAAAC AGCGTCCGGC CGGCTTGGCG AAGGGAAGAC GCTTGGACAG GGAAGGATGC GATGGGTAGC CCTGACCCTG CTGTTTTTGT CGTTCTTGGC GACGGCGCAC GGCGCGCGAT GCCAGGAGAG CGGTCTGCCG GACCAGCCTT TTTTACGCTT CGAGACCGGG ATGCACACGG CGCCGGTGAC CGATCTGGCG GTTGGCGCCG CGGGCACCCT GATCGCCACG GCCTCGCTTG ATAAAAGCCT GCGCCTTTGG GACGGACAGA GCGGGCGCCT TTTGGGCGGC GTTCCCGTGC CCCAAGGGGA CGGGCTCGAA GGAAGCCTTT ATACGGTGAG CCTGTCTCCC GATGGGAGCA AGGCCCTGGT GGCGGGGTAC ACGGCGCAAA GCTGGGATGG CAGCCCCTCC CTTTACCTGA TCGACGTGAA GGCCCAGAAG CTCCAATTCC GCATCAAAGG CCTGCCCGCC GTCGTCACCA GTTTGCGGCA CCGCTCCGAC GGCAAGGTCT TCGCCGCCGG TTTCGCCGCC GGGGCCGGTA TCCGCGTGTG GGACGCGCTG ACCGGCCGGA CCCTGGCCGA GGACAAGGCC TTTCAGGACG CCACCGTGTC TTCCCTGGCC TTCGCCGCCG ACGGCCGGCT TGCCGTCTCC AGTTCCGACG GCTTTGTGCG TTTATACGCC GCGGATTTCC GCCTTTTGGC CCGTGTTTCC GTCAGAAAAA CCGGAACGCC CTTCGGTTTG GCTTTTTCCC CCGATGGTCG AACCCTCGCC ATAGGATATC GCTCGGCGGC CCTTGTGGAC CTGCGCGACG GCCTGACCCT GGCCGCCGGC ACCATCCTTT CCGTACCGCC CGGCGAGACG GGGGAAACTC CCCTTGTCAC CTGGACGACC GTGGCCGGCG AGAGCGTTCT CGTCGCCGGC GGCACCCGCG CCGACACCGA CCAGCGAACT CTTTTGCTTG CCTACGCCCC CCAAAAGCGC AAGTGGTCGG TCCTTGGAAC GGTGGGACAC GACACGGTGA CGGCTCTCGC CCCCCATCCG GACGGGGGCG TCGTGTATGG AACGGCCGAT CCGTCTTGGG GAATGATGGG GAGGGGAACG GCCGAGGCGC CGCTGCCGGC CATCCACCAC GAGGCCGGCA TCGCCGATTT CCGCGATGTC GCCTCCGGCT TTTTCGCCGT GACCCCCGAA GGGACCCTCC TGGAGGTCGG TATCGCCAAG GGAGGACAGA CGCCCTGGAC GATCGATCTT TCCCGGGGCT TGGTTCGCCC GGGGCCGGCC GGACAAGAGA CCGTTCGCGC CACGGCCAGC CCCGGTCTGG CTGACTGGCG CAACGGCCGC GCCCCCCTTC TGGGGAAAAT CCCCCTGGTC CTCGAGCCGG CGGAGACGGC CCGTAGCCAA GCTTTGCTTC CCGATGGCAA GCGCTTCATC CTTGGCGCCG AATTCTCCCT GCGTCTATAC GACGAGACCG GACGCCTGCT TGACCGCCTG ATCACCCCCG CCCCGGTCTG GGGGGTTCTT CCCGTTCCAA ACCTCCCCCT CGTCGTTGCC GCCCTGGGGG ACGGAACCCT GCGTTGGTAT CATGTTACCT TGGAAAACGG GCTGCGCGAG CAAGGCGCCT TTTTCCTTCA CCCCGCCTCC AACCGCTGGG TGGCCTGGAC GCCGGAGGGC TTTTTCGCCC ATTCCGACAG TGGCGGGCAG ACCATGGTGG GCTACCTGTT CAACGTGGCC AAAGCGCAAT CCCCGACCTT GGTGGATTTC GCCCAGATGT ATCGTCTGTT CCATAAGCCG 
GGTCTGGTCT GGCCGGCGTT GATCGATCCG ATCGGCGCGC GGCGGGGACT GGATCTCGCG CTCGCCCAGA TCGGAGATAT CCGCTCGGTT CTGGGGATGG CGCCGCCCCC CTCCGTCGCC CTGGCGCAGG TCTGCCCCAA GGAAAACACC CCCGTCACCC GCGGCTTTCA TGTGATTGAG ACCACCCCGC CCCCGCCATC GGTGGGCGGG AGCGACGAGA CCGACCCTTG TTTCGCCCCC GCCGCCCTAG CCCGCCAAGC GGGTTCGTCC GATGCGATTC ACGGCAAGGA CGCGGCGGTT CTTTCGGTCA TGACCCTGGC TTCGACCCAA AAGGAGGTGG CTTTGCGGTT TGATGTCGAG GATCGCGGCG GCGGCCTGGA ACTGGTCGAC GTTCTGGTCA ACGGGCTCAA CGTGGGGCGG AAGCGGGCTT TCGAAAAGCG GGTCGCCGCC CCCCCTTCGC CGCCTTCGGC TTCGGCGCCT TCGGCCGCGC CCCCGTCCGT CGCCGAGCCG TCCGCTGCCT CCTCCCCCAA GACCACGGCC ATCGCGGCGG CCATCGCCCC CGCGCAGCCG GCGGACCCCG CGCAGACAGC GGATACGGAA GACTTTCCCT TCCCCGCGCC CCCCTTGAGA TTCGAGCGGG CCGTCACCCT GAATGAGGGG ATCAACCGGG TCGAGATCCG GGCCTATGGA CAAAACGGGG TCTACGGCCG CACGGCTTTT GATCTCGTGG TCCCGCCCCC ACCGCTGGAG AAGGCCCCTC CCTCACCAGC CCCCGCCCCC CACAGCCGAC TTGTCGTGCT GGCCGTTGGC ATCAACGACT ATGCCGGCGC CAGCAATGAT CTCCAATATG CCGTAGCCGA CGCCAGAACC TTCGCGACCA CCGTTCGGCG TCAAGCGCTG GGCGTTTATG AGGAGATCCG GGTTATCGAG CTTTATGATG CCCAGGCCGA GCGTCCCGCC CTGGAAGCCC ATTTGGCCCA GCTCGCCGAT GAAACCCGGC CCGAGGACAG CCTGCTTCTT TATTTCTCCG GACATGGCGA GGTCGACGAA CAGGGACTCT ATCGCTTCCT GACCCCGCTC GCCTATACCA CGCCCGAGGA GCGCGGGCTT GACGCGAACC GTTTGGTGGA GTTGCTCGGC GATATTCCGG CGCAACGCAA AATGTTGTTC CTCGATACCT GTCACTCCGG CGCTTTCGAC ATTGAAGAGG TTTCGGGAAA CCTGTTCAAC GAAACGGGGC AATTCATCCT GTCCGCGGCC GAGGCCTCGG AAACCGCCGC GGATGTGCTG CCCGGCACCC AAAACGGGGT ATTCGCGGTC GCCGTCATGC AAGGCTTGGA ACGCGATGCC GCCCTGCGCA GCGGAACCAC CGTAAGCGCC CTCACCTTGG GGGAATGGGT CCGCCTGCGG GTGCCCGTCC TGGCGCAAGA ACGCCGCCTG GGCGCCCAAC ACGCCGTGTT CAAGGGCAAT AACACCATGG CCTTTCCCCT CACCCATATC CTGGAAGAGC CACAACCATG A
|
Protein sequence | MQRGSQTASG RLGEGKTLGQ GRMRWVALTL LFLSFLATAH GARCQESGLP DQPFLRFETG MHTAPVTDLA VGAAGTLIAT ASLDKSLRLW DGQSGRLLGG VPVPQGDGLE GSLYTVSLSP DGSKALVAGY TAQSWDGSPS LYLIDVKAQK LQFRIKGLPA VVTSLRHRSD GKVFAAGFAA GAGIRVWDAL TGRTLAEDKA FQDATVSSLA FAADGRLAVS SSDGFVRLYA ADFRLLARVS VRKTGTPFGL AFSPDGRTLA IGYRSAALVD LRDGLTLAAG TILSVPPGET GETPLVTWTT VAGESVLVAG GTRADTDQRT LLLAYAPQKR KWSVLGTVGH DTVTALAPHP DGGVVYGTAD PSWGMMGRGT AEAPLPAIHH EAGIADFRDV ASGFFAVTPE GTLLEVGIAK GGQTPWTIDL SRGLVRPGPA GQETVRATAS PGLADWRNGR APLLGKIPLV LEPAETARSQ ALLPDGKRFI LGAEFSLRLY DETGRLLDRL ITPAPVWGVL PVPNLPLVVA ALGDGTLRWY HVTLENGLRE QGAFFLHPAS NRWVAWTPEG FFAHSDSGGQ TMVGYLFNVA KAQSPTLVDF AQMYRLFHKP GLVWPALIDP IGARRGLDLA LAQIGDIRSV LGMAPPPSVA LAQVCPKENT PVTRGFHVIE TTPPPPSVGG SDETDPCFAP AALARQAGSS DAIHGKDAAV LSVMTLASTQ KEVALRFDVE DRGGGLELVD VLVNGLNVGR KRAFEKRVAA PPSPPSASAP SAAPPSVAEP SAASSPKTTA IAAAIAPAQP ADPAQTADTE DFPFPAPPLR FERAVTLNEG INRVEIRAYG QNGVYGRTAF DLVVPPPPLE KAPPSPAPAP HSRLVVLAVG INDYAGASND LQYAVADART FATTVRRQAL GVYEEIRVIE LYDAQAERPA LEAHLAQLAD ETRPEDSLLL YFSGHGEVDE QGLYRFLTPL AYTTPEERGL DANRLVELLG DIPAQRKMLF LDTCHSGAFD IEEVSGNLFN ETGQFILSAA EASETAADVL PGTQNGVFAV AVMQGLERDA ALRSGTTVSA LTLGEWVRLR VPVLAQERRL GAQHAVFKGN NTMAFPLTHI LEEPQP
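The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 30 bases of the gene. This is a minimal sketch, not the database's own pipeline: it assumes the sequence is listed 5' to 3' in coding orientation (already reverse-complemented for the minus strand), and it uses the sense-codon assignments of the standard genetic code, which translation table 11 shares. The special start-codon handling of table 11 is not modelled; it is not needed here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in the record.
first_30_bases = "ATGCAGAGGG GGTCACAAAC AGCGTCCGGC"
print(translate(first_30_bases))  # MQRGSQTASG
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `translate` should, under the same assumptions, yield the full 1106-residue protein.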
|