Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A2180 |
Symbol | |
ID | 3835607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 2531866 |
End bp | 2532765 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637826282 |
Product | peptidase S33, tricorn interacting factor 1 |
Protein accession | YP_427267 |
Protein GI | 83593515 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01250] proline-specific peptidases, Bacillus coagulans-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCAATTG AAAGCGTTGA GGGGTTCGCC CCGTTTCGGG AGTTTCAGAC CTGGTACAGG GTCACCGGCG ATCTGAAGGC GGCCAAGGCG CCCTTGGTCA TCGCCCACGG CGGTCCGGGC TGTACCCACG ACTATGTTGA TTCCTTCAAA GAGATCGCCG GGAGCGGACG GGCGGTTGTG CACTACGACC AGATCGGCAA TGGCCGCTCC ACCCATCTGC GCGACAAGGG GGCGGATTTC TGGACGGTCG AGCTGTTCCT TGACGAACTC GACTCGCTGC TGGCCCATCT CGGCATCGCC GGGCGCTATC ACCTGCTTGG CCAGTCGTGG GGCGGCATGT TGAGCGCCGA GCACGCGGTG CGCCAACCTG CCGGCCTGCT CTCGCTGACC CTGGCCAATT CCCTGGCCTC GATGCCGTTG TGGACCGCCG CCGCCGCCGG ACTGCGCGCC GAACTGCCCG CCGAGGTCAG GGCCACCCTC ACCGCCCATG AAGCGGCCGG AACCACCGAC CATCCCGACT ACAAGGCGGC GAGCCGGGCC TTTTATGACC GCCACGTCTG CCGCGTCACC CCCTGGCCGC CCGAGGTGGC GCGCACCTTC GCCGCCGTCG ATGACGACCC CACCGTTTAC GTCACCATGA ACGGCCCCAC CGAATTCCAC GTGATCGGCA CCCTGCGCGA CTGGAGCGTC ATCGACCGCC TGCCCAGGAT CGACGTGCCG ACCTTCATTT ATCGCGGCGC CTTCGATGAG GCGACCCAGG CCTGCATCCA GCCGTTCATC GATCACATCG GCAAAGCGGA ATGGGCGGTC TTCCCGGACT CCAGCCACAT GCCCCATGTC GAAGTAAAGG ATATCTGCCT GGGCGCGGTC GCCGCCTTCC TTGATCGCCA CGACGGCTGA
|
Protein sequence | MAIESVEGFA PFREFQTWYR VTGDLKAAKA PLVIAHGGPG CTHDYVDSFK EIAGSGRAVV HYDQIGNGRS THLRDKGADF WTVELFLDEL DSLLAHLGIA GRYHLLGQSW GGMLSAEHAV RQPAGLLSLT LANSLASMPL WTAAAAGLRA ELPAEVRATL TAHEAAGTTD HPDYKAASRA FYDRHVCRVT PWPPEVARTF AAVDDDPTVY VTMNGPTEFH VIGTLRDWSV IDRLPRIDVP TFIYRGAFDE ATQACIQPFI DHIGKAEWAV FPDSSHMPHV EVKDICLGAV AAFLDRHDG
|
| |