Gene Information |
Locus tag | Rru_A0084 |
Symbol | |
ID | 3834277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 97229 |
End bp | 98185 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637824154 |
Product | prolyl aminopeptidase |
Protein accession | YP_425176 |
Protein GI | 83591424 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
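The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in Protein Length; the function names `gene_length_bp` and `protein_length_aa` are illustrative and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 97229, End bp 98185.
length = gene_length_bp(97229, 98185)
print(length)                     # 957, matching "Gene Length"
print(protein_length_aa(length))  # 318, matching "Protein Length"
```

Because the gene is listed as unspliced and not a pseudogene, this simple relationship should hold; a spliced or frameshifted gene would need a different treatment.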
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.826118 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGTC CCGATCCCCT TGCCGATCTT TATCCGCCGA TCGAGCCGCG CCACACCGGT CGGCTGCGTG TGCGTCCTCC CCATGTGATT CATTGGGAGG AAAGCGGCAA TCCCGATGGC ATCGCGGTGA TCTTCGTCCA TGGCGGGCCC GGGGCGGGAA CGGCGCCGTT TTGTCGGCGC TATTTCGACC CGGAGCGCTA CCGGGTGATT ATTTTCGATC AGCGCGGCGC CGGCCGGTCG CGGCCCTTCG CCGAGATCGC CGATAATACC ACCCAGGAAC TGGTCGCCGA TATGGAGCGG CTGCGCGTGC ATCTGGAGGT CGAGCGCTGG CTGGTGTTTG GCGGGTCCTG GGGCAGCACC CTGGCCCTGG CTTACGGCCA AACCCATCCC GAGCGCTGTC TGGGCTTCAT CCTGCGCGGC GTTTTTTTGT TTCGCGGCTT CGAGGTCGAC TGGTTTCTCA ACGGCATGGG CCGTTTCTTC CCCGAAGCGG CGAGCGCCTT CCTCGACTTC CTGCCCGAAG ACGAACGCGC CGATCCGCTG GCGGCCTATT ACCGTCGGCT GACCCATGCC GATCCGTCGA TCCATCTGGC GGCCGCCCGG GTCTGGTCGA ATTATGAAGA CGCCTGCGCC CGCCTGCGCC CGCGCCCGGG CGACGAGGGG GACGGCCGCT CGGCTCTGGC CCTGGCCCGC CTTGAATGCC ATTACATGCG CCATGGCGGT TTTCTGCGCG AAGGCCAGCT TCTGACCGAG ATCGACCGGG TTCGCGATCT GCCCTGCACC ATCGTCCAGG GCCGCTACGA CGTGGTCTGT CCGCCGGTCA GCGCCTGGGA ACTTCACCGG GTCTGGACGG GGAGCAAATT GGTGATGGTC CCCGATGCCG GGCATAGCGC CCTGGAACCG GGCGTGCGCG TCGCCCTGGT TCAGGCGACC CGCCGCTTCG CCGAAAGTCA GGGTTGA
|
Protein sequence | MNSPDPLADL YPPIEPRHTG RLRVRPPHVI HWEESGNPDG IAVIFVHGGP GAGTAPFCRR YFDPERYRVI IFDQRGAGRS RPFAEIADNT TQELVADMER LRVHLEVERW LVFGGSWGST LALAYGQTHP ERCLGFILRG VFLFRGFEVD WFLNGMGRFF PEAASAFLDF LPEDERADPL AAYYRRLTHA DPSIHLAAAR VWSNYEDACA RLRPRPGDEG DGRSALALAR LECHYMRHGG FLREGQLLTE IDRVRDLPCT IVQGRYDVVC PPVSAWELHR VWTGSKLVMV PDAGHSALEP GVRVALVQAT RRFAESQG
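The gene and protein sequences can be cross-checked against the GC content and translation table fields. The following is a minimal sketch in plain Python, not the database's own method: it assumes the sequences are pasted as shown (blocks of 10 separated by spaces), and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons; since this gene starts with ATG, the standard assignments suffice here.

```python
# Codon table built from the conventional NCBI ordering (bases in T, C, A, G order).
# Amino acid assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from the record.
print(translate("ATGAACAGTC CCGATCCCCT TGCCGATCTT"))  # MNSPDPLADL
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence checks the two fields against each other, and `round(100 * gc_fraction(...))` can be compared with the reported GC content.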