Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1894 |
Symbol | |
ID | 3907973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 2168876 |
End bp | 2169865 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637883788 |
Product | proline iminopeptidase |
Protein accession | YP_485513 |
Protein GI | 86749017 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACCCG ACGCAAAATC CGAGATCCGG TCCGACGACA GCACCAAGGC CGCAGCGCCG CTCAGCTCTC GAATGCTCGC GGTCGGCGAC GGCCATGAAA TCTATGTCGA AACCAACGGT AACGCCGACG GTCTTCCCGC GGTCTATCTG CATGGCGGCC CGGGAAGCGG TTGTCAGCCG GACCACCGGA GGCTGTTCGA TCCGCGGCGG TTTCACGCCG TGCTGTTCGA TCAGCGCGGC GCCGGCCGCA GCCGGCCGAA AGGCGGGCGT GAGGCGAACA CACTGCCGCA TCTGATCGCC GATATGGAGT CGATCCGCAC CACGCTCGGG ATCGAACGTT GGCTTGTGGT CGGCGGCTCG TGGGGGGCGA CGCTGGCGCT GGCCTATACG CAGGCACATC CGCAACGCGT CAGCGGCATC GTGCTGCGCG CGACCTTCCT CGGTACCCGC GCCGAGCTCG AGGAGGCGTT TCTGTCCACC CTGCCGCGTT TCTATCCGGA ACTGTCTGCT GATTTTCTGG GCATGCTGCC GGAGGCCGAG CGCGCGGCGC CGCTCGACGC CTATTGGCGC CGAATCCTCG ATCCAGACCC GGCGGTGCAT GGCCCCGCGG CGCGGGCCTG GGGCGAAACC GAATCGATCA TGTCGCAGAT CGCACCGAAG CGACATCGCC TCGAGGTCTT CGACAAGAAC AACAGCCGGC CGATGCCGTC GACACCGTTC ATGGAAGCGC ACTACTTCGC GCATGACTGC TTCATGCGCC CCGATCAGTT GCTGCAGGGA GCGCGCGCGC TCGCCGGCAT TCCCGGCATC ATTGTGCAGG GTCGTTACGA TCTGCTGTGC CCGCCCGCCA CCGCGCATCG GCTGATCGCG GCGTGGCCGG ACGCCGAGCT CCGCATCGTC GATGCCGCCG GGCATCTTCT GTACGACCCG GGGATTCGCG ACGCGGTGAT CGCCGCGATC GACGACGTCG CAGGCAGGAT AACAACGTAA
|
Protein sequence | MAPDAKSEIR SDDSTKAAAP LSSRMLAVGD GHEIYVETNG NADGLPAVYL HGGPGSGCQP DHRRLFDPRR FHAVLFDQRG AGRSRPKGGR EANTLPHLIA DMESIRTTLG IERWLVVGGS WGATLALAYT QAHPQRVSGI VLRATFLGTR AELEEAFLST LPRFYPELSA DFLGMLPEAE RAAPLDAYWR RILDPDPAVH GPAARAWGET ESIMSQIAPK RHRLEVFDKN NSRPMPSTPF MEAHYFAHDC FMRPDQLLQG ARALAGIPGI IVQGRYDLLC PPATAHRLIA AWPDAELRIV DAAGHLLYDP GIRDAVIAAI DDVAGRITT
|
| |