Gene Information |
Locus tag | RPD_3473 |
Symbol | |
ID | 4023987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 3852277 |
End bp | 3853266 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637963677 |
Product | proline iminopeptidase |
Protein accession | YP_570597 |
Protein GI | 91977938 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
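The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The record states neither convention explicitly, but its numbers are consistent with both. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3852277
END_BP = 3853266
GENE_LENGTH_BP = 990
PROTEIN_LENGTH_AA = 329


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 990, matching "Gene Length"
print(protein_length(GENE_LENGTH_BP))  # 329, matching "Protein Length"
```

Both results agree with the listed values: 3853266 - 3852277 + 1 = 990 bp, and 990 / 3 - 1 = 329 aa.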
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.598272 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACCCG ACGCGAAGTC CGAGATTCGT TCCGACAGCA ACGGCAAATC CACGGCGCCG CTCACCGCGC AAATGCTCGC GGTCGGCGAC GGCCACGAGT TATATGTCGA AACCAACGGC AACCCCGATG GTCTCGCCGC GGTCTACCTG CATGGCGGCC CCGGCAGCGG TTGCCAGCCC GATCATCGGC GGCTGTTCGA TGCGCAGCGG TTTCATGCCG TGTTGTTCGA TCAGCGCGGC GCGGGACGTA GCCGCCCGAA AGGCGGGCGT TACGCGAACA CGCTGCCGCA TCTGATCGCC GACATGGAAA TGATCCGCAC CACGCTCGGC ATCGAACGCT GGCTCGTAGT CGGCGGATCG TGGGGCGCGA CGCTGGCGCT GGCCTATGCG CAGTCGCATC CGCAGCGCGT CAGCGGCGTC GTTCTGCGTG CGGCTTTTCT CGGCACGCGC GCGGAACTCG AGGGTGCCTT CATGTCGAGC CTGCCGCGGT TCTATCCGGA ACTGCACGCG GATTTTCTCG GCATCCTTCC CGCGGCGGAG CGCAGCGCGC CGCTCGACGC CTATTGGCGG CGCATCCTCG ATCCCGATCC GGAGGTGCAC GGCCCCGCGG CGCGGGCCTG GGGCGAAACC GAGGCGATCA TGTCGCAAAT CGGGCCAAAG CGGTCACGGC TCGAAATCTC CAATGAAAAC AATACCCGGC CGATCCCGTC GACGCCGTTC ATGGAAGCGC ATTACTTCGT CCACGACTGC TTCATGCGCC CCGATCAATT GCTGCATGAC GCGCCGGCGC TCGCGGGCAT TCCCGGCGTC ATCGTGCAAG GCCGCTACGA TCTGCTCTGC CCGCCGGCCA CCGCGCATCG GCTCGGCGCG GCGTGGCCGG ACGCCGAACT ACGCGTCATC GATGCCGCCG GACATCTGTT GTACGATCCG GGAATCCGCG ACGCGGTGAT CGCCGCGATC AACGACCTCG CGACCAAGAT CAAAGCGTGA
|
Protein sequence | MAPDAKSEIR SDSNGKSTAP LTAQMLAVGD GHELYVETNG NPDGLAAVYL HGGPGSGCQP DHRRLFDAQR FHAVLFDQRG AGRSRPKGGR YANTLPHLIA DMEMIRTTLG IERWLVVGGS WGATLALAYA QSHPQRVSGV VLRAAFLGTR AELEGAFMSS LPRFYPELHA DFLGILPAAE RSAPLDAYWR RILDPDPEVH GPAARAWGET EAIMSQIGPK RSRLEISNEN NTRPIPSTPF MEAHYFVHDC FMRPDQLLHD APALAGIPGV IVQGRYDLLC PPATAHRLGA AWPDAELRVI DAAGHLLYDP GIRDAVIAAI NDLATKIKA
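The relationship between the two sequences can be sketched in a few lines of standard-library Python. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares; table 11 differs from the standard code in its permitted start codons, which does not matter here because the gene starts with ATG. Only the first three 10-base blocks of the gene sequence are used, to keep the example short.

```python
# Standard codon assignments in TCAG order; table 11 uses the same
# codon-to-amino-acid mapping (an assumption stated in the lead-in).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence and first 10 residues of the protein.
GENE_PREFIX = "ATGGCACCCG ACGCGAAGTC CGAGATTCGT"
PROTEIN_PREFIX = "MAPDAKSEIR"

print(translate(GENE_PREFIX))  # MAPDAKSEIR
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_fraction` on the full sequence can be compared with the GC content field in the record.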