Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3668 |
Symbol | |
ID | 3969605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 4080933 |
End bp | 4081925 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637926778 |
Product | proline iminopeptidase |
Protein accession | YP_533522 |
Protein GI | 90425152 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGCGG AAGCTGCGGC TCAACCCGGC AAAGCCACCG AACGCCCCGC GCCGCTGTCG GCGCAATGGT GGCCCGCGGG CCAAGGTCAC GAGATCTACG TCGAATCCGT CGGCCGCGCC GACGGCACCC CCGCGGTCTA TCTGCATGGT GGTCCCGGCG GCGGCTGCCA GCCGGATCAC CGCCGGCTGT TCTCTGCCGA GCGGTTTCAC GCGGTGCTGT TCGATCAGCG CGGCGCCGGC CGCAGCCGGC CGAAGGCCAG CCGCGATGCC AACACCCTCG CCCTTTTGAT CGCCGACATC GAGCTGATCC GCGAGCGGTT CGGCTTCGAA CGTTGGATGG TGGTCGGCGG TTCGTGGGGC GCCACGCTGG CGCTGGCCTA TGCGCAGGCC TATCCGCAGC GGGTCAACGG ATTGGTGCTG CGCGCCACCT TCCTCGGCAG CCGCGACGAA CTCGACGCGG CGTTCCTCGG CGCGTTGCCG CGGTTCTATC CCGGCCTCAA CGAGGACTTC CTCAGCCTGC TGAGCACGGA GGAACGAAAG CTGCCGCTTG CCGCCTATTG GCGCCGCATC CTCGATCCCG ATCCGGCAAT TCATGGGCCG GCCGCCCGCG CCTGGCACGA CACCGAACGG ATCCTGTCCG AGCACGTGCC GGGCCGCAGC CGGCTCGATC TGGGGTCGCT GAGCGGCGCG CGACCGCTGC CGAGCACGCC GTTCATGGAA GCGCATTATT TCAGCAACGA CTGTTTCCTG CGGCCGAACC AGTTGCTCGA CGGTGCCGCG CGGCTGGCCG GCATCCCCGG CATCATCGTG CAGGGCCGCT ACGACCTGTT GTGCCCGCCA TCGTCGTCCG ACGCGCTGGC GGCCCGCTGG CCCGAGGCCG AAGTCCGTGT CGTCGAAGGC GCCGGCCACT CGCTGTACGA TCCGGGCGTT CGCGACGCGG TGAGCCAGGC GATCGCCGAC CTTCAAATTA GAACCAGCAA ATCGGAAAGT TGA
|
Protein sequence | MEAEAAAQPG KATERPAPLS AQWWPAGQGH EIYVESVGRA DGTPAVYLHG GPGGGCQPDH RRLFSAERFH AVLFDQRGAG RSRPKASRDA NTLALLIADI ELIRERFGFE RWMVVGGSWG ATLALAYAQA YPQRVNGLVL RATFLGSRDE LDAAFLGALP RFYPGLNEDF LSLLSTEERK LPLAAYWRRI LDPDPAIHGP AARAWHDTER ILSEHVPGRS RLDLGSLSGA RPLPSTPFME AHYFSNDCFL RPNQLLDGAA RLAGIPGIIV QGRYDLLCPP SSSDALAARW PEAEVRVVEG AGHSLYDPGV RDAVSQAIAD LQIRTSKSES
|
| |