Gene Information |
Locus tag | RSP_1174 |
Symbol | pip |
ID | 3718167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | - |
Start bp | 2941873 |
End bp | 2942838 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640072405 |
Product | prolyl aminopeptidase |
Protein accession | YP_354259 |
Protein GI | 77464755 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
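The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 2941873
END_BP = 2942838
GENE_LENGTH_BP = 966
PROTEIN_LENGTH_AA = 321


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks agree with the record: 966 bp and 321 aa.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2942838 - 2941873 + 1 gives 966 bp, and 966 / 3 gives 322 codons, of which 321 encode residues and one is the stop codon.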
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.651315 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCAAA GATCAGGCCA AAAGCGCGCA GTCGAGTTCC TCTATCCGTC GATCGATCCC TACGATCAGC GGGTCATCGA CATGGGCGAC GGCCATCGGA TCTATGTCGA GCAATGCGGC GACCCGGACG GCGAGCCGGT GCTGGTGCTG CATGGCGGCC CCGGGGGCGG ATGCAGCCCC TCGATGCGGC GCTATTTCGA CCCGAGCCGC TACCGCGTGA TCCTGTTCGA CCAGCGCGGC TGCGGCCGGT CGCGGCCCCA TGCCTCGGTC GAGGCGAACA CGACCTGGCA CCTCGTCTCG GACATCGAGG CGATCCGCCG GAAGCTCGGC ATCGACCGCT GGACCTGCTT CGGCGGCAGC TGGGGGGCGA CGCTTGCGCT GATCTATGCG ATCTCGCACC CCGAGCGGGT GTCGAACCTG ATCCTGCGCG GCGTCTTCCT GATGACCAAG GCCGAGCTCG ACTGGTTCTA CGGCGGCGGA GCGGCGGCCT TCTTCCCCGA CATCTGGGCC CGGTTCGTGG CCCCCGTCCC GCCGGAGGAG CGGGGCGATC TCGTCGCGGC CTATCGGCGG CGGCTCTTCT CGGGAAACCT GATGGAAGAG ACGCGTTTCG GCCGCACCTG GGCCAACTGG GAGAATGCGC TGGCCTCGGT CGCGCAGGAC GGGCCGCTGG GCGAGAGCCC GTCGGAATAT GCCCGCGCCT TCGCCCGGCT CGAGAACCAC TATTTCTCCC ACGCAGGCTT CCTCGAGCAC GACGGCTGGA TCCTCGCCAA CCGCCACCGG ATCGAGCATA TCCCGGCGGT GATCGTGCAG GGGCGCTACG ACATGATCTG CCCGCCGGTC TCCGCCTGGA CGCTGGCCGA CGGGTGGGAG AAGGCGGATC TCAGGATCGT GCCGTTCGCG GGCCACGCGC TCTCGGAACC CGGCATCAGC GCCGAACTCG TCCGCGTGAT GGACACGCTT CCCTGA
|
Protein sequence | MDQRSGQKRA VEFLYPSIDP YDQRVIDMGD GHRIYVEQCG DPDGEPVLVL HGGPGGGCSP SMRRYFDPSR YRVILFDQRG CGRSRPHASV EANTTWHLVS DIEAIRRKLG IDRWTCFGGS WGATLALIYA ISHPERVSNL ILRGVFLMTK AELDWFYGGG AAAFFPDIWA RFVAPVPPEE RGDLVAAYRR RLFSGNLMEE TRFGRTWANW ENALASVAQD GPLGESPSEY ARAFARLENH YFSHAGFLEH DGWILANRHR IEHIPAVIVQ GRYDMICPPV SAWTLADGWE KADLRIVPFA GHALSEPGIS AELVRVMDTL P
|
| |
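The gene and protein sequences above can be compared with a short script. This is a minimal sketch, not the database's own tooling: it assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG and ends with TGA), and it uses the standard codon assignments, which translation table 11 shares for elongation codons. Table 11 differs in which codons may act as start codons, a detail the sketch ignores because this gene starts with ATG. All function names are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    first_blocks = "ATGGATCAAA GATCAGGCCA AAAGCGCGCA"
    print(translate(first_blocks))  # MDQRSGQKRA
```

Pasting the full gene sequence into `translate` should reproduce the listed protein sequence if the assumptions hold, and `gc_fraction` on the same input can be compared with the GC content field.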