Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_2836 |
Symbol | |
ID | 4897366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 2989052 |
End bp | 2990017 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640113439 |
Product | proline iminopeptidase |
Protein accession | YP_001044710 |
Protein GI | 126463596 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCAAA GATCAGGCCA AAAGCGCGCA GTCGAGTTCC TCTATCCGTC GATCGATCCC TACGATCAGC GGGTCATCGA CATGGGCGAC GGCCATCGGA TCTATGTCGA GCAATGCGGC GACCCGGACG GCGAGCCGGT GCTGGTGCTG CATGGCGGCC CCGGGGGCGG CTGCAGCCCC TCGATGCGGC GCTATTTCGA CCCGAGCCGC TACCGCGTGA TCCTGTTCGA CCAGCGCGGC TGCGGCCGGT CGCGGCCCCA TGCCTCGGTC GAGGCGAACA CGACCTGGCA CCTCGTCTCG GACATCGAGG CGATCCGCCG GAAGCTCGGC ATCGACCGCT GGACCTGCTT CGGCGGCAGC TGGGGGGCAA CCCTTGCGCT GATCTATGCG ATCTCGCACC CCGAGCGGGT GTCGAATCTG ATCCTGCGCG GCGTCTTCCT GATGACCAAG GCCGAGCTCG ACTGGTTCTA CGGCGGCGGA GCGGCGGCCT TCTTCCCCGA CATCTGGGCC CGGTTCGTGG CCCCCGTCCC GCCGGAGGAG CGGGGCGATC TCGTCGCGGC CTACCGGCGG CGGCTCTTCT CGGGCAACCT GATGGAAGAG ACGCGTTTCG GCCGCACTTG GGCCAACTGG GAGAATGCGC TGGCCTCGGT CGCGCAGGAC GGGCCGCTGG GCGAGAGCCC GTCGGAATAT GCCCGCGCCT TCGCCCGGCT CGAGAACCAC TATTTCTCCC ATGGGGGCTT CCTCGAGCAC GACGGCTGGA TCCTCGCCAA CCGCCACCGG ATCGAGCATA TCCCGGCGGT GATCGTGCAG GGGCGCTACG ACATGATCTG CCCGCCGGTC TCCGCCTGGA CGCTGGCCGA CGGGTGGGAA AAGGCGGATC TCAGGGTCGT GCCGTTCGCG GGCCACGCGC TCTCGGAACC CGGCATCAGC GCCGAACTCG TCCGCGTGAT GGACACGCTT CCCTGA
|
Protein sequence | MDQRSGQKRA VEFLYPSIDP YDQRVIDMGD GHRIYVEQCG DPDGEPVLVL HGGPGGGCSP SMRRYFDPSR YRVILFDQRG CGRSRPHASV EANTTWHLVS DIEAIRRKLG IDRWTCFGGS WGATLALIYA ISHPERVSNL ILRGVFLMTK AELDWFYGGG AAAFFPDIWA RFVAPVPPEE RGDLVAAYRR RLFSGNLMEE TRFGRTWANW ENALASVAQD GPLGESPSEY ARAFARLENH YFSHGGFLEH DGWILANRHR IEHIPAVIVQ GRYDMICPPV SAWTLADGWE KADLRVVPFA GHALSEPGIS AELVRVMDTL P
|
| |