Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_2767 |
Symbol | |
ID | 5085120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | - |
Start bp | 2813324 |
End bp | 2814292 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640484330 |
Product | proline iminopeptidase |
Protein accession | YP_001168959 |
Protein GI | 146278800 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCAAA GATCAGGCCA AAAGCGCGCA GTCGAGTTCC TTTATCCGTC GATCGACCCG TTCGATCAGC GGGTGATCGA CATGGGCGAC GGCCACCGGA TCTATGTCGA GCAGTGCGGC AACCCGCAAG GTGAGGCGGT GCTCGTGCTT CATGGCGGAC CGGGCGGCGG CTGCAGCCCC TCGATGCGGC GCTATTTCGA CCCGGTACGC TACCGGGTGG TGCTCTTCGA CCAGCGCGGC TGCGGGCGGT CGCGGCCCCA TGCCTCGGTC GAGGCCAACA CGACCTGGCA TCTCGTCTCG GACATCGAGG TGATCCGCGC CAAGCTCGGG ATCGACCGCT GGACCTGCTT CGGGGGCAGC TGGGGCGCCA CTCTGGCGCT GATCTACGCC ATTTCGCACC CCGAGCGGGT GACGAACCTC GTGCTGCGCG GCGTCTTCCT GATGACCCGG GCGGAACTGG ACTGGTTCTA CGGCGGTGGC GCGGCGACCT TCTTCCCCGA CATCTGGGCG CGGTTCGTGG CACCTGTCCC CGCGGCCGAG AGAGGCGACA TGATCGCCGC CTATCACCGG CGGCTGTTCT CGGGGAACCT GATGGAAGAG AGCCGGTTCG GCCGCGCCTG GGCGAACTGG GAGAACGCGC TCGCCTCGGT CTCGCAAGAC GTTCCGGTGG GCGAGAGCCC CTCGGAATAT GCGCGCGCCT TCGCCCGACT GGAGAATCAC TATTTCTCGA ATGCAGGTTT CCTCGAGCAG GACGGCTGGA TCCTCGCCAA CCGCTCGCGG ATCGCGCACA TTCCGGCCGT GATCGTGCAG GGCCGCTATG ACATGATCTG CCCCCCGCTC TCTGCCTGGA AACTGGCTGA GGGCTGGGAC AAGGCGGACC TGCGGCTGGT GCCCTTCGCG GGCCACGCAC TTTCGGAACC CGGCATCAGC GCCGAGCTGG TGCGCGTGAT GGACACGCTT CCCCGCTAG
|
Protein sequence | MDQRSGQKRA VEFLYPSIDP FDQRVIDMGD GHRIYVEQCG NPQGEAVLVL HGGPGGGCSP SMRRYFDPVR YRVVLFDQRG CGRSRPHASV EANTTWHLVS DIEVIRAKLG IDRWTCFGGS WGATLALIYA ISHPERVTNL VLRGVFLMTR AELDWFYGGG AATFFPDIWA RFVAPVPAAE RGDMIAAYHR RLFSGNLMEE SRFGRAWANW ENALASVSQD VPVGESPSEY ARAFARLENH YFSNAGFLEQ DGWILANRSR IAHIPAVIVQ GRYDMICPPL SAWKLAEGWD KADLRLVPFA GHALSEPGIS AELVRVMDTL PR
|
| |