Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_2053 |
Symbol | |
ID | 5209015 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 2543890 |
End bp | 2545824 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640595658 |
Product | peptidase S9 prolyl oligopeptidase |
Protein accession | YP_001276387 |
Protein GI | 148656182 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.197732 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCATT CGCAGATCGC GCCATACGGT TCGTGGCGCT CGCCGATCAC TGCTGCGCTG GTGGCAACAT CGGGCGTTTC GTTGAACGAC GTTGCTCTCG ACGGCGACGA CATCTACTGG CTCGAAGGGC GACCGGCTGA AGGCGGTCGG GTGGTCATTG TGCGGCGCGC CGCCAATGGA ACGATTGCCG ATGTGACCCC GCCCGGCTTC AACGTGCGCA CCCGCGTCCA CGAATATGGC GGCGCACCAT ACACAGTTGA TCAGGGGGTC GTCTATTTCA GTAACTTTGC CGATCACCGG GTCTACCGCC AGCAAGCGGG TGAAACGCCA ACCCCCGTTA CTCCTGAAGC GCCGTTGCGC TATGCGGATA TGACTGTCGA TCGCGGGCGG AACCGTCTGA TCTGCGTGCG CGAGGATCAT TCCGGCGATG GCGAGGCGGT CAATGCGATT GTTGCCGTTC CACTCGACGG AACAACCGGG CAGCAGGTTC TGGTCGCAGG TTCGGATTTC TATGCCCATC CGCGTCTCAG CCCGGATGGA ACCTGGCTGG CGTGGCTCTC CTGGAACCAC CCGAACATGC CGTGGGACGC TGCCGAACTG TGGGTTGCGC CATTGCGCGA GGATGGCTCA CCGGGAACTG CCGAACGGAT CGCGGGTGGA CCGGACGATG CCGCTTTTCA GCCTGAGTGG GGACCCGACG GCGCGCTCTA TTTCGTTGCC GAACGTACCG GATGGTGGAA CCTCTACCGC TGGCACGCTG GCAATGTCGA GGCGCTCTGC CCGATGGAAG CCGAGTTCGG TCTGCCGCTC TGGGTCTTCA GTGCACGCAC CTATGCCGTC GAATCGGCGG GACGACTGGT ATGCACGTAT ATCGAGCGCG GTGAGCAGAA GATGGCGACG CTCGACACGC AGAGCGGAAT GCTGACGCCG CTCACTCTGC CATTCAGCGA CTTTGGTTTC AGCGGTCCTC GCGCCGCCAA TGGCAGGGTT GTCTTCATCG GCGCTTCATC GAACACGCCC TCGACGCTGG TAATGCTCGA TCTGGCGAGC GATGCACTGA TGACTATTCG CCGTTCGATG GATATCCAGA TCGATCCCGG CTATATCTCG ACGCCGCAGG TGGTGGAATT TCCCACCGAA GGCGGCGTGA CTGCGTTCGG CTTCTATTAT CCGCCGCACA ACCGCGACTT TCGTGCGCCG GAAGGTGAAA AGCCGCCTTT GCTCGTTTTG AGCCATGGCG GACCGACCGG CGCAACATCG GCGTCGTTCG ATGTCGGCAT TCAGTTCTGG ACGAGTCGCG GCATTGCGGT AATGGATGTC AACTACGGCG GCAGCACCGG GTTCGGTCGT GCGTACCGTC AGCGTCTCGA CGGACGCTGG GGAGTCGTTG ATGTGGATGA TTGCTGCAAC GCAGCAACAT ACCTGGCGGC GCAGGGTCTC GCCGATCCAG CGCGATTGAT CATTGCTGGC GGCAGCGCCG GCGGCTACAC GACCCTGGCG GCGCTCACCT TCCGCCGGGT GTTCAAGGCT GGCGCCAGTT ACTACGGCGT CAGCGACCTG GAGGCGCTGG CGCGTGATAC GCACAAATTC GAGTCGCGTT ACCTCGACCG CCTGATCGGA CCGTATCCTG AACGCATCGA TCTCTACCAC GCGCGTTCGC CGATCCACCA TATCGAACAA CTCAACTGCC CGGTCATCTT TCTGCAAGGA CTGGAAGACA GAGTCGTGCC GCCGGATCAG TCCGAACGAA TGGCGGCAGC CCTGCGCACA AAAGGGATAC CGGTTGCATA CCTGGCATTC GAAGGCGAAC AGCATGGCTT CCGCAAGGCA GAGACGATCA TCCGGGCGCT GGAAGCCGAG TTGTACTTCT ACGCGCGCAT CCTGGGATTT GAACCCGCCG ATCCGGTCGA ACCGATTCAG ATCGACAATC TTTAG
|
Protein sequence | MSHSQIAPYG SWRSPITAAL VATSGVSLND VALDGDDIYW LEGRPAEGGR VVIVRRAANG TIADVTPPGF NVRTRVHEYG GAPYTVDQGV VYFSNFADHR VYRQQAGETP TPVTPEAPLR YADMTVDRGR NRLICVREDH SGDGEAVNAI VAVPLDGTTG QQVLVAGSDF YAHPRLSPDG TWLAWLSWNH PNMPWDAAEL WVAPLREDGS PGTAERIAGG PDDAAFQPEW GPDGALYFVA ERTGWWNLYR WHAGNVEALC PMEAEFGLPL WVFSARTYAV ESAGRLVCTY IERGEQKMAT LDTQSGMLTP LTLPFSDFGF SGPRAANGRV VFIGASSNTP STLVMLDLAS DALMTIRRSM DIQIDPGYIS TPQVVEFPTE GGVTAFGFYY PPHNRDFRAP EGEKPPLLVL SHGGPTGATS ASFDVGIQFW TSRGIAVMDV NYGGSTGFGR AYRQRLDGRW GVVDVDDCCN AATYLAAQGL ADPARLIIAG GSAGGYTTLA ALTFRRVFKA GASYYGVSDL EALARDTHKF ESRYLDRLIG PYPERIDLYH ARSPIHHIEQ LNCPVIFLQG LEDRVVPPDQ SERMAAALRT KGIPVAYLAF EGEQHGFRKA ETIIRALEAE LYFYARILGF EPADPVEPIQ IDNL
|
| |