Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2525 |
Symbol | |
ID | 3916846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2727747 |
End bp | 2729882 |
Gene Length | 2136 bp |
Protein Length | 711 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640445282 |
Product | prolyl oligopeptidase |
Protein accession | YP_497795 |
Protein GI | 87200538 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1505] Serine proteases of the peptidase family S9A |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.440403 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCTCGA TCCGCCCCTT GCTCGCCGCC TCGGCGCTGG CCTGCCTCGC CATGTCCATG ACGGCCGAGG CGGCGCCGGC CGCGATGAAG TACCCGCAGA CCGAGCGAGG CACCGTGGTC GAGACGGCCT TCGGCGAGAA GGTCGCCGAT CCCTACCGCT GGCTCGAGGC CGACGTCCGC GTGGATCCGA AGGTGGCCGC CTGGGTCGAT GCCCAGAGCA GGTTCACCGA CGCCTATCTC AAGGCCTTGC CCGAGCGTCC GGCCTTCGAG CAAAGGCTGA AGACGCTGTT CGACTTCGAA CGCTTCGGGC TTCCGGTGAA GGCGGGTGAT CTTCTGTTCT TCCGCCACAA CTCCGGCCTC CAGAACCAGT CGGTGCTCTA TGTGCGCAAG GCCGATGGCA GCGGCGAGCG ACGGGTGCTG ATCGACCCCA ACGGCTGGGC CAAGGACGGC GCGACCGCGC TCGACGACTG GCAGCCTTCG CCCGACGGAA CGAAGGTGGC GTATTCGGTT CAGGACGGCG GCTCGGACTG GCGCACGCTC AAGGTGATCG ATGTCGCCAG CGGGCAGGTG CTGTCCGATA CGGTCGAGCA CGTGAAGTTC TCGCACATCG CCTGGGCGGG CAACGAAGCG GTCGTCTATT CGCGTTTCCC TGCGCCCAAG GCGGGCGAGG CGTTCCAGGC GGTCAGTTCC AACCAGTCGG TCTGGCTGCA CAAGCTGGGT ACGCCGCAGT CGGAGGACCG CCTGCTTCAT GCCACGCCCG ACAATCCCCG GCTCTACCAT TCGGCCGAGA CCACCCATGA CCAGCGCTGG CTGGTGGTGT CGACCAGCAC CGGCAGCGAG AAGGGCAACG CGGTCGGCCT CGCCCGGATA GGCGGCGACT GGAAGGTCCA GCCGCTGGTG AGCACGCTTG CCGACGAGTG GTCGCTGATC GCCGGGATCG GGGACCGGCT GTGGTTCGTG ACCAGCAAGG ATGCGCCGCG CAAGAAGGTG GTCATGGTCG ACATGTCGGG CGCCGCGCCG GTCACCACGA CCGTCGTGCC GGAAAGCGAC GACGTGCTGG AAAGCGCGAA GGTCGTCGGG GATCGTCTGG TTCTCGGCTA TCTGCGCGAC GTCAAGGCCG AACTGCGGTT GGCGACGCTC GACGGCAAGC CTGCCGGAAC CCTTGCCCTG CCGGGCATCG GGAGCATCGG CGGCGTGGTC GGGGAGCCGG GCGACCCGCA GGGCCACTTC GCGTTCTCGG GCTTCACCCA GCCCGCCACG ATCTATGCCT TCGACGCTGG CGATGCCGCG TCCGCCAAGG TCTGGGCGGC GCCGAAGCTG ACCTTCGATC CGGCCCGGTT CGAGACGCGG CAGGTGTTCT ATCCTTCGAA GGACGGAACC CGGATTCCGA TGTTCGTTGT CCGCCGCAAG GACCTTGCCG GTCCGCTGCC GACGATCCTC TATGGCTACG GCGGTTTCAA CATTTCGGTC CTTCCGGCCT TTTCGGCGGG ACGCATGGCC TGGCTCGATG CCGGCGGTGC GTTCGCCGTC GCCAACATCC GGGGTGGGGG CGAGTATGGC GAGGCCTGGC ATCTGGCCGG CAAGGGGCCG ACCAAGCAGA ACGTGTTCGA CGATTTCATC GCCGCCGGGG AATGGCTGAA GGCCAACGGC GTCACGTCCG CCAATGGTCT TGCGGTCGAG GGCGGGTCGA ACGGAGGCCT GCTCGTCGGG GCGGTCGTCA ACCAGCGGCC CGATCTGTTC GCGGCGGCGG TCCCCGCGGT CGGCGTGATG GACATGCTGC GCTTCGACAA GTTCACTGCC GGGCGCGAAT GGGTGTTCGA TTACGGCTAT CCGGAGAAGG AGGAGGACTG GCGCCGCCTG CGCGCCTACT CGCCCTATCA CAATATCGCG TCGGGCAAGG ACTACCCGGC GATCCTCGTG ACCACCGCCG ATACCGACGA CCGGGTGGTT CCGGGCCATA GCTTCAAGTA CGCGGCGGCG CTCCAGGCGG CCTCGATCGG CAGCAAGCCG CACCTCATCC GCATCGAGAC GCGCGCGGGG CACGGATCGG GCAAGCCCGT CGCGAAGCTG ATCGCCGAGA ATGCCGACGT CTACGCCTTC GTCGCGCACT GGACGGGACT GACGCCGAAG GAGTGA
|
Protein sequence | MPSIRPLLAA SALACLAMSM TAEAAPAAMK YPQTERGTVV ETAFGEKVAD PYRWLEADVR VDPKVAAWVD AQSRFTDAYL KALPERPAFE QRLKTLFDFE RFGLPVKAGD LLFFRHNSGL QNQSVLYVRK ADGSGERRVL IDPNGWAKDG ATALDDWQPS PDGTKVAYSV QDGGSDWRTL KVIDVASGQV LSDTVEHVKF SHIAWAGNEA VVYSRFPAPK AGEAFQAVSS NQSVWLHKLG TPQSEDRLLH ATPDNPRLYH SAETTHDQRW LVVSTSTGSE KGNAVGLARI GGDWKVQPLV STLADEWSLI AGIGDRLWFV TSKDAPRKKV VMVDMSGAAP VTTTVVPESD DVLESAKVVG DRLVLGYLRD VKAELRLATL DGKPAGTLAL PGIGSIGGVV GEPGDPQGHF AFSGFTQPAT IYAFDAGDAA SAKVWAAPKL TFDPARFETR QVFYPSKDGT RIPMFVVRRK DLAGPLPTIL YGYGGFNISV LPAFSAGRMA WLDAGGAFAV ANIRGGGEYG EAWHLAGKGP TKQNVFDDFI AAGEWLKANG VTSANGLAVE GGSNGGLLVG AVVNQRPDLF AAAVPAVGVM DMLRFDKFTA GREWVFDYGY PEKEEDWRRL RAYSPYHNIA SGKDYPAILV TTADTDDRVV PGHSFKYAAA LQAASIGSKP HLIRIETRAG HGSGKPVAKL IAENADVYAF VAHWTGLTPK E
|
| |