Gene Information |
Locus tag | Saro_3331 |
Symbol | |
ID | 3915978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 3551124 |
End bp | 3552656 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640446116 |
Product | peptidase S1C, Do |
Protein accession | YP_498600 |
Protein GI | 87201343 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
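The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch that assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3551124
end_bp = 3552656

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, codons, protein_length)
```

Under these assumptions the result matches the reported gene length of 1533 bp and protein length of 510 aa.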
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.107916 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGATACG CTTACGGTGT GACTTCGGCG CTGCTGCTCG CAGGCAGCGC ACTGACCCTG GTGACGGGTT TTCCGGCAGG TGCGCAGGTC GCGCAGAACG ACCAGTCCGA GATGCGGACG GTCGTGCCAC GCGCGGGCGC CCCGGCAAGT TTTGCAGACC TTACCCAGCA ACTCCAGCCG GCGGTCGTGA ACATCTCCAC CCGCCAGCGC GTGAAGGTGC AGGCCAACAA CCCCTTCGCC GGCACGCCGT TCGGCGACCT GTTCGGCGGC GGACAAGGCA ACGGCCCGCA GACGCGCGAG GCTCAGTCGC TGGGTTCGGG CTTCATCATT TCCGCTGACG GCTATGTCGT GACGAACAAC CACGTCATTA CCGCCGACGG GCAGGGCGAG GTCGAATCGA TCACCGTCAC CACCCCCGAC GGCACCGAAT ACCCGGCCAA GCTTATCGGC AAGGACGCGG CATCGGACCT TGCCGTGCTC AAGATCAGCC GCCCCACCGC CTTCCCGTTC GTGAAGTTCG GCGATTCGCG CAAGGCACGC GTGGGTGACT GGGTGATCGC GATCGGCAAC CCGTTCGGAC TCGGTGGCAC GGTCACGCAG GGGATCATCT CGGCGGTCTA CCGCAACACC GGTTCGGGTT CGGCCTATGA CCGATACCTG CAGACCGACG CGTCGATCAA CCGGGGCAAC TCGGGCGGTC CGATGTTCGA CATGCAGGGC AACGTGATCG GCATCAACAA TGCGATCTTC TCGCCCACCG GCGGTTCGGT CGGCATCGGT TTCGCCATCC CCGCCGAAAT CGCCGCCCCC ATCGTGGACA AGCTGCGCGC AGGCCAGGCG ATCGACCGCG GTTACTTGGG CGTGCGCATC CAGCCGCTTT CGGAAGACCT TGCGGCCTCG CTTGGCCTGC CCAAGAACCG GGGCGAGTTC ATCCAGGGCG TCGAGCCCGG CCAGGCTGCG GCCAAGGCTG GCATCCAGGC GGGCGACGTC GTGGTCAAGG TCGACGGCAA GGACGTGACG CCAGACCAGA CCCTGTCGTT CATCGTGGCC AACACCGCGC CCGGCAAGCG CATCCCGATC GAGCTGATCC GCAACGGCCA GCGCCTGACG GTGCAGACCG TCGTCGCCAA GCGTCCGACC GAGGAGGAAC TCGCCCAGCA GAGCTTCGAC CCCGATGCCC AGCAGGACGA CGACCAGTTC GGCGCTCGCC CGCAGCAGCA GGGGCCCAGC GTTCTCCAGA ACGCGCTCGG CGTCGCAGCG ATCCCGCTGA CCCCGCAGAT CGCACGCCAG CTTGGCGCGG GCGAGGATGC CAAGGGCGTC GTAATCACGG CGGTCGACGG ATCGTCCGAT GCCGCGGCCA AGGGCCTGCA GCGTGGCGAC ATCGTGCTTT CGGCCAACTA CGTCACGGTT ACCACCTTGG CCGATCTCGA ACGGATCGTG CGCAACGCCA AGACCGAAGG TCGTGAAGCG GTGCTGCTGC GCATCCAGCG CCGTGGCCAG CCACCCATCT ACATGCCTGT CCGCGTGCGC TGA
|
Protein sequence | MRYAYGVTSA LLLAGSALTL VTGFPAGAQV AQNDQSEMRT VVPRAGAPAS FADLTQQLQP AVVNISTRQR VKVQANNPFA GTPFGDLFGG GQGNGPQTRE AQSLGSGFII SADGYVVTNN HVITADGQGE VESITVTTPD GTEYPAKLIG KDAASDLAVL KISRPTAFPF VKFGDSRKAR VGDWVIAIGN PFGLGGTVTQ GIISAVYRNT GSGSAYDRYL QTDASINRGN SGGPMFDMQG NVIGINNAIF SPTGGSVGIG FAIPAEIAAP IVDKLRAGQA IDRGYLGVRI QPLSEDLAAS LGLPKNRGEF IQGVEPGQAA AKAGIQAGDV VVKVDGKDVT PDQTLSFIVA NTAPGKRIPI ELIRNGQRLT VQTVVAKRPT EEELAQQSFD PDAQQDDDQF GARPQQQGPS VLQNALGVAA IPLTPQIARQ LGAGEDAKGV VITAVDGSSD AAAKGLQRGD IVLSANYVTV TTLADLERIV RNAKTEGREA VLLRIQRRGQ PPIYMPVRVR
|
| |
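The gene sequence begins with GTG rather than ATG, yet the protein sequence begins with M. This is consistent with translation table 11, in which GTG is one of several permitted start codons and an initiating codon is translated as methionine. The sketch below illustrates this on the first 30 bases of the gene sequence only. The helper name `translate` and the hand-built codon table are illustrative assumptions and are not taken from the database.

```python
from itertools import product

# Codon table in the conventional TCAG order. The amino acid assignments
# are the same for the standard code and for translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a coding sequence given 5'->3', stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        amino = CODON_TABLE[codon]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
head = "GTGCGATACG CTTACGGTGT GACTTCGGCG"
print(translate(head))
```

The output for this excerpt corresponds to the first ten residues of the listed protein sequence. Note that the gene sequence as displayed already reads in the coding direction, so no reverse complement is applied here even though the gene lies on the minus strand of the replicon.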