Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0230 |
Symbol | |
ID | 3916218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 237523 |
End bp | 238908 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640442955 |
Product | peptidase M48, Ste24p |
Protein accession | YP_495512 |
Protein GI | 87198255 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0393595 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGCGCA GCGCCCGATT CCCCAGCATC ATCCTTGCCC GCCTGCTGGC GGCTCTTGCC GCGCTTGCGC TGTGCGTGGA ACCGGCCGCG GCGCAATCGG TGCTGCGCGA TGCCGAGACC GAGGCATTGT TCCGCGATGC TTCCGCGCCG ATCTTCAAGG CGGCGGGGTT CAATCCCAAT GCGGTGGACC TGGTCCTGCT CAACGACGGG TCGATCAACG CCTTCGTCGC GGGCGGGCAG GCGATCTACA TCCATTCGGG CCTGATCGGC GCGGCCGACA ACGTCAACGA ATTGCAGGGC GTGATCGCGC ACGAGCTGGG CCACATCACC GGCGGCCACA TCATCCGCTA TGACGAAGGG CTGAAGCCCG CGACCGGCAT CACCGTGCTG AGCCTCCTGC TGGGCGGACT GGCGGCGGCG GCGGGATCGC CCGACGCCGC GATGGGCGTT TTCATGGCCG GGCAGCAGGC CGCGCTGGGC AAGTTCCTGG CTTTCAGTCG CGCGCAGGAA AGCTCTGCCG ACGCGGCGGG CGCGCAGTTC CTGGCGAAGG CGGGGATTTC CGGGCGTGGC TCGATCGAGT TCTTCAAGAA GCTCCAGAAC CAGGAGTTCC GCTACGGCTA CAGCCCGCGC CGCAACCCTG ACGCGGAATT CTACAGCACC CACCCGATGA CCGCGGACCG CCTGACCACG CTGCAGGACA CCTACGAGAA GGACCCGGCC TGGAACAGCC CGCCTCCCGC GGAACTGCAG GCGCGCTTCC TGCGGGTGAA GGCCAAGCTC TATGGTTATC TCGCCGAGCC GCAGGACACC CTGCGCGCCT ATCCCGAATA CCTGACCGAC GTCCCCGCGC GCTATGCCCG GGCCTATGCC TTCCACAAGG AAGCGTTCGT CGACAAGGCG CTGGACGAGA CGAAGGCGCT GATCGCCAAG GACCCGAAGA ACCCCTATTT CCTCGAGCTG GAAGGGCAGA TCCTGCTCGA ATCCGGCCGC CCGGCCGAAG CGATCCCGCC GCTGCGCGAG GCGACGGCGC TGACCGGCAA CGAGCCGCTG ATCGCCACGA CCTTCGGCCA TGCGCTGATC GCGACCGAGG ACAAGGACAA CTTCGCCGAG GCCGAAAAGG TGCTCAAGAC GGCGGTCGCG CGCGACAAGG ACAACCCCTT CACCTGGTAC CAGCTCGGCG TGGTCTACGA GGCCAAGGGC GACATTCCCC GCGCACGGCT GGCAAGCGCA GAGCAGCAGT TGATGAACAT GCAACTCGGC GATGCGGTGC GCAGCGCCGA AGCCGCCGAG GCCGCGCTGC CCAAGGGCAC GCCCGACTGG CTGCGCGCGC AGGATATCGC CATGTCGGCG CGGGCAATGC TGGAACGCCA GAAGAAGTCG CGCTAG
|
Protein sequence | MKRSARFPSI ILARLLAALA ALALCVEPAA AQSVLRDAET EALFRDASAP IFKAAGFNPN AVDLVLLNDG SINAFVAGGQ AIYIHSGLIG AADNVNELQG VIAHELGHIT GGHIIRYDEG LKPATGITVL SLLLGGLAAA AGSPDAAMGV FMAGQQAALG KFLAFSRAQE SSADAAGAQF LAKAGISGRG SIEFFKKLQN QEFRYGYSPR RNPDAEFYST HPMTADRLTT LQDTYEKDPA WNSPPPAELQ ARFLRVKAKL YGYLAEPQDT LRAYPEYLTD VPARYARAYA FHKEAFVDKA LDETKALIAK DPKNPYFLEL EGQILLESGR PAEAIPPLRE ATALTGNEPL IATTFGHALI ATEDKDNFAE AEKVLKTAVA RDKDNPFTWY QLGVVYEAKG DIPRARLASA EQQLMNMQLG DAVRSAEAAE AALPKGTPDW LRAQDIAMSA RAMLERQKKS R
|
| |