Gene Saro_0230 details


Gene Information

Locus tag: Saro_0230
Symbol:
ID: 3916218
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 237523
End bp: 238908
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 67%
IMG OID: 640442955
Product: peptidase M48, Ste24p
Protein accession: YP_495512
Protein GI: 87198255
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
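
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and contributes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(237523, 238908)
print(length, protein_length(length))  # 1386 461
```

Under these assumptions the result agrees with the listed values of 1386 bp and 461 aa.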


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0393595
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGCGCA GCGCCCGATT CCCCAGCATC ATCCTTGCCC GCCTGCTGGC GGCTCTTGCC 
GCGCTTGCGC TGTGCGTGGA ACCGGCCGCG GCGCAATCGG TGCTGCGCGA TGCCGAGACC
GAGGCATTGT TCCGCGATGC TTCCGCGCCG ATCTTCAAGG CGGCGGGGTT CAATCCCAAT
GCGGTGGACC TGGTCCTGCT CAACGACGGG TCGATCAACG CCTTCGTCGC GGGCGGGCAG
GCGATCTACA TCCATTCGGG CCTGATCGGC GCGGCCGACA ACGTCAACGA ATTGCAGGGC
GTGATCGCGC ACGAGCTGGG CCACATCACC GGCGGCCACA TCATCCGCTA TGACGAAGGG
CTGAAGCCCG CGACCGGCAT CACCGTGCTG AGCCTCCTGC TGGGCGGACT GGCGGCGGCG
GCGGGATCGC CCGACGCCGC GATGGGCGTT TTCATGGCCG GGCAGCAGGC CGCGCTGGGC
AAGTTCCTGG CTTTCAGTCG CGCGCAGGAA AGCTCTGCCG ACGCGGCGGG CGCGCAGTTC
CTGGCGAAGG CGGGGATTTC CGGGCGTGGC TCGATCGAGT TCTTCAAGAA GCTCCAGAAC
CAGGAGTTCC GCTACGGCTA CAGCCCGCGC CGCAACCCTG ACGCGGAATT CTACAGCACC
CACCCGATGA CCGCGGACCG CCTGACCACG CTGCAGGACA CCTACGAGAA GGACCCGGCC
TGGAACAGCC CGCCTCCCGC GGAACTGCAG GCGCGCTTCC TGCGGGTGAA GGCCAAGCTC
TATGGTTATC TCGCCGAGCC GCAGGACACC CTGCGCGCCT ATCCCGAATA CCTGACCGAC
GTCCCCGCGC GCTATGCCCG GGCCTATGCC TTCCACAAGG AAGCGTTCGT CGACAAGGCG
CTGGACGAGA CGAAGGCGCT GATCGCCAAG GACCCGAAGA ACCCCTATTT CCTCGAGCTG
GAAGGGCAGA TCCTGCTCGA ATCCGGCCGC CCGGCCGAAG CGATCCCGCC GCTGCGCGAG
GCGACGGCGC TGACCGGCAA CGAGCCGCTG ATCGCCACGA CCTTCGGCCA TGCGCTGATC
GCGACCGAGG ACAAGGACAA CTTCGCCGAG GCCGAAAAGG TGCTCAAGAC GGCGGTCGCG
CGCGACAAGG ACAACCCCTT CACCTGGTAC CAGCTCGGCG TGGTCTACGA GGCCAAGGGC
GACATTCCCC GCGCACGGCT GGCAAGCGCA GAGCAGCAGT TGATGAACAT GCAACTCGGC
GATGCGGTGC GCAGCGCCGA AGCCGCCGAG GCCGCGCTGC CCAAGGGCAC GCCCGACTGG
CTGCGCGCGC AGGATATCGC CATGTCGGCG CGGGCAATGC TGGAACGCCA GAAGAAGTCG
CGCTAG
 
Protein sequence
MKRSARFPSI ILARLLAALA ALALCVEPAA AQSVLRDAET EALFRDASAP IFKAAGFNPN 
AVDLVLLNDG SINAFVAGGQ AIYIHSGLIG AADNVNELQG VIAHELGHIT GGHIIRYDEG
LKPATGITVL SLLLGGLAAA AGSPDAAMGV FMAGQQAALG KFLAFSRAQE SSADAAGAQF
LAKAGISGRG SIEFFKKLQN QEFRYGYSPR RNPDAEFYST HPMTADRLTT LQDTYEKDPA
WNSPPPAELQ ARFLRVKAKL YGYLAEPQDT LRAYPEYLTD VPARYARAYA FHKEAFVDKA
LDETKALIAK DPKNPYFLEL EGQILLESGR PAEAIPPLRE ATALTGNEPL IATTFGHALI
ATEDKDNFAE AEKVLKTAVA RDKDNPFTWY QLGVVYEAKG DIPRARLASA EQQLMNMQLG
DAVRSAEAAE AALPKGTPDW LRAQDIAMSA RAMLERQKKS R
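
The gene sequence and protein sequence above can be checked against each other. The sketch below removes the 10-character block spacing, computes a GC fraction, and translates with the standard codon assignments that translation table 11 uses. It makes two assumptions: the first codon is read as M when it is one of the common bacterial initiators (`START_CODONS` holds only a subset of the table 11 start codons), and translation stops at the first stop codon. All names in the code are hypothetical helpers, not part of any database tool.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order (shared by translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a subset of the table 11 initiators, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(raw: str) -> str:
    """Remove the block spacing and line breaks from a displayed sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS, reading an initiator first codon as M."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First displayed row of the gene sequence above.
first_row = "GTGAAGCGCA GCGCCCGATT CCCCAGCATC ATCCTTGCCC GCCTGCTGGC GGCTCTTGCC"
print(translate(clean(first_row)))  # MKRSARFPSIILARLLAALA
```

The first row translates to the first 20 residues of the listed protein. Passing the whole gene sequence through `clean` and `gc_content` gives a value that can be compared with the reported GC content.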