Gene Saro_3331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3331 
Symbol 
ID3915978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3551124 
End bp3552656 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content67% 
IMG OID640446116 
Productpeptidase S1C, Do 
Protein accessionYP_498600 
Protein GI87201343 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.107916 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGATACG CTTACGGTGT GACTTCGGCG CTGCTGCTCG CAGGCAGCGC ACTGACCCTG 
GTGACGGGTT TTCCGGCAGG TGCGCAGGTC GCGCAGAACG ACCAGTCCGA GATGCGGACG
GTCGTGCCAC GCGCGGGCGC CCCGGCAAGT TTTGCAGACC TTACCCAGCA ACTCCAGCCG
GCGGTCGTGA ACATCTCCAC CCGCCAGCGC GTGAAGGTGC AGGCCAACAA CCCCTTCGCC
GGCACGCCGT TCGGCGACCT GTTCGGCGGC GGACAAGGCA ACGGCCCGCA GACGCGCGAG
GCTCAGTCGC TGGGTTCGGG CTTCATCATT TCCGCTGACG GCTATGTCGT GACGAACAAC
CACGTCATTA CCGCCGACGG GCAGGGCGAG GTCGAATCGA TCACCGTCAC CACCCCCGAC
GGCACCGAAT ACCCGGCCAA GCTTATCGGC AAGGACGCGG CATCGGACCT TGCCGTGCTC
AAGATCAGCC GCCCCACCGC CTTCCCGTTC GTGAAGTTCG GCGATTCGCG CAAGGCACGC
GTGGGTGACT GGGTGATCGC GATCGGCAAC CCGTTCGGAC TCGGTGGCAC GGTCACGCAG
GGGATCATCT CGGCGGTCTA CCGCAACACC GGTTCGGGTT CGGCCTATGA CCGATACCTG
CAGACCGACG CGTCGATCAA CCGGGGCAAC TCGGGCGGTC CGATGTTCGA CATGCAGGGC
AACGTGATCG GCATCAACAA TGCGATCTTC TCGCCCACCG GCGGTTCGGT CGGCATCGGT
TTCGCCATCC CCGCCGAAAT CGCCGCCCCC ATCGTGGACA AGCTGCGCGC AGGCCAGGCG
ATCGACCGCG GTTACTTGGG CGTGCGCATC CAGCCGCTTT CGGAAGACCT TGCGGCCTCG
CTTGGCCTGC CCAAGAACCG GGGCGAGTTC ATCCAGGGCG TCGAGCCCGG CCAGGCTGCG
GCCAAGGCTG GCATCCAGGC GGGCGACGTC GTGGTCAAGG TCGACGGCAA GGACGTGACG
CCAGACCAGA CCCTGTCGTT CATCGTGGCC AACACCGCGC CCGGCAAGCG CATCCCGATC
GAGCTGATCC GCAACGGCCA GCGCCTGACG GTGCAGACCG TCGTCGCCAA GCGTCCGACC
GAGGAGGAAC TCGCCCAGCA GAGCTTCGAC CCCGATGCCC AGCAGGACGA CGACCAGTTC
GGCGCTCGCC CGCAGCAGCA GGGGCCCAGC GTTCTCCAGA ACGCGCTCGG CGTCGCAGCG
ATCCCGCTGA CCCCGCAGAT CGCACGCCAG CTTGGCGCGG GCGAGGATGC CAAGGGCGTC
GTAATCACGG CGGTCGACGG ATCGTCCGAT GCCGCGGCCA AGGGCCTGCA GCGTGGCGAC
ATCGTGCTTT CGGCCAACTA CGTCACGGTT ACCACCTTGG CCGATCTCGA ACGGATCGTG
CGCAACGCCA AGACCGAAGG TCGTGAAGCG GTGCTGCTGC GCATCCAGCG CCGTGGCCAG
CCACCCATCT ACATGCCTGT CCGCGTGCGC TGA
 
Protein sequence
MRYAYGVTSA LLLAGSALTL VTGFPAGAQV AQNDQSEMRT VVPRAGAPAS FADLTQQLQP 
AVVNISTRQR VKVQANNPFA GTPFGDLFGG GQGNGPQTRE AQSLGSGFII SADGYVVTNN
HVITADGQGE VESITVTTPD GTEYPAKLIG KDAASDLAVL KISRPTAFPF VKFGDSRKAR
VGDWVIAIGN PFGLGGTVTQ GIISAVYRNT GSGSAYDRYL QTDASINRGN SGGPMFDMQG
NVIGINNAIF SPTGGSVGIG FAIPAEIAAP IVDKLRAGQA IDRGYLGVRI QPLSEDLAAS
LGLPKNRGEF IQGVEPGQAA AKAGIQAGDV VVKVDGKDVT PDQTLSFIVA NTAPGKRIPI
ELIRNGQRLT VQTVVAKRPT EEELAQQSFD PDAQQDDDQF GARPQQQGPS VLQNALGVAA
IPLTPQIARQ LGAGEDAKGV VITAVDGSSD AAAKGLQRGD IVLSANYVTV TTLADLERIV
RNAKTEGREA VLLRIQRRGQ PPIYMPVRVR