Gene Saro_2398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2398 
Symbol 
ID3916717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2564711 
End bp2566306 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content68% 
IMG OID640445153 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_497668 
Protein GI87200411 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCGGTCAA AGCAAAACAG CGCAAGAAGC GCAGGCATGC GTGTCGCCAA ACAGATTACT 
GCGTTCCTCA CCCTGCTGTT CGCCGCGCTT GCGGCACCGG CGGCTCTTGC CGATCCGGCA
GACATTTCGG CGGCAAGCCG CTCGGTCGTT CGCGTCGTCA TCATCGAAAG CGACGGCGAC
CGCGCCAACC TCATCACCCA CGGCACCGGC TTTGCCGTCA CGCCCAATCT CATCGTCACC
AACGCGCACG TGGTCGAGGA ACTGCGCCGC GACGACACCC TGATCGCTGG CGTCGTCCCC
GCCGAGGGAC GCAACGGCTT CCCGGCAAAG CTGGTGGCCT ACTCGCCGGG CAACGACCTC
GCGCTGCTCA GGATCGAAGG TGGCGGATCG ATCACCCCGA TCACCCTGTT CCCCGGCGCG
CCGGGCGATG GATCGGAAGT CTATGCGGTC GGCTATCCCG GCAACGTCGA CCTCGCGCAG
GGGCTCTCGA TGGCCGATCT TGTCACCCCC CAGGCAGCCG TCAAGACGCG CGGCTATCTG
TCCGGCGGGC GCTCCTCGCG CTCGTTCGAC ACGTTGCTCC ACACCGCTCC GCTCGGCTCG
GGCAACTCTG GCGGCCCGCT GCTCGATTCC TGCGGGCGGG TGATCGGGGT CAACTCGTTC
GGCACGGTCA GCGACAACAG CACCGATTCG GCCTTCTACT TCGCCATCTC GATGCGCGAA
CTTTCGGCCT TCCTGCGCCG CGCCAACGTG GATGCGCACA CCAGCGGCCT TCCCTGCCGC
TCCATCGCCG ATCTCGATCG CGCCGAGGCA GAGCGCGCGG CCGGCGAACA GGCCCGTCTT
GCCGCTCAGA CCGCAGCCCA GGCCGACGCG AAGCAACGCG CGATGGACAA GGCGCGCCGC
GACGCGGAAC TGGCGATTCT CTCCGAACGC GACAACGGCA TGGCGCTTGC CGCGCTGCTT
CTCGTCGCGG CGCTCGGCGC GGGCGGATGG GGCATGGTCC AGGCCTCGCG CCATCGCGGG
CGGTTCCAGC GCAAGCACGT GTTTGGCGCA GGCGCACTGC TGCTGGCAGC GGTCGTGACC
TGGTTCCTCC GCCCCTCGCT CGCCAGCATC GACCAGCGCG CCCGCGAGCT TGTGCCCGCG
GCTGACGCCA GCAGCCCCGC AGGCTCGGCG TCAGGCATGG CCGAGGCAGG CAGCACCCGC
ATGGTCTGCG TCCTCGATCC GGAACGCAGC CGGGTCACCG TCTCAGACAT CACCGACGTC
CCCTTCGAAT GGAGCGGCGA CGGCTGCGTC AACGGCAAGA CCCAGTACGG CCTGGCACGC
GACGGCTGGT CGCGAATCCT CGTGCCCAAC GGCGAAGAGA CGGTTTCGGT CAACTCCTAC
GATCCAGACA GCCACACCTA CACGGTCGAG CGATTCCTCG TCGGGCTCGA CGCAATGACC
AAGGCGCGCG CCGAACGCGC CCGCCTCAAC GCCCCTGCCT GCGGTGCGGG CGAGGATGCG
GCGCGGAAAT TCGGGGATAG TCAGCAGGCT ATCAAGGCCC TGCTCCCGCC CGAGCCCAAC
GAACGGATGC GCTACAACTG CCAGCCGGCG CCCTGA
 
Protein sequence
MRSKQNSARS AGMRVAKQIT AFLTLLFAAL AAPAALADPA DISAASRSVV RVVIIESDGD 
RANLITHGTG FAVTPNLIVT NAHVVEELRR DDTLIAGVVP AEGRNGFPAK LVAYSPGNDL
ALLRIEGGGS ITPITLFPGA PGDGSEVYAV GYPGNVDLAQ GLSMADLVTP QAAVKTRGYL
SGGRSSRSFD TLLHTAPLGS GNSGGPLLDS CGRVIGVNSF GTVSDNSTDS AFYFAISMRE
LSAFLRRANV DAHTSGLPCR SIADLDRAEA ERAAGEQARL AAQTAAQADA KQRAMDKARR
DAELAILSER DNGMALAALL LVAALGAGGW GMVQASRHRG RFQRKHVFGA GALLLAAVVT
WFLRPSLASI DQRARELVPA ADASSPAGSA SGMAEAGSTR MVCVLDPERS RVTVSDITDV
PFEWSGDGCV NGKTQYGLAR DGWSRILVPN GEETVSVNSY DPDSHTYTVE RFLVGLDAMT
KARAERARLN APACGAGEDA ARKFGDSQQA IKALLPPEPN ERMRYNCQPA P