Gene Information |
Locus tag | Saro_2398 |
Symbol | |
ID | 3916717 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2564711 |
End bp | 2566306 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640445153 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_497668 |
Protein GI | 87200411 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCGGTCAA AGCAAAACAG CGCAAGAAGC GCAGGCATGC GTGTCGCCAA ACAGATTACT GCGTTCCTCA CCCTGCTGTT CGCCGCGCTT GCGGCACCGG CGGCTCTTGC CGATCCGGCA GACATTTCGG CGGCAAGCCG CTCGGTCGTT CGCGTCGTCA TCATCGAAAG CGACGGCGAC CGCGCCAACC TCATCACCCA CGGCACCGGC TTTGCCGTCA CGCCCAATCT CATCGTCACC AACGCGCACG TGGTCGAGGA ACTGCGCCGC GACGACACCC TGATCGCTGG CGTCGTCCCC GCCGAGGGAC GCAACGGCTT CCCGGCAAAG CTGGTGGCCT ACTCGCCGGG CAACGACCTC GCGCTGCTCA GGATCGAAGG TGGCGGATCG ATCACCCCGA TCACCCTGTT CCCCGGCGCG CCGGGCGATG GATCGGAAGT CTATGCGGTC GGCTATCCCG GCAACGTCGA CCTCGCGCAG GGGCTCTCGA TGGCCGATCT TGTCACCCCC CAGGCAGCCG TCAAGACGCG CGGCTATCTG TCCGGCGGGC GCTCCTCGCG CTCGTTCGAC ACGTTGCTCC ACACCGCTCC GCTCGGCTCG GGCAACTCTG GCGGCCCGCT GCTCGATTCC TGCGGGCGGG TGATCGGGGT CAACTCGTTC GGCACGGTCA GCGACAACAG CACCGATTCG GCCTTCTACT TCGCCATCTC GATGCGCGAA CTTTCGGCCT TCCTGCGCCG CGCCAACGTG GATGCGCACA CCAGCGGCCT TCCCTGCCGC TCCATCGCCG ATCTCGATCG CGCCGAGGCA GAGCGCGCGG CCGGCGAACA GGCCCGTCTT GCCGCTCAGA CCGCAGCCCA GGCCGACGCG AAGCAACGCG CGATGGACAA GGCGCGCCGC GACGCGGAAC TGGCGATTCT CTCCGAACGC GACAACGGCA TGGCGCTTGC CGCGCTGCTT CTCGTCGCGG CGCTCGGCGC GGGCGGATGG GGCATGGTCC AGGCCTCGCG CCATCGCGGG CGGTTCCAGC GCAAGCACGT GTTTGGCGCA GGCGCACTGC TGCTGGCAGC GGTCGTGACC TGGTTCCTCC GCCCCTCGCT CGCCAGCATC GACCAGCGCG CCCGCGAGCT TGTGCCCGCG GCTGACGCCA GCAGCCCCGC AGGCTCGGCG TCAGGCATGG CCGAGGCAGG CAGCACCCGC ATGGTCTGCG TCCTCGATCC GGAACGCAGC CGGGTCACCG TCTCAGACAT CACCGACGTC CCCTTCGAAT GGAGCGGCGA CGGCTGCGTC AACGGCAAGA CCCAGTACGG CCTGGCACGC GACGGCTGGT CGCGAATCCT CGTGCCCAAC GGCGAAGAGA CGGTTTCGGT CAACTCCTAC GATCCAGACA GCCACACCTA CACGGTCGAG CGATTCCTCG TCGGGCTCGA CGCAATGACC AAGGCGCGCG CCGAACGCGC CCGCCTCAAC GCCCCTGCCT GCGGTGCGGG CGAGGATGCG GCGCGGAAAT TCGGGGATAG TCAGCAGGCT ATCAAGGCCC TGCTCCCGCC CGAGCCCAAC GAACGGATGC GCTACAACTG CCAGCCGGCG CCCTGA
|
Protein sequence | MRSKQNSARS AGMRVAKQIT AFLTLLFAAL AAPAALADPA DISAASRSVV RVVIIESDGD RANLITHGTG FAVTPNLIVT NAHVVEELRR DDTLIAGVVP AEGRNGFPAK LVAYSPGNDL ALLRIEGGGS ITPITLFPGA PGDGSEVYAV GYPGNVDLAQ GLSMADLVTP QAAVKTRGYL SGGRSSRSFD TLLHTAPLGS GNSGGPLLDS CGRVIGVNSF GTVSDNSTDS AFYFAISMRE LSAFLRRANV DAHTSGLPCR SIADLDRAEA ERAAGEQARL AAQTAAQADA KQRAMDKARR DAELAILSER DNGMALAALL LVAALGAGGW GMVQASRHRG RFQRKHVFGA GALLLAAVVT WFLRPSLASI DQRARELVPA ADASSPAGSA SGMAEAGSTR MVCVLDPERS RVTVSDITDV PFEWSGDGCV NGKTQYGLAR DGWSRILVPN GEETVSVNSY DPDSHTYTVE RFLVGLDAMT KARAERARLN APACGAGEDA ARKFGDSQQA IKALLPPEPN ERMRYNCQPA P
|
| |
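The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database, under two assumptions: that Start bp and End bp are 1-based, inclusive coordinates, and that Gene Length counts the terminal stop codon (the gene sequence above ends in TGA). The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2564711
END_BP = 2566306
GENE_LENGTH_BP = 1596
PROTEIN_LENGTH_AA = 531


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions the reported values agree: the span between the two coordinates matches Gene Length, and one third of Gene Length minus the stop codon matches Protein Length.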