Gene Information |
Locus tag | Saro_1837 |
Symbol | |
ID | 3918397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1935151 |
End bp | 1937463 |
Gene Length | 2313 bp |
Protein Length | 770 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640444579 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_497111 |
Protein GI | 87199854 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
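The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that contributes no residue. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length_bp(1935151, 1937463)
print(length_bp)                     # 2313
print(protein_length_aa(length_bp))  # 770
```

Both values match the "Gene Length" and "Protein Length" fields of this record.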
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00570496 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGCGT GGAAGTTCGC GGGGCAATCG GCCCTTTTGG TTCTGGCCTT GGCCCTGACA GGCTGCGGTG GAGGCGGCGG AGTAGCGAGC ACGCCGACGC CGACGCCGAC GCCCAGCCCG ACCCCATCGC CGGCCCCCAC GCCAACGCCG ACGCCCACGC CCACGCCCAC GCCTACGCCG GTTTCCTACA ATACCGCCGA ATATCGCAAT TCGGACGGCC CCGTGTTCCA CGGCGCGATT ACCGCGTGGG AAGCGGGGGC ATCCGGCGCG GGGACGACGA TTGCCATCGT GGACAGCGGG ATCGATACCG CCAACAGCGA GTTCGCGGGC CGGATCTCGT CAGCATCGGC GGACGTCGCC GGATCGCGCG GCGTAAGGAA CACCGGGGAC GATCACGGAA CGCAGGTCGC GCTCGTCGCG GCGGCAGCGC GCAACGATTA CGGAGTGATG GGCATCGCCT GGGGCGCCAC AGTACAGATG CTGCGCGCGG ATACACCCGG AACCTGCGCA AGTGCGGATG GGTGCACATT CTCCGACACG AACATCGCCG AAGGAATCGA CGCAGCGGTT GGAGCGGGCG CCCGGGTGGT CAATCTCTCG CTTGGTGGCG AAGCGGCGGA TACAAACCTG CGCGCCGCCG TCGCCCGCGC CACGTCCGCC GGGCTGATCG TGGTGGTATC AGCCGGCAAC GACGGGCAGA GCACCGATCC GGCCATTGAC CCGAACAACC CCGACCCCTT CGCGGTGTCG CTGCTCCAGG CAGGCGGTGG CAACGTGATC ATCGTCGGAT CGGTCAGCTC CGCCGGGCTG GTCTCGGACT TCAGCAACCG TGCAGGCTCG TCGGCATCCT CGTTCCTCAT GGCCCAAGGC GAGACGATCT GCTGCGTGTA CGAAGGCAGC CAGATCCGCG TGGTCGACGG TTTCCGCACG GCCGTGAATG GAACAAGCTT CGCCGCGCCC CAGGTCTCGG GCGCGGTCGC TCTTCTCGCC CAGGCGTTCC CGAACCTCAC GGGCGCGCAA ATCGTCAGCC TCCTGCTGAG TACGGCGCGC GAGGGCGGGG CAATCGGCAC CGACCCTATC TACGGGCGCG GCATTCTCGA TATCGCGCGA GCCTTTTCGC CGCAGGGTAG CACTGGCCTC GCCGGAACCA GCGTCGCCGT CTCGCTCGAC AGCGTTGCGG GCACGACTTC GGGCGCGATG GGCGATGCGG TTTCGGGGGC CTCGCTCGGC ACGGTCATGC TCGACGGCTA CGGCAGGGCC TATGCTCTCG ATCTCGGACG GGGGCTCCGT TTCGGTCAGG CGCGTCAGCC CTTGCTGGAG GCGCTGCGGG CCGGGGCGCG GCCAACCGGG TTCGAGGCCG GCGACGTGTC GCTCGCCTTC TCCGTGGACC GGCGGTTCGG GGCCGTGCCG TTGCGCCTTG CGCCGGGCGA GCGGGAGCAG GCTCGCGTGC TGGCATCGAC GCTGGTGACG CGGATCGGGA ACGCGCGCGA ACTCGCGCTC GGATGGAACA CATCCTCCGA TGCGCTGGCG GCACGTTTGC AGAAACGGCG CGAGGCGCAG TTCCTCGTCG CCGGTACGGC CAATGGCATG TTCGAGCGTC CCGAGCTTGG CTTCGCGGCA CGGCAGAGGT TCGGCGGCAT GGGCGTCACC GTCAGCGGCG GTCGCAGCCT TGTCTGGCGG CCCGAAGCAT TGCGCGACCG TCGCGAGGAC AGGCTGGCGC GCCTTGGCGT GGCGTTGGAT GGCCACGGGG GCGATGCCCT CGACTGGCGG CTTGGCCTGG GCCTCATGCG CGAGGAGCGT 
ACCGTGCTGG GCGCACGCTT CGCCGATGCG CTGGGCGGCG GCGGTGCCAC GACCGGCACG ATCGAGCCCG GCGTTACCTG GCGGCCGATG CGCGGATGGC ACGTGGGCGC AACGGGCAGC CTGGGCTTCA CCCGGGTCGC GGGAACCTCT GTGGCGCCCG GCGGGTCGAG GCTGGTGTCG TCGTCGTGGG CGTTCGACGT TGCGCGCGAT GGCGCCCTGA TCGAGGGCGA CCGCCTTGGC CTGCGCCTGT CGCAGCCGTT GCGGGTGGAA AGCGGCGGGT TGGCGCTGAA CCTTCCGGTC GCTTGGGACT ATGCGACGCA GACTGCCCGT TTTGGCCGCG TCCCTTTATC CCTTGCGCCA AAGGGGCGCG AGCTTGACGC GGAAGTGGCG TGGTCGGCGC GGCTGTGGGA CGGCGCCTTT TCCGCCAGCC TGTTCTGGCG GCGCGATCCG GGGCACTTCG CCGCCGCGCC CGACGATGCC GGCGGGGCGG TGCGCTGGAC GGCGGGGTTC TAG
|
Protein sequence | MIAWKFAGQS ALLVLALALT GCGGGGGVAS TPTPTPTPSP TPSPAPTPTP TPTPTPTPTP VSYNTAEYRN SDGPVFHGAI TAWEAGASGA GTTIAIVDSG IDTANSEFAG RISSASADVA GSRGVRNTGD DHGTQVALVA AAARNDYGVM GIAWGATVQM LRADTPGTCA SADGCTFSDT NIAEGIDAAV GAGARVVNLS LGGEAADTNL RAAVARATSA GLIVVVSAGN DGQSTDPAID PNNPDPFAVS LLQAGGGNVI IVGSVSSAGL VSDFSNRAGS SASSFLMAQG ETICCVYEGS QIRVVDGFRT AVNGTSFAAP QVSGAVALLA QAFPNLTGAQ IVSLLLSTAR EGGAIGTDPI YGRGILDIAR AFSPQGSTGL AGTSVAVSLD SVAGTTSGAM GDAVSGASLG TVMLDGYGRA YALDLGRGLR FGQARQPLLE ALRAGARPTG FEAGDVSLAF SVDRRFGAVP LRLAPGEREQ ARVLASTLVT RIGNARELAL GWNTSSDALA ARLQKRREAQ FLVAGTANGM FERPELGFAA RQRFGGMGVT VSGGRSLVWR PEALRDRRED RLARLGVALD GHGGDALDWR LGLGLMREER TVLGARFADA LGGGGATTGT IEPGVTWRPM RGWHVGATGS LGFTRVAGTS VAPGGSRLVS SSWAFDVARD GALIEGDRLG LRLSQPLRVE SGGLALNLPV AWDYATQTAR FGRVPLSLAP KGRELDAEVA WSARLWDGAF SASLFWRRDP GHFAAAPDDA GGAVRWTAGF
|
| |
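The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. A minimal sketch follows, assuming the whitespace is purely cosmetic and that GC content is the plain fraction of G and C bases. The record does not state how its 70% figure was derived, and the helper names here are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, copied from the record above.
first_blocks = "ATGATTGCGT GGAAGTTCGC"
seq = clean_sequence(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.5
```

Passing the full gene sequence through the same two functions would give a figure to compare with the "GC content" field.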