Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_0895 |
Symbol | |
ID | 5710585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 912355 |
End bp | 914220 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641266805 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_001532241 |
Protein GI | 159043447 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.693546 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.575051 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGACT ATTTCCTCGT CAAGACCCCG CTCACGCTGG AGAACTGCCT GTCCCATGAC GGCGCGCTGG TTCTCGAACA ACATGCCGAA TTGCGCGCCA TGCTCGAAGC GCGGGCGCCG GCGGCGGCGG GTCTGTTTGC CGAGCCCCTG ATCAGCCGGG GCAACGACCA GGCGGCGGCG TCGGTGTCGT GGTACGGCGA TGTGGATGGC CAGCCGGTGC CCCTGTCGCG GCTCGATTCC GCCTCGCGTG CAGAGGTGCA GGCACGGCTG CAGGCGCAGG TGACGCCGCT CCTGCCGCTG CTGGATGACC CGGAGGTCGG CGCGATGCTG TCGCGGGCGC TCTACACGCT GGAGCCAGGT TCGATCGTGG CGGTCGACGG CACCCCGCTG TTGCTGAACT GGGGGATGTT GCCGGACGGG TTCGAGCGGG ACCGCAGCGC GCGGGCCAAT CATTTCGCGC AGACCCTGGG GGCGTTCGTG CCGTTCCCGG CGCCGCCGCC CCTCGGCCCG TCCGAAGCGC AGAAGTATCG CGAGGCGATC GCGGCTCCGC GGGCCGCGGC TGGCGACACA GGTCCCGCGC CCGCGACAGG TGCCGCTGCA GCCGCCGCCG CGACCGGCGG TATGGCGGCA GCATCGGCTG CCAAGGCCCC ACCACCGCCC CCGTCCCCGC CGCCTGCCGA GGCGGAACCG CGCCGGGTGG GGCCGGGGGG CTGGGTGCCC CTGCTGGTGC TGACGGTGCT GGCCGCGGTC GTGCTGCTCT GGCTGTTGCT GCCGGGCACG CGGCTCTTCC CGAACGACCC GTCGGAACAG GCGATCTCGG ACGTGGCGGC GGCGGAACTG GCCGAGGAGG TGAACGTCGC CCTGGAGGCG CGGCTGGCCA GCCTCCAGGC GGCGCTGGAC GGGGCGCAAT GCCGGGCGGA TGGCACGTTG CTGATGCCCG ACGGGATGAC CATCGAGGGC CTGCTGCCGC CCGACCCCCG TGATCCGAAC GACCGCGCCG GGGCCATCGT GCCCGCCGAT CTCACCCCGA TCCTGCCGCC CGATCCCGCG CGGGTGGCGG TGCCGACAGC CACCGGCACG CTGGAGACGG CGAACCTGCT GGCGCTGGTC GATGCGCGCA CGGCGCTGGT GATCGCGCAG ACGGCGACGG GCACCGGGAC CGGGACGGGG TTCTTCGTGG GGCCGGACCT CCTGGTGACG AATTTCCACG TGGTTGAGGG GGCTGCCGCC GACAGCATCT TCGTGACCAA TGAGGCGCTG GGCGCCGTGC GCCAGGCGCA GTTGCTCAAG CAGTCGGGAC CGTTGCAGGC CACGGGGGCG GATTTCGCCC TGCTGCGCGT GCCCGGGGCG AACCAGCCCG CGTTCGACAT TCTGCAGGGC ACCGAAAGCC TGCGGCTGCA GGCGGTGATC GCGGCGGGCT ATCCGGGGGA CATCCTGCGC ACCGACGCGC AGTTTTCCCA GTTGCGCGCG GGGGATCTGA GCGCTGTGCC GCAACTGGCG GTCACCGACG GGACGGTCAG TGTCGAGCAG GACATGGCGC CGCGCAACAC CCGCGTCGTG GTCCATTCCG CGCCGATTTC CACCGGCAAC TCGGGCGGGC CGCTTCTGGA CAGTTGCGGA CGTCTGGTGG GGGTGAACAC CTTCGTGGTG CAGGGCCCCT TGCGGAACCT GAACTTCGCG CTGGCCAGTC CCGAGCTGCT GGGCTTCCTG CAAGGGACGG GGGCTTTGCC CAATGTGGTT TCCAGCCCGT GCAGGCCGCA GGTCGCGCGC CCGTCGCCGC CGCCCGCCGT GGCGGCCTTG CCCGCGCCCG GGGCACCGGC GGAGGGCATC CCGGCCCTGC CGCTGCCCGG CGCGACGCCC GAGTAG
|
Protein sequence | MADYFLVKTP LTLENCLSHD GALVLEQHAE LRAMLEARAP AAAGLFAEPL ISRGNDQAAA SVSWYGDVDG QPVPLSRLDS ASRAEVQARL QAQVTPLLPL LDDPEVGAML SRALYTLEPG SIVAVDGTPL LLNWGMLPDG FERDRSARAN HFAQTLGAFV PFPAPPPLGP SEAQKYREAI AAPRAAAGDT GPAPATGAAA AAAATGGMAA ASAAKAPPPP PSPPPAEAEP RRVGPGGWVP LLVLTVLAAV VLLWLLLPGT RLFPNDPSEQ AISDVAAAEL AEEVNVALEA RLASLQAALD GAQCRADGTL LMPDGMTIEG LLPPDPRDPN DRAGAIVPAD LTPILPPDPA RVAVPTATGT LETANLLALV DARTALVIAQ TATGTGTGTG FFVGPDLLVT NFHVVEGAAA DSIFVTNEAL GAVRQAQLLK QSGPLQATGA DFALLRVPGA NQPAFDILQG TESLRLQAVI AAGYPGDILR TDAQFSQLRA GDLSAVPQLA VTDGTVSVEQ DMAPRNTRVV VHSAPISTGN SGGPLLDSCG RLVGVNTFVV QGPLRNLNFA LASPELLGFL QGTGALPNVV SSPCRPQVAR PSPPPAVAAL PAPGAPAEGI PALPLPGATP E
|
| |