Gene Dshi_0895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_0895 
Symbol 
ID5710585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp912355 
End bp914220 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content73% 
IMG OID641266805 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_001532241 
Protein GI159043447 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.693546 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.575051 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGACT ATTTCCTCGT CAAGACCCCG CTCACGCTGG AGAACTGCCT GTCCCATGAC 
GGCGCGCTGG TTCTCGAACA ACATGCCGAA TTGCGCGCCA TGCTCGAAGC GCGGGCGCCG
GCGGCGGCGG GTCTGTTTGC CGAGCCCCTG ATCAGCCGGG GCAACGACCA GGCGGCGGCG
TCGGTGTCGT GGTACGGCGA TGTGGATGGC CAGCCGGTGC CCCTGTCGCG GCTCGATTCC
GCCTCGCGTG CAGAGGTGCA GGCACGGCTG CAGGCGCAGG TGACGCCGCT CCTGCCGCTG
CTGGATGACC CGGAGGTCGG CGCGATGCTG TCGCGGGCGC TCTACACGCT GGAGCCAGGT
TCGATCGTGG CGGTCGACGG CACCCCGCTG TTGCTGAACT GGGGGATGTT GCCGGACGGG
TTCGAGCGGG ACCGCAGCGC GCGGGCCAAT CATTTCGCGC AGACCCTGGG GGCGTTCGTG
CCGTTCCCGG CGCCGCCGCC CCTCGGCCCG TCCGAAGCGC AGAAGTATCG CGAGGCGATC
GCGGCTCCGC GGGCCGCGGC TGGCGACACA GGTCCCGCGC CCGCGACAGG TGCCGCTGCA
GCCGCCGCCG CGACCGGCGG TATGGCGGCA GCATCGGCTG CCAAGGCCCC ACCACCGCCC
CCGTCCCCGC CGCCTGCCGA GGCGGAACCG CGCCGGGTGG GGCCGGGGGG CTGGGTGCCC
CTGCTGGTGC TGACGGTGCT GGCCGCGGTC GTGCTGCTCT GGCTGTTGCT GCCGGGCACG
CGGCTCTTCC CGAACGACCC GTCGGAACAG GCGATCTCGG ACGTGGCGGC GGCGGAACTG
GCCGAGGAGG TGAACGTCGC CCTGGAGGCG CGGCTGGCCA GCCTCCAGGC GGCGCTGGAC
GGGGCGCAAT GCCGGGCGGA TGGCACGTTG CTGATGCCCG ACGGGATGAC CATCGAGGGC
CTGCTGCCGC CCGACCCCCG TGATCCGAAC GACCGCGCCG GGGCCATCGT GCCCGCCGAT
CTCACCCCGA TCCTGCCGCC CGATCCCGCG CGGGTGGCGG TGCCGACAGC CACCGGCACG
CTGGAGACGG CGAACCTGCT GGCGCTGGTC GATGCGCGCA CGGCGCTGGT GATCGCGCAG
ACGGCGACGG GCACCGGGAC CGGGACGGGG TTCTTCGTGG GGCCGGACCT CCTGGTGACG
AATTTCCACG TGGTTGAGGG GGCTGCCGCC GACAGCATCT TCGTGACCAA TGAGGCGCTG
GGCGCCGTGC GCCAGGCGCA GTTGCTCAAG CAGTCGGGAC CGTTGCAGGC CACGGGGGCG
GATTTCGCCC TGCTGCGCGT GCCCGGGGCG AACCAGCCCG CGTTCGACAT TCTGCAGGGC
ACCGAAAGCC TGCGGCTGCA GGCGGTGATC GCGGCGGGCT ATCCGGGGGA CATCCTGCGC
ACCGACGCGC AGTTTTCCCA GTTGCGCGCG GGGGATCTGA GCGCTGTGCC GCAACTGGCG
GTCACCGACG GGACGGTCAG TGTCGAGCAG GACATGGCGC CGCGCAACAC CCGCGTCGTG
GTCCATTCCG CGCCGATTTC CACCGGCAAC TCGGGCGGGC CGCTTCTGGA CAGTTGCGGA
CGTCTGGTGG GGGTGAACAC CTTCGTGGTG CAGGGCCCCT TGCGGAACCT GAACTTCGCG
CTGGCCAGTC CCGAGCTGCT GGGCTTCCTG CAAGGGACGG GGGCTTTGCC CAATGTGGTT
TCCAGCCCGT GCAGGCCGCA GGTCGCGCGC CCGTCGCCGC CGCCCGCCGT GGCGGCCTTG
CCCGCGCCCG GGGCACCGGC GGAGGGCATC CCGGCCCTGC CGCTGCCCGG CGCGACGCCC
GAGTAG
 
Protein sequence
MADYFLVKTP LTLENCLSHD GALVLEQHAE LRAMLEARAP AAAGLFAEPL ISRGNDQAAA 
SVSWYGDVDG QPVPLSRLDS ASRAEVQARL QAQVTPLLPL LDDPEVGAML SRALYTLEPG
SIVAVDGTPL LLNWGMLPDG FERDRSARAN HFAQTLGAFV PFPAPPPLGP SEAQKYREAI
AAPRAAAGDT GPAPATGAAA AAAATGGMAA ASAAKAPPPP PSPPPAEAEP RRVGPGGWVP
LLVLTVLAAV VLLWLLLPGT RLFPNDPSEQ AISDVAAAEL AEEVNVALEA RLASLQAALD
GAQCRADGTL LMPDGMTIEG LLPPDPRDPN DRAGAIVPAD LTPILPPDPA RVAVPTATGT
LETANLLALV DARTALVIAQ TATGTGTGTG FFVGPDLLVT NFHVVEGAAA DSIFVTNEAL
GAVRQAQLLK QSGPLQATGA DFALLRVPGA NQPAFDILQG TESLRLQAVI AAGYPGDILR
TDAQFSQLRA GDLSAVPQLA VTDGTVSVEQ DMAPRNTRVV VHSAPISTGN SGGPLLDSCG
RLVGVNTFVV QGPLRNLNFA LASPELLGFL QGTGALPNVV SSPCRPQVAR PSPPPAVAAL
PAPGAPAEGI PALPLPGATP E