Gene Information |
Locus tag | Dshi_3911 |
Symbol | |
ID | 5714440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009956 |
Strand | + |
Start bp | 139092 |
End bp | 140549 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641276824 |
Product | protease Do |
Protein accession | YP_001542120 |
Protein GI | 159046449 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
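The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon; neither is stated in the record, but both are consistent with its numbers. The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(139092, 140549)
print(length)                  # 1458
print(protein_length(length))  # 485
```

Both results agree with the Gene Length (1458 bp) and Protein Length (485 aa) fields listed for Dshi_3911.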
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATTGCT TACCCGCCCT TCCACGCGCT CTTGCGGTCC TTGCCCTACT GCCCGCTGGT GGCTCAGCCC TGCCGGCCAT AGCGCAACAG ACAGATTTCG ACAGTCCGAC CGATTTCTCC GAAATGGTGA CGGAACGCCT GCCTGCCGTG GTCGGTATCC TGTCGACCGG CCCGGCGCCT GATCCGTCGC CTGCGATCCA GCCGCAACTG CCGCCCGGGA TGCGAGAGTT CTTCGGCGGC CCAACGCCGC CGACCCCGCA AGGCCCGATG CGGACCCAAG GATCGGGGTT CATCATTTCG CAGGATGGTC TTGTCGTGAC GAATAACCAT GTGATCGCAG GCGCGGAACA GATCGAGGTG ATTATGAATG ACGATCGGCG GCTCGATGCG GAGTTGATCG GGACCGACCC CGCGACTGAT ATAGCGCTCC TGAGGATCGA AAATGTCACC GATCTGCCCC ATGTCGTCTG GGGGTCTTCC GACGATCTGT CCATCGGCGA GTGGGTCGTG GCCATCGGCA ATCCCTTTGG CCTCGGGGGT ACGGTGACCG CCGGGATCGT GTCTGCACGC GCTCGGGATA TTAACGCGGG ACCCTATGAC AGTTTCATAC AGACCGATGC CGCAATCAAT TCCGGCAACT CGGGAGGCCC GCTCTTTGAT GTCTCGGGCG ACGTCGTGGG TGTCAACACC GCGATCTTCT CGCCGACGGG CGGCAATGTC GGCATCGGCT TCGCTGTACC GTCGGCCGTA GCCGAGCGCA TTGTGGACGA TCTGCAAGAC GATGGACGAG TCGAGCGAGG CTGGCTCGGC GTGCAGGTGC AACCGGTGGA CGAGGCCCTA GCCCGGGCTT TCAAGTTTGA AGACCCCCAA GGGGTACTGC TTGCGGATGT CACGAAGGGC AGCCCGGCCT TCGAGGCGGG GTTGGAGCCG GGCGACGTCC TGCTCGAAAT CGACGGCGCC GCCGTCGACA CACCCCGCGA TCTGACTTTT GCCGTGGCTG ACACGCCCGT CGGTGCCACG GTCATGGTGA CCTACCTGCG CAACGGCAAC AGAGTTCAGG CGGAGGTTAC CATCGGGCAG CGGCCCGACC TGCAGTCCGC CACAGCCCAG CCTGATGCCG GGGGCCGCGA TGACAGGTCG GACGCGGACG GCCCGCGCAT AGGCGTGTCC GTCGCGCCGC TGTCTGGTGA GCTCCGCGCG CAAGCCGGTA TCCCGAACGA GGTGTCGGGC CTGCTGGTGC AATCAGTTAC GCAGGGCAGT CCCGCCGCCG AAGCAGGTCT TCGCGCCGGC GACGTCCTTG TGGAGGCTGC TGATGTCACG TTGGGGCAGG TCGAAACACT GCGCGATGCC ATCGCGCGCG CGGCCGAAGA GGGTGAGACG CTGCTGATCC GGGTTTTCCG CGGGCAAGGG TACAATTTCG TGGTGGTCAA TCTGAGCGAA GAGGTCGTGT CGAAGTAA
|
Protein sequence | MYCLPALPRA LAVLALLPAG GSALPAIAQQ TDFDSPTDFS EMVTERLPAV VGILSTGPAP DPSPAIQPQL PPGMREFFGG PTPPTPQGPM RTQGSGFIIS QDGLVVTNNH VIAGAEQIEV IMNDDRRLDA ELIGTDPATD IALLRIENVT DLPHVVWGSS DDLSIGEWVV AIGNPFGLGG TVTAGIVSAR ARDINAGPYD SFIQTDAAIN SGNSGGPLFD VSGDVVGVNT AIFSPTGGNV GIGFAVPSAV AERIVDDLQD DGRVERGWLG VQVQPVDEAL ARAFKFEDPQ GVLLADVTKG SPAFEAGLEP GDVLLEIDGA AVDTPRDLTF AVADTPVGAT VMVTYLRNGN RVQAEVTIGQ RPDLQSATAQ PDAGGRDDRS DADGPRIGVS VAPLSGELRA QAGIPNEVSG LLVQSVTQGS PAAEAGLRAG DVLVEAADVT LGQVETLRDA IARAAEEGET LLIRVFRGQG YNFVVVNLSE EVVSK
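The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The following is a minimal sketch, not part of the original record, that strips the spacing, computes GC content, and translates the reading frame. It assumes the standard codon assignments, which translation table 11 shares with table 1 for all 64 codons; the tables differ only in which codons may act as starts, and this gene starts with ATG. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGTATTGCT TACCCGCCCT TCCACGCGCT"
print(translate(prefix))  # MYCLPALPRA
```

The translated prefix matches the first block of the protein sequence listed above. Passing the full gene sequence to `gc_content` and `translate` gives values that can be compared with the GC content and Protein sequence fields.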