Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1540 |
Symbol | |
ID | 5713197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 1601019 |
End bp | 1602053 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641267455 |
Product | protease |
Protein accession | YP_001532883 |
Protein GI | 159044089 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.107774 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.168358 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCTCT GGCTCCCCCT CGCGCTCTGC CTGCTGGCCC TGGTCGCACC CGCCCGGGCA TCCGAACTGA CCGCCCCCGA AGAGCGCCTG ATTTCCCTGT TCGAGACGTC GCGCGCCGCC GTGGTGTCGA TCACCACCGG CCAGCGCCGG GTCGATCCCT GGATGCGCCG GGCCGAAATC GTGCCCAGCG GCTCCGGCTC GGGGTTCGTG TGGGACCGCG ACGGCCATGT GGTCACCAAC GCCCATGTCA TCCGCGGCGC GGCCCGGGCG GATGTGCACA TGGCCGACGG GCGCGTGCTG CCCGCCCGGC TGGTGGGCAC GGCCCCGCAA TACGACCTCG CGGTGCTGCG CGTCGATCTC GGCACGCGCC GTCCCGACCC GCTCCCCCTG GGGCGCAGCG ACGCGCTCCG CGTGGGTCAA AGCGTGCTGG CCATCGGCAA TCCGTTCGGG CTGGACTGGA CGCTGACCAC GGGCATCGTC TCGGCGCTGG AGCGCGAGAT CCCGCTGGGC ACCGGCACGA TCGAGGGGCT TATCCAGACC GACGCGGCGA TCAATCCGGG CAATTCCGGC GGCCCGCTTC TGGACAGCTC CGGGCGGCTG ATCGGCGTGA ACACCGCGAT CTTCAGCCCC TCGGGCTCCA GTGCCGGAAT CGGCTTTGCC GTGCCCGTGG ACCGGGTCGC CCGCGTGGTG CCGCAACTCA TCGCCCGGGG CATGTATCGC CCGCCGGTCC TCGGCATCCG TTTCGATCCG CGCATCGACG CGCTGGCCCG GCAGAACGGC GTCGAAGGCG CCGTGATCCT CGCGATAGAA CCGGGCGGCC CCGCCGCCGC CGCAGGTCTG CGCCCGGCCC GGCGGGATGG GGCGGGCTTT CTCGTGCCCG GCGACGTGAT CCAGCGCCTG GCGGGCCGCC CCATCGCCAG CGGCAGCGAC CTGCGCAGCG TGCTCGACGA TTTCGACCCG GGCACCGAGG TGACCCTCGA GGTCTGGCGC GACGGCACCC GGCGCGAGGT CCGCGTCACC CTGGCCGCGC CCTGA
|
Protein sequence | MRLWLPLALC LLALVAPARA SELTAPEERL ISLFETSRAA VVSITTGQRR VDPWMRRAEI VPSGSGSGFV WDRDGHVVTN AHVIRGAARA DVHMADGRVL PARLVGTAPQ YDLAVLRVDL GTRRPDPLPL GRSDALRVGQ SVLAIGNPFG LDWTLTTGIV SALEREIPLG TGTIEGLIQT DAAINPGNSG GPLLDSSGRL IGVNTAIFSP SGSSAGIGFA VPVDRVARVV PQLIARGMYR PPVLGIRFDP RIDALARQNG VEGAVILAIE PGGPAAAAGL RPARRDGAGF LVPGDVIQRL AGRPIASGSD LRSVLDDFDP GTEVTLEVWR DGTRREVRVT LAAP
|
| |