Gene Information |
Locus tag | Dshi_2733 |
Symbol | degP |
ID | 5713632 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2894377 |
End bp | 2895882 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641268658 |
Product | protease do precursor |
Protein accession | YP_001534067 |
Protein GI | 159045273 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
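The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, though the numbers are consistent with it) and that the final codon of the CDS is a stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2894377
END_BP = 2895882
GENE_LENGTH_BP = 1506
PROTEIN_LENGTH_AA = 501


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both comparisons hold for this record: the span is 1506 bp, and 1506 / 3 = 502 codons, one of which is the stop codon, leaving 501 residues.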
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0138399 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.054673 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAATAGAA CAGACCCTCA GCGACCCCAG GCCCGGATCC TGGCGAAGGT CGCAACGCCC GCGGGCGTGC CCCGTGCGGC GCAGGCCGCC GCCGGGGCGC TGCTTGCGCT GGCCCTGCTG CTCGCGCAGA CGCTGATCGT GCAGGCACGT GAAATCCCCG GCAGTTTTGC CGACCTCGCC GAGCGTGTGA GCCCGGCGGT CGTCAACATC ACAACCTCGA CCAATGTCGC CACCCCGGGC GGGCCGCAAC CGATGGTGCC CGAGGGCTCG CCCCTCGAAG ACTTCTTCCG CGATTTCATG GATCGGCAGG AGCAGGATGG TCGCCCCGCA CCCCGGCAGC GGCGCTCCAA CGCGCTCGGC TCCGGCTTCG TGATCTCCGA GGACGGCTAT ATCGTCACGA ACAATCACGT GATCGAGCAG GCCGACGAGA TCCTGATCGA GTTCTTCTCC GGTGAAGAGC TGGCGGCAGA GGTCGTCGGC ACCGACCCCA ACACCGATAT CGCGCTCTTG AAGGTCGAAA GCGACACGCC GCTGCCGTTC GTCACCTTCG GGGACAGCGA TGCGGCCCGC GTGGGCGATT GGGTGATGGC CGTGGGCAAC CCGCTGGGCC AGGGTTTCTC GGTCTCGGCC GGGATCGTCT CGGCGCGCAA CCGGGCGCTT TCGGGCACCT ATGACGATTA CATCCAGACC GACGCGGCGA TCAACCGGGG CAACTCGGGC GGACCGCTGT TCAACATGGA CGGTGAAGTC ATCGGCGTGA ACACCGCGAT CCTGTCGCCC AACGGTGGCT CTATCGGGAT CGGTTTTTCC ATGGCAGCCG GTGTCGTGAC CAACGTGGTC GATCAGCTCA AGGAATTCGG CGAGACCCGC CGCGGCTGGC TGGGCGTGCG CATCCAGGAC GTGACCGACG ACGTGGCCGA AGCCCTCGGG CTGGAGCAGG CCGCCGGGGC GCTCGTGACC GATGTGCCGG ACGGCCCGTC GCTCGATGCC GGGATGGAGG CGGGGGACGT GATCCTCACC TTCGACGGGC GCGACGTGGA GGACACCCGT GAGCTGGTCC AGATCGTCGG CAACACAGCG GTCGGCAAGG CCGTGCGCGT CGTGGTGTTC CGCGACGGCG CAACCCAGAC CCTGCTGGTG ACCCTTGGGC GTCGCGAAGA GGCGGAGCGG GCGATCCCGG CTTCCGCGTC TGCGGATGAA GAGATCCTCG AGAAGGAGAT CATGGGCCTG ACCGTCAGCG AGTTGACCGA TGAGCTGCGC GAGCAGCTCG GGATCGCGGC GAGCGATACC GGGCTTGTCG TGGCCGATAT CGACGAGACC TCGGAGGCCT TCGACAAGGG TCTGCGCGCG GGCGATCTCA TCGTCGAGGC CGCACAGGTC CGTGTGACGA CCATCGAAGA GTTCGAAGAG CGGGTCGAGG CCGCCAAGGA GGCGGGGCGC AAGTCCATCC TCGTGCTGGT GCGCCGGGAT GGCGACCCCC GTTTCGTGGC CCTTTCGCTG AGCTGA
Protein sequence | MNRTDPQRPQ ARILAKVATP AGVPRAAQAA AGALLALALL LAQTLIVQAR EIPGSFADLA ERVSPAVVNI TTSTNVATPG GPQPMVPEGS PLEDFFRDFM DRQEQDGRPA PRQRRSNALG SGFVISEDGY IVTNNHVIEQ ADEILIEFFS GEELAAEVVG TDPNTDIALL KVESDTPLPF VTFGDSDAAR VGDWVMAVGN PLGQGFSVSA GIVSARNRAL SGTYDDYIQT DAAINRGNSG GPLFNMDGEV IGVNTAILSP NGGSIGIGFS MAAGVVTNVV DQLKEFGETR RGWLGVRIQD VTDDVAEALG LEQAAGALVT DVPDGPSLDA GMEAGDVILT FDGRDVEDTR ELVQIVGNTA VGKAVRVVVF RDGATQTLLV TLGRREEAER AIPASASADE EILEKEIMGL TVSELTDELR EQLGIAASDT GLVVADIDET SEAFDKGLRA GDLIVEAAQV RVTTIEEFEE RVEAAKEAGR KSILVLVRRD GDPRFVALSL S
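The gene sequence begins with GTG while the protein begins with M, which fits translation table 11 (bacterial), where GTG can act as an initiator codon and is then read as methionine. The following is a minimal sketch of that translation in plain Python. The codon table is the standard one, which table 11 shares for elongation; the set of alternative start codons and the helper name `translate_cds` are assumptions made for illustration and are not taken from this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in conventional TCAG codon order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed initiator codons for translation table 11; each is read as Met
# when it is the first codon of the CDS.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGAATAGAA CAGACCCTCA GCGACCCCAG"))  # MNRTDPQRPQ
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence through the same function should reproduce all 501 residues if the record is internally consistent.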