Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dd703_2805 |
Symbol | |
ID | 8089632 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dickeya dadantii Ech703 |
Kingdom | Bacteria |
Replicon accession | NC_012880 |
Strand | - |
Start bp | 3277076 |
End bp | 3278506 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 644836883 |
Product | serine endoprotease |
Protein accession | YP_002988404 |
Protein GI | 242240223 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAA AATCATTGAT GTTGAGCGCG CTGGCGCTGA GTCTGGCGAT GGCTGTCGGT ACGCTGCCTG CCGCTGCGGC GGAATCGGTT TCTTCCTCCA GTACGCAACT GCCCAGTTTG GCGCCGATGC TGGAGCAGGT GATGCCTTCC GTGGTGAATG TCTACGTCGA TGGACATACG GCGGCGGCCA AACGCGCGAA TGTACCGCCG CAGTTGCAGC CGTTCTTCGG CGAAAATTCA CCGTTCTGTC AGGAAGGGTC GCCTTTCCAA TCCTCGCCGA TGTGTCAGGG CGACGATGAG GAGGGGGATG CTGCGCCGCA GCAAGCGTTT CAGGCGCTGG GCGCCGGGGT GATCATCAAT GCCGCTAAAG GCTATGTCGT GACCAATAAC CATGTGGTGG ATAACGCGGA TAAGATTCAG GTGCGCCTGA ACGATGGCCG TAAGTATGAC GCCAAGGTGA TTGGTAAGGA TCCGCGCTCA GACGTGGCGT TGATTCAGTT GCAGGACTTC AGCAATCTGA CGGCGATTCA CATCGCCGAT TCCGATCAAC TGCGCGTCGG CGATTATGCG GTCGCGATCG GCAACCCTTA CGGGCTGGGC GAAACCGCGA CGTCCGGCAT CATTTCCGCA TTGGGGCGCA GCGGCCTGAA TATTGAAAAT TATGAAGATT TCATTCAGAC GGACGCCGCC ATCAACCGGG GTAACTCCGG CGGGGCGTTG GTCAATCTGA ATGGTGAACT GATTGGTTTG AATACCGCAA TCCTGGCGCC GAGCGGCGGT AATGTCGGCA TCGGTTTCGC CATTCCCAGC AACATTGTGA AGAATCTGGT CAGCCAGATT GTGGAGTACG GCGAGGTGAA ACGCGGTGAA CTGGGTATTC TGGGCACCGA ATTGAACGCC GATCTGTCCA AAGCGATGAA ACTGGATACC CAGCGCGGTG CGTTCGTCAG CCAGGTACAG CCGAACTCCG CGGCGGCGAA AGCCGGGATC AAAGCCGGGG ATGTCATTGT TTCCCTTAAC GGCAAGGCGA TAAGCAGTTT CGCCGCACTA CGGGCTCAAG TCGGCTCTCT GCCGGTGGGG AGTTCGCTGT CGCTGGGGTT GATTCGCGAT GGTAAACCCG TGACGGTGAA TGTCACTCTA CAGCAGAGCG CGCAAACGCA GGTGGCTTCG GGAAATCTGA ATTCAGCCAT TGAAGGGGCC GAACTGAGTA ATACTCAGGT GAACGGTCAA AAAGGCGTGC GGGTCGACCA GGTGAAACCG GGTTCTGCCG CGGCGCGTAT TGGTCTGAAG CCGGATGACG TCATTCTTGG GGTTAATCAA CAGCCGGTCG AAAATATTGG TGAACTGCGC AAGATTATTG ACAGTAAACC GCCGGTGTTG GCGTTGAATA TTCGTCGTGG CGATACGGTA CTTTATCTGT TGATTCAATA A
|
Protein sequence | MKRKSLMLSA LALSLAMAVG TLPAAAAESV SSSSTQLPSL APMLEQVMPS VVNVYVDGHT AAAKRANVPP QLQPFFGENS PFCQEGSPFQ SSPMCQGDDE EGDAAPQQAF QALGAGVIIN AAKGYVVTNN HVVDNADKIQ VRLNDGRKYD AKVIGKDPRS DVALIQLQDF SNLTAIHIAD SDQLRVGDYA VAIGNPYGLG ETATSGIISA LGRSGLNIEN YEDFIQTDAA INRGNSGGAL VNLNGELIGL NTAILAPSGG NVGIGFAIPS NIVKNLVSQI VEYGEVKRGE LGILGTELNA DLSKAMKLDT QRGAFVSQVQ PNSAAAKAGI KAGDVIVSLN GKAISSFAAL RAQVGSLPVG SSLSLGLIRD GKPVTVNVTL QQSAQTQVAS GNLNSAIEGA ELSNTQVNGQ KGVRVDQVKP GSAAARIGLK PDDVILGVNQ QPVENIGELR KIIDSKPPVL ALNIRRGDTV LYLLIQ
|
| |