Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dd1591_3801 |
Symbol | |
ID | 8117234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dickeya zeae Ech1591 |
Kingdom | Bacteria |
Replicon accession | NC_012912 |
Strand | + |
Start bp | 4303312 |
End bp | 4304376 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644854168 |
Product | serine endoprotease |
Protein accession | YP_003006080 |
Protein GI | 251791359 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02038] periplasmic serine pepetdase DegS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000336047 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAACCA AACTGTTACG TTCGGCGCTC TTCGGTGTCA TCGTGGCTGG CATCCTGTTG CTAACCATCC CGACACTGCG CTCCAGCCAG GGCTGGTTCA AACCCAGCAG CGACAGCAGC CAGGAAAACC CGGTCAGTTA CTACCAGGGA GTCCGTCGTG CTGCACCCGC CGTGGTGAAC GTATACAATC AGGCCAACGA TCCGAGAACC CAAAGCGAAA TGAACGTCCG CACGCTGGGT TCTGGCGTGA TAATGAACAA TAAAGGGTAC ATCCTGACCA ACAAGCATGT GATCAGTAAT GCTGAGCGAA TTGTGGTTAC ACTGCAGGAT GGCCGTATGT TCGAAGCGCT GCTGGTAGGC TCAGACAGCC TGACGGATTT GGCGGTACTG AAAATTGAAG GTGCCAATCT GCCGGAAATT CCCATCAATC CCAAACGGGT ATTTCATGTC GGCGACGTCG TGATGGCTAT CGGTAACCCC TATAACCTCG GCCAGACGGT GACGCAAGGG ATTATCAGCG CGACCGGTCG GGTTGGGCTA ACGCCATCCG GTCGACAGAA CTTTCTGCAA ACCGATGCGT CCATTAATCG CGGCAACTCC GGTGGCGCGC TGATTAACAC ACAGGGCGAA CTGGTCGGCA TCAATACCTT GTCATTTGAC AAAAGCGATG ATGGCGGCAC CCCGGAAGGT CTCGGTTTTG CCATACCCAC TGCACTGGCA ACCAAGATCA TGAATAAGTT GATCCGTGAT GGTCGGGTAA TTCGCGGTTA TATCGGCATC CGTGGCGCCC AGCGTGAACA GTTGGGCAAT CAGGTGATTG GTCTGGAACG GTTGCAGGGT ATCGTCGTCA GTAAGATAGA CGACGGTGGT CCGGCAGATA AAGCCGGGAT GAAGGAAGGC GACTTGCTGT TGGAAGTGAA TAACAAACCC GCCCGATCGG TGCTGGAAAC CATGGATCAG GTGGCGGAAA TTCGCCCTGG TTCGGTCATT CCAGTGGTTA TTTTACGGGA TAATCAGGAA ATAAAACTGA ATATGACCAT CCAGGAATAC CCCACCACCG ACTGA
|
Protein sequence | MLTKLLRSAL FGVIVAGILL LTIPTLRSSQ GWFKPSSDSS QENPVSYYQG VRRAAPAVVN VYNQANDPRT QSEMNVRTLG SGVIMNNKGY ILTNKHVISN AERIVVTLQD GRMFEALLVG SDSLTDLAVL KIEGANLPEI PINPKRVFHV GDVVMAIGNP YNLGQTVTQG IISATGRVGL TPSGRQNFLQ TDASINRGNS GGALINTQGE LVGINTLSFD KSDDGGTPEG LGFAIPTALA TKIMNKLIRD GRVIRGYIGI RGAQREQLGN QVIGLERLQG IVVSKIDDGG PADKAGMKEG DLLLEVNNKP ARSVLETMDQ VAEIRPGSVI PVVILRDNQE IKLNMTIQEY PTTD
|
| |