Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dd1591_3800 |
Symbol | |
ID | 8117233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dickeya zeae Ech1591 |
Kingdom | Bacteria |
Replicon accession | NC_012912 |
Strand | + |
Start bp | 4301855 |
End bp | 4303225 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644854167 |
Product | protease Do |
Protein accession | YP_003006079 |
Protein GI | 251791358 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00053297 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA CATCGTTGTT ATATAGCGCA CTGGCACTCG GTATTGGACT GTCGTTATCT TCACTGCCTT CAGCCAACGC CGCGTTGCCT GCCGTCGTAG AAGGTCAGGC ACTGCCCAGC CTGGCGCCGA TGCTGGAAAA AGTACTGCCC GCCGTCGTGA GCGTGCATGT GGAAGGTACG CAGGTGCAGC GTCAGCGCAT CCCGGAAGAG TTTAAATTCT TCTTCGGCCC TAACACTCCA TCCGAAAAAC AGAATACTCG CCCGTTTGAA GGACTGGGTT CCGGCGTCAT CATCAACGCC GAAAAAGGCT ATGTGTTGAC CAATAACCAC GTCGTCAACA ACGCAGATAA AATCCAGGTG CAGTTGAATG ACGGTCGTGA ATATGATGCC AAGTTGATCG GCCGCGATGA GCAGACCGAT ATCGCCCTGT TGCAGTTAAG CGACGCCAAG AACCTCACCG AAGTCAAACT GGCTGATTCC GATCAGTTGC GCGTCGGTGA CTTCGCGGTT GCCGTCGGCA ACCCATTCGG ACTCGGCCAA ACGGCAACGT CCGGCATCAT TTCTGCGTTG GGTCGTAGCG GGTTGAATCT GGAAGGATTG GAAAACTTCA TCCAGACCGA TGCTTCCATC AACCGCGGTA ACTCCGGCGG AGCGTTGGTT AACCTGCGAG GCGAATTGAT CGGCATTAAT ACCGCAATTC TGGCGCCAAG CGGCGGCAAC GTGGGCATCG GCTTTGCCAT TCCAAGCAAC ATGGCGCAGA ATTTGGCGCA ACAGTTAGTG GAATTCGGTG AAGTGAAACG CGGCCTGCTG GGTATCAAGG GCAGCGAGAT GACCTCTGAA ATCGCAAAAG CCTTCAAGGT TGAGGCCCAG CGCGGCGCTT TCGTCAGCGA GGTGATTCCG AAATCAGCGG CGGCCAAAGC AGGCATTAAA GCCGGCGATG TGTTGGTTTC TCTGGATGGC AAGCCCATTA ATAGCTTTGC CGAGTTACGC GCCAAAGTCG GCACCACCGC ACCGGGTAAA ACCGTGCGCA TCGGCCTGCT ACGTGACGGG AAACAACAAG AAGTCGCAGT GGTACTGGAC AATAGCGCCA ACGTCACGAC AAACGCAGAC ACCTTGTCCC CAGCCTTGCA GGGCGCCACC CTTAACAACG GTCAGTTGAA AGACGGCAGT AAAGGCGTGG TGATCGACAA TGTGGGCAAA GATAGTGCAG CCGCTAAAGT CGGCTTGCAA AAAGGCGACA TTATCGTCGG CGTGAACCGT GAACGCGTAG AAAACATCAC TCAGTTGCGC AAAATTCTGG AAGCCAAACC CTCGGTCTTG GCCCTGAACA TTGTTCGCGG CGATGAGAGT ATCTATCTTC TGCTGCGTTA A
|
Protein sequence | MKKTSLLYSA LALGIGLSLS SLPSANAALP AVVEGQALPS LAPMLEKVLP AVVSVHVEGT QVQRQRIPEE FKFFFGPNTP SEKQNTRPFE GLGSGVIINA EKGYVLTNNH VVNNADKIQV QLNDGREYDA KLIGRDEQTD IALLQLSDAK NLTEVKLADS DQLRVGDFAV AVGNPFGLGQ TATSGIISAL GRSGLNLEGL ENFIQTDASI NRGNSGGALV NLRGELIGIN TAILAPSGGN VGIGFAIPSN MAQNLAQQLV EFGEVKRGLL GIKGSEMTSE IAKAFKVEAQ RGAFVSEVIP KSAAAKAGIK AGDVLVSLDG KPINSFAELR AKVGTTAPGK TVRIGLLRDG KQQEVAVVLD NSANVTTNAD TLSPALQGAT LNNGQLKDGS KGVVIDNVGK DSAAAKVGLQ KGDIIVGVNR ERVENITQLR KILEAKPSVL ALNIVRGDES IYLLLR
|
| |