Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dd703_3642 |
Symbol | |
ID | 8088951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dickeya dadantii Ech703 |
Kingdom | Bacteria |
Replicon accession | NC_012880 |
Strand | + |
Start bp | 4229175 |
End bp | 4230545 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644837718 |
Product | protease Do |
Protein accession | YP_002989221 |
Protein GI | 242241040 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.393796 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG CATCGTTGTT GTATAGCGCG CTGGCACTCA GCATAGGTCT ATCCTTGTCC TCGCTTCCTA CGGCTAACGC CGCGCTGCCT TCGGTTGTGT CGGGTCAACC GTTGCCCAGC CTGGCACCAA TGCTGGAAAA AGTCCTTCCG GCCGTCGTGA GCGTCCATGT AGAAGGCACA CAAATTCAGC GCCAACGCAT CCCCGAAGAA TTCAAATTTT TCTTCGGCCC AAATACCCCC TCGGAAAAGC AGAGCAGCCG CCCATTTGAA GGACTGGGCT CCGGCGTCAT CATCAGTGCG GAAAAAGCCT ATGTGCTGAC CAACAACCAC GTCATCAACA ACGCGGATAA AATTCGTGTT CAGTTGAACG ACGGTCGGGA ATACGACGCA AAACTGATCG GCCGCGACGA ACAGACCGAT ATCGCCCTTT TGCAATTGGT GGATGCTAAA AACCTCACTG AGATCAAAAT GGCTGACTCT GATCAGTTGC GCGTAGGTGA CTTTGCTGTT GCCGTGGGCA ACCCATTCGG TCTGGGCCAG ACCGCGACCT CGGGAATTAT TTCCGCGCTG GGCCGCAGCG GCCTGAATCT GGAAGGGTTG GAAAACTTTA TTCAGACTGA TGCTTCCATC AACCGGGGCA ATTCCGGCGG AGCATTGGTT AACCTGAAAG GCGAGCTGAT CGGCATCAAC ACCGCTATTC TCGCACCGGG CGGCGGCAAC ATCGGCATCG GTTTCGCTAT CCCCAGCAAC ATGGCGCAAA ACCTGGCCCA GCAGTTGGTA GAATTTGGCG AAGTCAAACG CGGTCTGTTG GGCATCAAGG GCAGCGAGAT GACGTCTGAA ATCGCCAAAG CGTTTAAGGT TGAGGCGCAA CGCGGCGCAT TCGTCAGCGA GGTCATTCCG AAATCTGCGG CCGCCAAGGC CGGTATCAAA GCAGGGGATG TGCTGATTTC GCTGGATGGC AAGCCGATTA ACAGTTTTGC CGAACTCCGA GCGAAAATCG GCACTACTGC GCCAGGCAAG ACAGTTCGTG TCGGCCTGTT ACGTGACGGT AAACAGCAAG AAGTCTCCGT CGTGCTGGAC AACAGCGCCA ATGCAACCAC CAACGCCGAC AATCTTTCGC CTGCGCTGCA AGGCGCTTCA CTCACCAACG GTCAGTTGAA AGACGGCAGC AAAGGCGTAC TGATTGAGAA TGTCGCCAAG GACAGTGCGG CGGCCAAGGT CGGCTTACAG AAAGGCGATA TTATCGTGGG CGTCAATCGC GAGCGTGTTG AAAGCATCTC GCAATTGCGC AAGATCCTTG ACAGCAAACC CTCCGTGCTG GCGCTGAATA TCGTCCGCGG TGAAGAAAGT ATCTATCTGT TGTTACGTTG A
|
Protein sequence | MKKASLLYSA LALSIGLSLS SLPTANAALP SVVSGQPLPS LAPMLEKVLP AVVSVHVEGT QIQRQRIPEE FKFFFGPNTP SEKQSSRPFE GLGSGVIISA EKAYVLTNNH VINNADKIRV QLNDGREYDA KLIGRDEQTD IALLQLVDAK NLTEIKMADS DQLRVGDFAV AVGNPFGLGQ TATSGIISAL GRSGLNLEGL ENFIQTDASI NRGNSGGALV NLKGELIGIN TAILAPGGGN IGIGFAIPSN MAQNLAQQLV EFGEVKRGLL GIKGSEMTSE IAKAFKVEAQ RGAFVSEVIP KSAAAKAGIK AGDVLISLDG KPINSFAELR AKIGTTAPGK TVRVGLLRDG KQQEVSVVLD NSANATTNAD NLSPALQGAS LTNGQLKDGS KGVLIENVAK DSAAAKVGLQ KGDIIVGVNR ERVESISQLR KILDSKPSVL ALNIVRGEES IYLLLR
|
| |