Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dd1591_1042 |
Symbol | |
ID | 8120315 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dickeya zeae Ech1591 |
Kingdom | Bacteria |
Replicon accession | NC_012912 |
Strand | + |
Start bp | 1200081 |
End bp | 1201529 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 644851438 |
Product | serine endoprotease |
Protein accession | YP_003003393 |
Protein GI | 251788672 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.88791 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAAA AATCATTGAT GTTGAGCGCG CTGGCGCTGA GTCTGGCGAT AGCGGTGGGG ACATTGCCGG CCACTGCCAG CGCGGCAGAA TCGGTATCGT CGTCCGGCGC TCAGTTGCCA AGCCTGGCGC CCATGCTGGA GAAAGTCATG CCGTCGGTGG TGAATATTTC CGTAGAGGGG CATACCGCCG CCAGTCAGGG GGCCAGCGTG CCGCCGCAAA TGCAGCCCTT CTTCGGCGAC AATTCGCCTT TCTGTCAGGA AGGATCGCCT TTCCGGTCAT CGCCGATGTG CCAGGGCGAA GATGACGACG AGGACGGTGG TGAGGAAGGT GCGCCGCCGC AGGCATTTCA GGCGTTGGGT GCCGGGGTTA TCATCAATGC CGCGAAAGGC TATGTGGTGA CCAACAACCA CGTGGTGGAC AATGCCGATA AAATTCAGAT TCGCCTCAAT GATGGGCGTA AATACGATGC GAAAGTGATT GGTAAAGACC CGCGATCGGA TGTGGCGCTG ATTCAATTGA AGGATTTCCG TAATCTGACC GAGATTAAGA TGGCGGATTC CGATCAACTG CGGGTAGGGG ATTATGCGGT GGCGATCGGC AACCCTTACG GTTTGGGCGA AACCGCCACC TCAGGGATCA TCTCGGCGTT AGGGCGCAGT GGTCTGAATA TTGAAAACTA TGAAGATTTT ATCCAAACCG ATGCCGCTAT CAACCGGGGT AACTCCGGTG GTGCGCTGGT CAACCTGAAT GGTGAACTGA TTGGCCTGAA CACGGCTATT CTGGCGCCGG GCGGCGGTAA TATCGGTATC GGTTTTGCCA TCCCCAGCAA TATTGTGAAG AACCTGATTA ACCAAATCGT CGAATACGGC GAAGTGAAAC GCGGCGAGTT GGGGATCATG GGGACGGAGC TGAATTCCGA TATTGCCAAG GCGATGAAAG TCGATGCCCA GCGTGGGGCG TTCGTCAGCC AGGTACAGCC GAATTCCGCC GCCGCCAAAG CCGGTATCAA GGCAGGAGAT GTGGTGGTAT CGATGAACGG TAAAGCCATC GGCAGTTTCT CGGCGTTGCG TGCTCAGATT GGTTCGTTGC CGGTCGGCAG CAAACTGACG CTGGGGCTGA TTCGCGACGG GAAACCCGCG ACGGTAGACG TCACATTGCA GCAGAGTGCG CAGTCGCAGG TGGAATCCGG TAACCTGAAT TCGGCGATTG AAGGCGCTGA GTTGAGTAAC ACCCAGGTAG ACGGCCAGAA GGGCGTTAAA GTGGATAAGG TGAAACCGGA TTCTGCGGCG GCGAAAATCG GCCTTAAGCC TGATGATGTC ATCCTGGGGG TGAACCAACA GCCAGTGGAA AATATTGGCG AGCTACGCAA GATTATCGAC AGCAAGCCGC CGGTACTGGC GCTGAGTATT CGACGCGGCA ACAGTGATCT TTATCTGCTG ATTCAATAA
|
Protein sequence | MKQKSLMLSA LALSLAIAVG TLPATASAAE SVSSSGAQLP SLAPMLEKVM PSVVNISVEG HTAASQGASV PPQMQPFFGD NSPFCQEGSP FRSSPMCQGE DDDEDGGEEG APPQAFQALG AGVIINAAKG YVVTNNHVVD NADKIQIRLN DGRKYDAKVI GKDPRSDVAL IQLKDFRNLT EIKMADSDQL RVGDYAVAIG NPYGLGETAT SGIISALGRS GLNIENYEDF IQTDAAINRG NSGGALVNLN GELIGLNTAI LAPGGGNIGI GFAIPSNIVK NLINQIVEYG EVKRGELGIM GTELNSDIAK AMKVDAQRGA FVSQVQPNSA AAKAGIKAGD VVVSMNGKAI GSFSALRAQI GSLPVGSKLT LGLIRDGKPA TVDVTLQQSA QSQVESGNLN SAIEGAELSN TQVDGQKGVK VDKVKPDSAA AKIGLKPDDV ILGVNQQPVE NIGELRKIID SKPPVLALSI RRGNSDLYLL IQ
|
| |