Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PC1_0289 |
Symbol | |
ID | 8131202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pectobacterium carotovorum subsp. carotovorum PC1 |
Kingdom | Bacteria |
Replicon accession | NC_012917 |
Strand | - |
Start bp | 335148 |
End bp | 336518 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644863569 |
Product | protease Do |
Protein accession | YP_003015884 |
Protein GI | 253686694 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0319386 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA CATCATTATT ATTTAGTGCA CTGGCAATGA GTATAGGTTT GACCCTGTCC ACGCTTCCCG TAGCGAATGC TGCGTTGCCT GCCGTGGTAC AAGGGCAACA GACGCCAAGC CTGGCCCCGA TGCTGGAAAA AGTCTTGCCA GCCGTCGTCA GCGTGCATGT TGAAGGTACA CAGGTACAGC GCCAGCGCGT ACCGGAGGAG TTCAAGTTCT TCTTTGGTCC GAACTTCCCA ACGGACAAAC AAAGCTCTCG TCCGTTTGAA GGATTGGGCT CTGGCGTCAT TATTGATGCC GCAAAAGGTT ACGTGCTCAC CAATAATCAC GTCATCAATA ACGCCGACAA AATTCGCGTC CAGCTCAATG ACGGGCGTGA GTATGAAGCA AAACTGATTG GTCGCGATGA GCAAACCGAT ATCGCCCTGC TACAGCTGAA TGACGCCAAG AATCTGGTCG CGGTGAAAAT GGCTGATTCC GATCAACTCC GCGTCGGTGA CTTCGCCGTT GCCGTGGGTA ACCCGTTCGG CCTTGGTCAG ACCGCGACAT CCGGCATCAT CTCTGCACTG GGGCGTAGCG GGCTGAATCT GGAAGGGTTG GAAAACTTCA TCCAGACCGA TGCGTCTATC AACCGTGGTA ACTCCGGCGG TGCGTTGGTT AACCTCAACG GTGAGCTGAT CGGTATTAAC ACGGCGATTT TGGCGCCGGG CGGCGGGAAC ATCGGTATCG GTTTCGCTAT CCCCAGCAAC ATGGCTCAAA ATCTGGCACA GCAGCTGGTT GAATTTGGCG AGGTCAAACG CGGACTGCTG GGTATTAAAG GCAGCGAAAT GACGTCTGAG ATGGCGAAAG CCTTCAACGT CGATGCACAA CGCGGTGCCT TCGTCAGCGA AGTCTTACCG AAATCTGCCG CCGCGAAAGC GGGTATCAAG GCGGGCGACG TGTTGACGAC GCTGGATGGT AAACCGATCA GCAGCTTTGC GGAACTGCGG GCGAAAGTCG GTACTACCGC GCCGGGCAAG ACCGTGAAAA TCGGCCTGCT GCGTGATGGC AAACCTCAGG AAGTGTCCGT CGTGTTGGAT AACAGCTCAT CAGCGTCAAC CAGCGCCGAA ACACTTTCAC CGTCATTGCA GGGCGCGTCT CTGACCAATG GCCAATTGAA AGACGGCAGC AAGGGCGTAC AGATTGATAA CGTCGCCAAA GACACACCTG CGGCGCAGGT TGGTCTGCAG AAAGGCGATA TCATCATCGG CGTGAACCGT GAGCGTATTG AAAACATCAC GCAGTTGCGC AAGCTGCTGG AAGCGAAACC TTCCGTTCTG GCTTTGAATA TCGTCCGGGG CGAAGAAACG ATTTATCTGC TGTTACGTTA A
|
Protein sequence | MKKTSLLFSA LAMSIGLTLS TLPVANAALP AVVQGQQTPS LAPMLEKVLP AVVSVHVEGT QVQRQRVPEE FKFFFGPNFP TDKQSSRPFE GLGSGVIIDA AKGYVLTNNH VINNADKIRV QLNDGREYEA KLIGRDEQTD IALLQLNDAK NLVAVKMADS DQLRVGDFAV AVGNPFGLGQ TATSGIISAL GRSGLNLEGL ENFIQTDASI NRGNSGGALV NLNGELIGIN TAILAPGGGN IGIGFAIPSN MAQNLAQQLV EFGEVKRGLL GIKGSEMTSE MAKAFNVDAQ RGAFVSEVLP KSAAAKAGIK AGDVLTTLDG KPISSFAELR AKVGTTAPGK TVKIGLLRDG KPQEVSVVLD NSSSASTSAE TLSPSLQGAS LTNGQLKDGS KGVQIDNVAK DTPAAQVGLQ KGDIIIGVNR ERIENITQLR KLLEAKPSVL ALNIVRGEET IYLLLR
|
| |