Gene Information |
Locus tag | YpAngola_A0987 |
Symbol | degP |
ID | 5799450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | - |
Start bp | 1009817 |
End bp | 1011262 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641338976 |
Product | serine endoprotease |
Protein accession | YP_001605548 |
Protein GI | 162418209 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
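The coordinate and length fields in the Gene Information table can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in Protein Length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration, not part of any database tool.

```python
# Values copied from the Gene Information table above.
start_bp = 1009817
end_bp = 1011262
gene_length_bp = 1446
protein_length_aa = 481


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks print True for the values in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 1011262 - 1009817 + 1 gives 1446 bp, and 1446 / 3 - 1 gives 481 aa, matching the table.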
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0018316 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA CAACTTTAGT ATTAAGTGCA TTGGCATTGA GCATTGGTTT CGCCATGGGC CCGGTTTCTT CCGTCGTTGC GGCAGAGACG GCAGCATCGA GTAGCCAGCA GCTCCCTAGC CTGGCGCCAA TGCTAGAGAA AGTAATGCCT TCAGTGGTCA GTATCAACGT TGAAGGTAGT GCGCCTGTAA GCAGTGCTGG TGCACGCGGT ATGCCACCAC AATTCCAGCA GTTTTTTGGT GATAACTCGC CATTCTGTCA GGACGGTTCA CCGTTCCAAG GCTCGCCAAT GTGTCAAGGG GATCTGGGCG GACTAGGGCA GGGAATGCCA AGTAAGCGGG AATTCCGTTC GCTTGGTTCA GGTGTCATTA TTGATGCGGG CAAGGGGTAT GTCGTTACCA ATAACCACGT GGTCGATAAT GCGAACAAGA TCAGCGTAAA ACTGAGCGAT GGCCGCAGTT TTGATGCCAA GGTGATCGGT AAAGATCCAC GTACCGATAT CGCACTGTTA CAACTGAAAG ACGCTAAAAA TCTGACTGCG ATTAAGATTG CCAATTCGGA TCAACTGCGT GTCGGTGATT ATACCGTCGC TATCGGGAAC CCGTATGGCT TGGGTGAAAC CGTGACATCC GGTATTGTCT CTGCTTTAGG GCGCAGTGGT TTGAATGTAG AAAACTATGA AAACTTTATC CAGACTGATG CGGCGATTAA CCGTGGTAAT TCCGGCGGCG CATTAATCAA CCTGAACGGT GAGTTGATTG GTATTAACAC CGCTATTCTG GCACCGGATG GCGGTAACAT TGGTATTGGC TTTGCTATCC CAAGCAACAT GGTGAAGAAC CTGACATCAC AGATGGTTGA GTTTGGTCAG GTAAAACGCG GTGAACTGGG CATTATGGGG ACCGAGCTAA ACTCTGAACT GGCAAAAGCC ATGAAGGTTG ATGCGCAGAA AGGTGCCTTT ATCAGCCAGG TCGTGCCTAA ATCTGCTGCG GCAAAAGCGG GTATCAAAGC GGGCGATATC ATTGTCAGTA TGAATGGGAA AGCCATCAAT AGTTTTGCAG GGTTCCGCGC CGAGATCGGC ACGTTACCTG TTGGCAGCAA AATGACCTTG GGTCTGCTGC GTGATGGCAA ACCGATCAAT GTGGATGTCG TCCTGGAGCA GAGCAGCCAC AGTCAGGTGG AATCCGGCAA TCTCTACACC GGTATTGAGG GGGCTGAACT GAGTAACAGC GACGTTAGCG GCAAGAAAGG GGTGAAAGTT GATAGCGTAA AACCAGGCAC TGCTGCGGCG CGTATCGGCC TGAAAAAAGG TGATATCATC ATGGGGATTA ACCAGCAACC AGTCCAGAAC CTAGGTGAGC TGCGGAAAAT CCTCGATGCT AAACCACCGG TATTGGCGTT GAATATTCAA CGTGGTGATA CTTCACTCTA TTTATTGATG CAGTAA
|
Protein sequence | MKKTTLVLSA LALSIGFAMG PVSSVVAAET AASSSQQLPS LAPMLEKVMP SVVSINVEGS APVSSAGARG MPPQFQQFFG DNSPFCQDGS PFQGSPMCQG DLGGLGQGMP SKREFRSLGS GVIIDAGKGY VVTNNHVVDN ANKISVKLSD GRSFDAKVIG KDPRTDIALL QLKDAKNLTA IKIANSDQLR VGDYTVAIGN PYGLGETVTS GIVSALGRSG LNVENYENFI QTDAAINRGN SGGALINLNG ELIGINTAIL APDGGNIGIG FAIPSNMVKN LTSQMVEFGQ VKRGELGIMG TELNSELAKA MKVDAQKGAF ISQVVPKSAA AKAGIKAGDI IVSMNGKAIN SFAGFRAEIG TLPVGSKMTL GLLRDGKPIN VDVVLEQSSH SQVESGNLYT GIEGAELSNS DVSGKKGVKV DSVKPGTAAA RIGLKKGDII MGINQQPVQN LGELRKILDA KPPVLALNIQ RGDTSLYLLM Q
|
| |
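The sequences above are displayed in space-separated blocks of ten characters. The following is a rough sketch of how such a field could be cleaned up before checking its length or GC content. It uses only the first two blocks of the gene sequence as sample input, and the helper names `clean_sequence` and `gc_fraction` are hypothetical, chosen here for illustration.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown above.
sample = "ATGAAAAAAA CAACTTTAGT"
seq = clean_sequence(sample)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.2
```

Applying the same two helpers to the full Gene sequence field would allow a comparison with the Gene Length and GC content values listed in the table.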