Gene Information |
Locus tag | Plav_2762 |
Symbol | |
ID | 5453665 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | + |
Start bp | 2961039 |
End bp | 2962496 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640878343 |
Product | protease Do |
Protein accession | YP_001414027 |
Protein GI | 154253203 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
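The length fields above are consistent with the coordinates, which can be checked with a small sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
# Values copied from the record above.
START_BP = 2961039
END_BP = 2962496


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1458 485
```

Both results match the Gene Length (1458 bp) and Protein Length (485 aa) fields.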
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.280827 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGTGG AAATGTTGTC GGAGTCGTCT TACGCGCAAG CGGTCGTCAT GGCGCTTGCC GGAATCGTCC TTGTCGCCGT GCTCGCCTTC TTTCCCTGGG CGGCGCGGGC GCAGGACAAG GAAGTGCCTT CAAGCCGCGC GCAGGTGCAG CTCTCTTATG CACCCGTCGT GAAGCGCGTC GCGCCCGCCG TGGTCAATGT CTATACGAAG CGTGTCGTGA AGGAGCAGGC GCGTTCCCCC TTCCTCAACG ATCCCTTCTT CCAGCAATTC TTCGGCAACC GCTTCAGCTT CGGCATGCCG CGCGAGCGCG TGCAGCAGTC GCTCGGTTCC GGCGTCATCG TCGACCCGTC GGGCCTCATC GTCACCAACT ACCATGTCAT CAAGGACGGC CAGCAATTCA CCGTTGCGCT GTCGGACCGG CGCGAATTCG AGGCCGAACT CGTTCTCTCC GACGAGCGCA CCGACCTTGC CGTGCTGCGG ATCGACAATG GCAAGCAGAA GCTGCCGCAT CTGACCTTCA AAAATTCGGA TTCCGTGGAA GTCGGCGATC TCGTGCTCGC GATCGGCAAT CCCTTCGGCG TCGGCCAGAC GGTCACGTCG GGCATCGTCT CGGCCTTGGC GCGCACGCAT GTCGGCGTCT CCGATTACCA GTTCTTCATT CAGACGGATG CCGCCATCAA TCCGGGTAAT TCCGGCGGCG CGCTCGTCAC GATGGATGGG CGCCTCATCG GCATCAACAG CGCCATCTAT TCGCAGACAG GCGGCAGCAT CGGCATCGGC TTCGCGATCC CCTCCAACAT GGTCGAGAGC GTTGTCGCCT CCGCGCGCGA TGGCGATCAT GTCCGGCGTC CATGGTTCGG CGCATCGCTG CAGCCGGTCG ATGCCGAGCT CGCGCAGTCG CTCGGGCTCG ATCGTCCGGG CGGCAGTCTC ATCCGCGACA TCTATCCCGG CGGTCCGGCG GACAAGGCGG GGCTGAAAGT CGGCGACGTT ATCCGCGCCA TTGATGGCTT CGATGTCGAA GAGCCTCAGG CGGTGCGCTA CCGTTTCGCT ACCAAGGGGC TCGGCGGTAA GGTGAATGTC GGCTATAGCC GGGACGGCGC GCGGCGCGAA ACGCAGGTGG CGCTCATCGC GCCGCCGGAA GACCCGCCTG CCGATGAAAC GAAAATCGCC GGCCGCAATC CCTTCGCCGG CGCCACCGCC GCCAATCTCT CGCCCGCGAT AGCCGACAAG CTCGGTCTCG ACATTGTGGG AGAGCAGGGC GTCGTCATCG TGAATGTCGA GCCCGGCTCC GCCGCCAATC AGGTCCAGTT CCAGCGCGGC GATATCATCG TCGAAGTCGC GGCCACGAAA ATCGCCACGG TGCGCGACCT CGTCCGCATC ACCGCCGCCA CACGTCCGCA ATGGGACTTC GCCATCAAAC GCGGCGACCG CGTCTTCTCC GCGACGGTGG GCGGCTAA
|
Protein sequence | MVVEMLSESS YAQAVVMALA GIVLVAVLAF FPWAARAQDK EVPSSRAQVQ LSYAPVVKRV APAVVNVYTK RVVKEQARSP FLNDPFFQQF FGNRFSFGMP RERVQQSLGS GVIVDPSGLI VTNYHVIKDG QQFTVALSDR REFEAELVLS DERTDLAVLR IDNGKQKLPH LTFKNSDSVE VGDLVLAIGN PFGVGQTVTS GIVSALARTH VGVSDYQFFI QTDAAINPGN SGGALVTMDG RLIGINSAIY SQTGGSIGIG FAIPSNMVES VVASARDGDH VRRPWFGASL QPVDAELAQS LGLDRPGGSL IRDIYPGGPA DKAGLKVGDV IRAIDGFDVE EPQAVRYRFA TKGLGGKVNV GYSRDGARRE TQVALIAPPE DPPADETKIA GRNPFAGATA ANLSPAIADK LGLDIVGEQG VVIVNVEPGS AANQVQFQRG DIIVEVAATK IATVRDLVRI TAATRPQWDF AIKRGDRVFS ATVGG
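The sequences above are printed in space-separated blocks of ten characters. The following minimal sketch shows one way to strip that grouping, compute a GC fraction, and translate a coding sequence. The helper names are hypothetical. The codon table used is the standard genetic code, on the assumption that translation table 11 assigns the same amino acids to codons and differs only in which start codons are permitted. Only the first three blocks of the gene sequence are used here to keep the example short.

```python
BASES = "TCAG"
# Standard genetic code in NCBI order (first, second, third base each over TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
fragment = clean_sequence("ATGGTGGTGG AAATGTTGTC GGAGTCGTCT")
print(len(fragment))          # 30
print(gc_fraction(fragment))  # 0.5
print(translate(fragment))    # MVVEMLSESS
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to the same helpers would give a whole-gene GC fraction and translation to compare against the GC content and Protein sequence fields.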