Gene Information |
Locus tag | Plav_0943 |
Symbol | |
ID | 5454152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | + |
Start bp | 1013426 |
End bp | 1014901 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640876514 |
Product | protease Do |
Protein accession | YP_001412223 |
Protein GI | 154251399 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
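The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon; the function name `feature_lengths` is hypothetical and not part of any database API.

```python
# Minimal consistency check for the coordinate fields of an unspliced CDS.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the CDS ends with exactly one stop codon.

def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa)."""
    gene_len = end_bp - start_bp + 1          # inclusive range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1           # the stop codon encodes no residue
    return gene_len, protein_len


# Start bp and End bp copied from the record above.
gene_len, protein_len = feature_lengths(1013426, 1014901)
print(gene_len, protein_len)
```

With the values from this record, the result agrees with the listed Gene Length (1476 bp) and Protein Length (491 aa).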
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.250261 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGAATG AAAGCAATAT CGGGCGGCAG AGGACGCTGG CGCTGGCAGT GGCGGGCGCA CTCGTATTCG GCGCCTGGTC GTCCGGCGCC TTGGCGCAGG AGCCCCTGGA CACGCGCTCA GCCGCGCCTG CGTCGACGGC CCTGGAAAAT CTCCCGAGCT TCGCCGACCT CGTCGAGAAG GTGAACCCGG CTGTGGTCAG CATTCGTGTC GACGAGGAGG TTGCCGCCCG TAGCTCCGGT GTCCCCGATC TTCCGTTCCC GCCCGGCAGC CCCTTCGAGA AATTCTTCCG TGACATGCAG CCCCAGCAGG GCCCCGACGG CGCGCCGCCG CGCCGGCATG CTACAGCACT CGGCTCCGGA TTTCTGATTT CCGCGGATGG CTTTGTGGTC ACCAACAATC ACGTCGTTGG CGACGGCAAG GACATCACTG TCGTTCGCAG CGACGGCAGC GAGATGAAGG CGAAACTCAT CGGCCGGGAT CCGAAGACGG ATCTCGCGCT TGTGAAAGTC GAAAGCAAGG AACCTCTGCC TTACGTGGTG TTCGGCAATT CCGACAATGT GCGCGTGGGA GACTGGGTGC TCGCGGTCGG CAATCCCTTC GGCCTTGGAG GCACCGTCAC CACCGGCATT GTTTCCGCGC GCGGCCGTGA AATAGGCGCT GGCCCCTATG ACGATTTCAT TCAGATCGAT GCTTCGATCA ACAAGGGAAA TTCAGGCGGT CCGACCTTCG ACGTCCGGGG TAATGTTGTG GGCGTCAACA CGGCCATCTT TTCGCCCACT GGCGGCAGTG TCGGTATCGG CTTCGCCATT CCGTCCTCGA TCGCGCAGAA CGTCATCGCT CAGCTGAAGG AAGACGGAAA GGTCACGCGC GGCTGGCTCG GCGTCACCAT TCAACAGGTT GACGAGGACG TCGCCTCCAC GCTCGCCCTG GACAAGCCCC GTGGCGCGCT CGTCGCACAG GTTGCGGAAG ACAGCCCCGC GAAGAAAGCC GGCATCCAGA CGGGCGACGT CATTCTCAAT GTTGACGGAA AAGAAATGGA AGACGTCCGT GCCGTCAGCC GCACGGTTGC GGATCTGCAG CCAGATACGC GCTCGCAGAT CGTCCTGTGG CGCGATGGCA AGCGGAAAAA CATCTCCGCG CAGATTGGCA CCTTCCCCGA GGAGATCGCG GCCGCAGCGG CTTCGCCCAC TGGGGAGGCG CCGGCCGCCG GCACGACGGA GAGCCTGGGC CTTGCGCTTA CCCGCTCGCC GGAAGGCGTC ATGGTGCAGA GCGTCGACCC CGCCAGCGAT GCTGCCGAAA AGGGCGTCCG TCCCGGCGAC ATTATCGTCA AGGTATCCGG CAAGGACGTG ACGGAGCCCG CCGATGTGGT GGCGCGCGTC GCGGAAGCAG GAAAGGCCGA CAAGAACTCG GTCCTGCTTC TTCTCCGAAC CGACAATCAG CAGCGTTTCG TTGCACTGAC GCTTGAGAAA TCCTGA
|
Protein sequence | MRNESNIGRQ RTLALAVAGA LVFGAWSSGA LAQEPLDTRS AAPASTALEN LPSFADLVEK VNPAVVSIRV DEEVAARSSG VPDLPFPPGS PFEKFFRDMQ PQQGPDGAPP RRHATALGSG FLISADGFVV TNNHVVGDGK DITVVRSDGS EMKAKLIGRD PKTDLALVKV ESKEPLPYVV FGNSDNVRVG DWVLAVGNPF GLGGTVTTGI VSARGREIGA GPYDDFIQID ASINKGNSGG PTFDVRGNVV GVNTAIFSPT GGSVGIGFAI PSSIAQNVIA QLKEDGKVTR GWLGVTIQQV DEDVASTLAL DKPRGALVAQ VAEDSPAKKA GIQTGDVILN VDGKEMEDVR AVSRTVADLQ PDTRSQIVLW RDGKRKNISA QIGTFPEEIA AAAASPTGEA PAAGTTESLG LALTRSPEGV MVQSVDPASD AAEKGVRPGD IIVKVSGKDV TEPADVVARV AEAGKADKNS VLLLLRTDNQ QRFVALTLEK S
|
| |
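The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the database's own pipeline: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is written out by hand. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the set of permitted start codons, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
# Amino-acid assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces that separate 10-letter blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-letter blocks and the last 12 bases of the gene sequence above.
print(translate("ATGCGGAATG AAAGCAATAT CGGGCGGCAG"))  # MRNESNIGRQ
print(translate("GAGAAA TCCTGA"))                     # EKS
```

The two printed fragments match the start (MRNESNIGRQ) and end (EK S) of the listed protein sequence. Passing the full gene sequence to `gc_fraction` should give a value comparable to the listed GC content, depending on how the database rounds it.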