Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_2386 |
Symbol | |
ID | 7970429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | + |
Start bp | 2550065 |
End bp | 2551540 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644792968 |
Product | protease Do |
Protein accession | YP_002944279 |
Protein GI | 239815369 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.444921 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACCC GACTGACTTC CCCCCATGCC CTGGTGCTTG CCTTGGCGAC CGCCGGGGTC ATCGGTGCCG TCGGTGCCGG CGCCTACACC AGTGCCCGCG CCGTGAATGC GCCGACCACC AACGCCACGA GCGTGGCGCC CGCGGCCATG GTGACGCTGC CCGACTTCTC CACCATCACT TCGCGCGACG GTCCCGCGGT GGTCAACATC AGCGTGACGG GCACCGCCAA GGCGTCCGAG GACCAGGCCG CGGCCGAGAT CCAGGGCATC GATCCGGACG ATCCGATGTT CCAGTTCTTC CGCCGCTTCC AGGGCCAGAT CGGCCCGCGC GCCCAGCAGC GCGAAGTGCC GGTGCGCGCG CAGGGCTCGG GCTTCATCGT GAGCCCCGAC GGCATCATCA TGACCAACGC GCACGTCGTG AAAGACGCCA AGGAGGTCAC CGTCAAGTTG ACCGACCGGC GCGAATACCG CGCGAAGGTG CTCGGCGCCG ACGCCAAGAC CGACATCGCG GTGCTCAAGA TCGATGCCAG GAACCTGCCC ACGCTCGCGC TCGGCAACAC CAAGGACCTG AAGGTCGGCG AATGGGTGCT GGCCATCGGC TCGCCCTTCG GCTTCGAGAG CACGGTGACG GCCGGCGTCG TGAGCGCCAA GGGCCGCTCG CTGCCCGATG ACAGCTACGT GCCGTTCATC CAGACCGACG TCGCGGTGAA CCCCGGCAAC TCCGGCGGCC CGCTGCTGAA CACGCGCGGC GAAGTGGTCG GCATCAATTC GCAGATCTAC AGCCGCAGCG GCGGCTACCA GGGCGTGTCG TTCGCCATTC CGATCGACGT GGCGGTGCAG GTGAAGGACC AGATCGTCGC GACCGGCAAG GCCACGCATG CGCGCCTTGG CGTGGCGGTG CAGGAAGTGA ACCAGGCCTT TGCCGACTCC TTCAAGCTGG ACAAGCCCGA GGGCGCGCTG GTCTCCAACA TCGAGAAGGG CGGCCCCGGC GACAAGGCCG GCCTGAAGGC GGGCGACGTG ATCCGCAAGG TGGACGGCCA GCCCATCGTC TCGTCGGGCG ACCTGCCTGC GGTCATCGGG CAGCAGACGC CGGGCAAGAA GGTCACGCTC GAAGTCTGGC GCCAGGGCGA GCGGCAGGAG CTTTCGGCCA AGCTCGGCGA TGCGAGCGAC AAGCCCGCGC AGGTCGCCAA GAACGAGAGC GCGGCGGGGC AGGGCAAGCT CGGCCTTGCG TTGCGGCCGC TGCAGCCGCA GGAAAAGCGC GAAGCCGCCA TCGAGAACGG GCTGCTCGTC GAGGATGTGG CGGGCCCGTC CGCCATGGCC GGCGTGCAGG CGGGCGATGT GCTGCTGGCC ATCAACGGCA CGCCCGCCAG GAGCCTGGAG CAGGTGCGCG AGGTGGTGGC CAAGGCCGAC AAGTCGGTCG CGCTCTTGAT CCAGCGCGGC GAAGACAAGA TCTTCGTGCC GGTGCGGATC GGGTGA
|
Protein sequence | MNTRLTSPHA LVLALATAGV IGAVGAGAYT SARAVNAPTT NATSVAPAAM VTLPDFSTIT SRDGPAVVNI SVTGTAKASE DQAAAEIQGI DPDDPMFQFF RRFQGQIGPR AQQREVPVRA QGSGFIVSPD GIIMTNAHVV KDAKEVTVKL TDRREYRAKV LGADAKTDIA VLKIDARNLP TLALGNTKDL KVGEWVLAIG SPFGFESTVT AGVVSAKGRS LPDDSYVPFI QTDVAVNPGN SGGPLLNTRG EVVGINSQIY SRSGGYQGVS FAIPIDVAVQ VKDQIVATGK ATHARLGVAV QEVNQAFADS FKLDKPEGAL VSNIEKGGPG DKAGLKAGDV IRKVDGQPIV SSGDLPAVIG QQTPGKKVTL EVWRQGERQE LSAKLGDASD KPAQVAKNES AAGQGKLGLA LRPLQPQEKR EAAIENGLLV EDVAGPSAMA GVQAGDVLLA INGTPARSLE QVREVVAKAD KSVALLIQRG EDKIFVPVRI G
|
| |