Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_4645 |
Symbol | |
ID | 7972855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | + |
Start bp | 4935035 |
End bp | 4936891 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644795229 |
Product | peptidase M61 domain protein |
Protein accession | YP_002946516 |
Protein GI | 239817606 |
COG category | [R] General function prediction only |
COG ID | [COG3975] Predicted protease with the C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.348977 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGCGC CGACCGTTTC CCGGCGCGCG CGAGCGGCTG CGCCGGCCGC CGCGGTGCAC TACCGCATCG AGTGCGCCGA CCTGCATGCG CACCTCTTCG CCGTCACCCT GACCATCGAG GCCCCCGCCG CGCAGCAGCG CGTGGCGCTG CCGGTGTGGA TTCCGGGCAG CTACCTGGTG CGCGAGTTCG CCAAGAACCT GCAGGGCCTG CGCGCCACGC AGGGACGCCG CAAGCCCGCG CTGACGCAGC TCGACAAGTG CAGCTGGCAG GTCGACTGCG TGCCGGGCCA GCCGCTGGTG CTGCACTACC AGGTCTGCGC CTACGACAAC TCGGTGCGCA CCGCCTGGCT CGACGCCGGC CGCGGCTTCT TCAACGGCAC CAGCCTGTGC CTGCGGGTCG AGGGCCAGAC CGATGCGCCG CATGCGCTGG ACATCAGCGC GCCCGCCCTG CCGCCCGGCG ACGATGCGCG CTGGTCGTGC GCCACCGCGC TCGTGCCCGA GAAGATCGAC AAGCTCGGCT TCGGCCGCTA CCTGGCCGCT GGCTACGACG AGCTGGCCGA CAGCCCGGTC GAGATGGGCC CCTTCTGGAG CGCCGAGTTC GAGGCCTGCG GCGTGCCGCA CCGCTTCGTC ATCGCAGGTG CCGCTGCCTC GTTCGACGGC GAGCGCCTGA TCGCCGACAC GCAGGCCATC TGCGAAGCCG AGATCCGCTT CTGGCATGGC GACAAGGCCG GCAAGCGAGG CGGCCCCAAG CTGCCGATCG ACCGCTACGT GTTCATGCTC AACGCGGTGG ACGACGGCTA CGGCGGCCTG GAGCACCGGC ACTCCACCGC GCTGATCTGC AACCGGCGCG ACCTGCCCCA GCGCGGCGCG AAGAAGCAGC CCGAGGGCTA CACCACGCTG ATGGGCCTCA TCAGCCACGA GTACTTCCAC ACCTGGAACG TCAAGCGCAT GCGCCCGGCC GAGTTCGCGC ACTACGACTA CAGCCGCGAG AACTACACGC AGATGCTGTG GTTCTTCGAG GGCTTCACCA GCTACTACGA CGACCTGCTG CTGCGCCGTG CCGGCCGCAT CGACGACGCT GGCTACCTGC GCCTGCTCAA CAAGACCGTG AACCAGGTGC TGCAGACACC GGGCCGGCTC GTGCAGTCGG TGGCCGAGGC GAGCTTCGAC GCCTGGGTCA AGTACTACCG GCAGGACGAG CAAACGCCCA ACGGCACCGT CAGCTACTAC ACCAAGGGCG CGCTGGTGGC GCTGTGCTTC GACCTCAGCT TGCGCCGCGA AGGCAAGGGC ACGCTCGACG ACGTGATGCG CCACCTCTGG ACCGAGGGCG GCGGCGGCCC GATCAGCGAA GCCGACGTGG CCGCGGCGCT CCAGGCCGTG GGCGGCCGCT CCTATGCGGC GGAAATCGCG CAGTGGATCC ACTCGACCGA CGAGCTGCCG CTGGCAGATT TGCTGCGCGC GCATGGCGTG GCTGCGCTCG AAGACCCGTC GCAGCAGGCG CAGGCGCTGG GCCTGCGCGT GGCCGAGGCC AACGGCAGCG TCCAGGTCAA GGTGGTGCTG CGCGGCGGCG CGGCCGAAAA GGCCGGCTTC TCGGCCCACG ACGAATGGAT CGGCGTCGAG CTGCCCGCTG TGGGCCGCAA GGGCCAGCAG CGCCCCGCGC AGGCCTGGCG CATCGCCAAG CTCGACGACC TGTCGCTGTA CCTTGGCGAC GCCACCCGCT GCACCGCGCT GGTGGCGCGC GACCGCAAGC TGCTGAGGCT GCCGCTGGTG CTGCCGGAGG GCGCCACCAC CTGGCGCCTG TTCTCGCACG ACGCCGCAAA GATCGCGGCC TGGCTCGCGC CGGGCGCGCG GCGCTGA
|
Protein sequence | MAAPTVSRRA RAAAPAAAVH YRIECADLHA HLFAVTLTIE APAAQQRVAL PVWIPGSYLV REFAKNLQGL RATQGRRKPA LTQLDKCSWQ VDCVPGQPLV LHYQVCAYDN SVRTAWLDAG RGFFNGTSLC LRVEGQTDAP HALDISAPAL PPGDDARWSC ATALVPEKID KLGFGRYLAA GYDELADSPV EMGPFWSAEF EACGVPHRFV IAGAAASFDG ERLIADTQAI CEAEIRFWHG DKAGKRGGPK LPIDRYVFML NAVDDGYGGL EHRHSTALIC NRRDLPQRGA KKQPEGYTTL MGLISHEYFH TWNVKRMRPA EFAHYDYSRE NYTQMLWFFE GFTSYYDDLL LRRAGRIDDA GYLRLLNKTV NQVLQTPGRL VQSVAEASFD AWVKYYRQDE QTPNGTVSYY TKGALVALCF DLSLRREGKG TLDDVMRHLW TEGGGGPISE ADVAAALQAV GGRSYAAEIA QWIHSTDELP LADLLRAHGV AALEDPSQQA QALGLRVAEA NGSVQVKVVL RGGAAEKAGF SAHDEWIGVE LPAVGRKGQQ RPAQAWRIAK LDDLSLYLGD ATRCTALVAR DRKLLRLPLV LPEGATTWRL FSHDAAKIAA WLAPGARR
|
| |