Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_5252 |
Symbol | |
ID | 7972685 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | + |
Start bp | 5573102 |
End bp | 5574172 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644795846 |
Product | peptidase M4 thermolysin |
Protein accession | YP_002947120 |
Protein GI | 239818210 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3227] Zinc metalloprotease (elastase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0068319 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCTTC GCTCGCCGAG CCTTTTACCG CCCAGCTTCG TACCGCCGTA CCTGCTGGAC CGGCTGGCGC AACATGCCGG TGCGCATGCC AGCGCGAAGG CGGCGCAGAC GCTGATGATC GACCTGCAGC ACCGCGGCCT GCGCGAGGCG GTGGCCGGGC AGGGCGTGTC GTCGGGCCCC GCGCCCAGCT ATGTGCGGCG CGGCTCGCCC GCGCGCGCCA TCCACGACGC AGAGCACACC ATGGTCCTGC CGGGGCGGCT GGTGCGTGCC GAAGGACAGG CCGCCACCGG CGACATCGCC GCCGACGAGG CCTACGACTA CCTGGGCGCC ACCTACCGCC TGTACCACGA CATCTTCGAG CGCGATTCCA TCGACGGCGC GGGCATGCCG CTCACGGGCA GCGTGCACTA CGGCAACGAC TACGACAACG CCTTCTGGAA CGGCCAGCAG ATGGTGTTCG GCGACGGCGA CGGCGAGGTC ATGAACCGCT TCACCATCGC GGTGGACATC ATCGGCCACG AGCTCACGCA CGGCGTGATC GACCACGAGT CGGGCCTGGT CTACCAGGGG CAGCCGGGCG CGCTCAACGA GTCGATCTGC GACGTGTTCG GCGCGCTGGT CAAGCAGCAC CTGCTCAAGC AGACCGCGCA GCAGGCCGAC TGGCTGGTCG GCGCGGGGCT CTTCACCGGC AAGGTCAAGG CGCGCGCACT GCGCTCGATG GCCGAGCCCG GCACGGCCTA CGACGACCCG GTGCTCGGCA AGGACCCGCA GCCTGCCCAC ATGAAGGACT TTGTCGACAC GCGCCAGGAC AATGGCGGCG TGCACATCAA TTCCGGCATT CCGAACCGCG CCTTCCATCT CGCTGCCACG GCCATCCAGG GCCCGGCATG GGAGACGGCC GGGCGCGTCT GGTACGACAC GGTGTGCGAC CGGCGGCTGC GCCAGGACGC CGATTTCCTG GCCTTCGCGC AGCTCAGCGT GGAAAATGCA GCCAGGCGCT TCGGCGCCGG CAGCGCGGCG CACCAGGCCG TCGGCGCTAC ATGGAACACC GTGGGAGTCA CACCATCATG A
|
Protein sequence | MPLRSPSLLP PSFVPPYLLD RLAQHAGAHA SAKAAQTLMI DLQHRGLREA VAGQGVSSGP APSYVRRGSP ARAIHDAEHT MVLPGRLVRA EGQAATGDIA ADEAYDYLGA TYRLYHDIFE RDSIDGAGMP LTGSVHYGND YDNAFWNGQQ MVFGDGDGEV MNRFTIAVDI IGHELTHGVI DHESGLVYQG QPGALNESIC DVFGALVKQH LLKQTAQQAD WLVGAGLFTG KVKARALRSM AEPGTAYDDP VLGKDPQPAH MKDFVDTRQD NGGVHINSGI PNRAFHLAAT AIQGPAWETA GRVWYDTVCD RRLRQDADFL AFAQLSVENA ARRFGAGSAA HQAVGATWNT VGVTPS
|
| |