Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_1731 |
Symbol | |
ID | 7974473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | + |
Start bp | 1861786 |
End bp | 1863141 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644792330 |
Product | glucarate dehydratase |
Protein accession | YP_002943647 |
Protein GI | 239814737 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR03247] glucarate dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00228751 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACTTC CAGACTCCGC TTCTCCCATC TCCGGCGCAC CCGTGGTCAC CGCCATGCGC GTGGTGCCCG TCGCGGGCCA CGACAGCATG CTCATGAACC TGAGCGGCGC GCACGGCCCG TTCTTCACGC GCAACCTGCT GATCCTGACC GACAGCGCGG GCCACACCGG TGTCGGCGAA GTGCCGGGCG GCGAGAAGAT CCGCCAGACG CTCGAAGATG CGCGCGGCCT CATCGTGGGC CAGCCCCTGG GCAAACACCA CGCAGTGCTC AACAGCATGC GCAGTGCCTT CGCCAACCGC GACAGCGGCG GCCGCGGCCA GCAGACCTTC GACCTGCGCG TGACCATCCA TGCGGTCACC GCCGTCGAGG CGGCCTTGCT CGACCTGCTG GGCCAGCACC TCGAAGTGCC GGTGGCCGCA CTGCTCGGCG AAGGCCAGCA GCGCGATGCC GTGCAGATGC TCGGCTACCT GTTCTACGTG GGCGACCGCA CCAGGACCGA CCTGCCCTAC GAGAGCGACC CCGGCGGCGC CGACGACTGG TTTCGCCTGC GCCACGAAGA AGCCATGACG CCCGAAGCCA TCGTGCGCCT GGCCGAGGCC ACGCACGCGC GCTACGGCTT CACCGACTTC AAGCTCAAGG GCGGCGTGCT GCGCGGCGAG GAAGAAGTCG AGGCCATCCG CGCGCTGCAC GAGCGCTTTC CCAAGGCGCG CGTCACGCTC GACCCCAACG GCGGCTGGCT GTTGGCCGAT GCGATCCGCC TGTGCCGCGA CCTGCATGGC GTCATGGCCT ATGCCGAGGA CCCCTGCGGC GCCGAAGGCG TGTTCTCGGG CCGCGAGGTG ATGGCCGAGT TCCGCCGCGC CACGGGCCTG CCGACCGCCA CCAACATGGT CGCCACCGAC TGGCGCGAGA TGGTGCACAG CCTCTCGCTG CAGTCGGTCG ACATTCCGCT GGCCGATCCG CACTTCTGGA CGATGCAGGG CTCGGTGCGC GTGGCGCAGC TGTGCCAGGC CTGGGGCCTG ACCTGGGGCT CGCATTCGAA CAACCACTTC GATGTGTCGC TGGCCATGTT CACGCACGTG GCCGCGGCCG CGCCGGGCAA GGTGACGGCC ATCGACACGC ACTGGATCTG GCAGGACGGC CAGCGCCTGA CCAAGGCCCC GCTGCAGATC GAGGGCGGCT ACGTGAAGGT GCCCACGCGC GGCGGCCTGG GCGTGGAGCT CGACATGGAC GAAGTCGAGA AGGCGCACCA GCTGTACCTG AAGCATGGCC TGGGCGCGCG CAACGATGCG CAGGCGATGC AATACCTGAT CCCGGGGTGG ACTTTCGACA ATAAAAAGCC CTGCATGGTG CGCTGA
|
Protein sequence | MTLPDSASPI SGAPVVTAMR VVPVAGHDSM LMNLSGAHGP FFTRNLLILT DSAGHTGVGE VPGGEKIRQT LEDARGLIVG QPLGKHHAVL NSMRSAFANR DSGGRGQQTF DLRVTIHAVT AVEAALLDLL GQHLEVPVAA LLGEGQQRDA VQMLGYLFYV GDRTRTDLPY ESDPGGADDW FRLRHEEAMT PEAIVRLAEA THARYGFTDF KLKGGVLRGE EEVEAIRALH ERFPKARVTL DPNGGWLLAD AIRLCRDLHG VMAYAEDPCG AEGVFSGREV MAEFRRATGL PTATNMVATD WREMVHSLSL QSVDIPLADP HFWTMQGSVR VAQLCQAWGL TWGSHSNNHF DVSLAMFTHV AAAAPGKVTA IDTHWIWQDG QRLTKAPLQI EGGYVKVPTR GGLGVELDMD EVEKAHQLYL KHGLGARNDA QAMQYLIPGW TFDNKKPCMV R
|
| |