Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_1051 |
Symbol | |
ID | 7972022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | + |
Start bp | 1151004 |
End bp | 1152755 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644791647 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_002942968 |
Protein GI | 239814058 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGACC AAACCCCCAA GAAGAAGCTG CGCTCGACCG AATGGTTCGG CTCGGCCGAC AAGAACGGCT TCATGTACCG CAGCTGGATG AAGAACCAGG GCATACCGGA CCACGAGTTC GACGGCCGGC CGATCATCGG CATCTGCAAC ACCTGGTCGG AGCTCACGCC CTGCAATGCG CACTTCCGCA AGATCGCCGA GCATGTGAAG CGCGGCATCT CGGAGGCGGG CGGCTTCCCG GTCGAGTTTC CGGTGTTCTC CAACGGCGAA TCGAACCTGC GGCCCACCGC CATGCTCACG CGCAACCTTG CGAGCATGGA TGTCGAAGAG GCGATCCGCG GCAACCCGGT CGATGCGGTG GTGCTGCTCA CGGGCTGCGA CAAGACCACG CCCGCATTGC TGATGGGCGC GGCCAGCTGC GACATCCCCG CCATCGTGGT CACGGGCGGG CCCATGCTCA ACGGCAAGCT CGACGGCAAG GACATCGGCT CGGGCACGGC CGTGTGGCAG CTGCACGAGT CGCTCAAGGC CGGCGAGATC AACCTGCACC AGTTCCTCTC GGCCGAAGGC GGCATGTCGC GTTCGGCCGG CACTTGCAAC ACCATGGGCA CGGCCTCGAC CATGGCCTGC ATGGCCGAGG CGCTGGGCAC CTCGCTGCCG CACAACGCGG CCATTCCGGC GGTCGACGCG CGGCGCTACG TGCTCGCGCA GATGTCGGGC ATGCGCGCGG TCGAGATGGC GAAGGAGGGG CTCACGCTGT CGAAGATCCT CACGCGCGAG GCCTTCGAGA ATGCCATTCG CGTCAATGCC GCCATCGGCG GGTCGACCAA TGCGGTGATC CACCTGAAGG CCATTGCGGG GCGCATCGGC GTCGACCTGG AACTGGAAGA CTGGACGCGC ATCGGCAGCA ACACGCCGAC CATCGTCGAC CTGTTGCCCT CGGGCCGCTT CCTGATGGAG GAGTTCTACT ACGCGGGCGG CCTGCCCGCG GTGCTGCGGC GCCTGGGCGA GAACGGTCTC TTGCCGCACC CCGGCGCGCT CACCGTCAAC GGCCAGTCGA TCTGGGACAA CGTGCGCGAG GCGCCGAGCC TCAACGACGA GGTGATCCGC CCGCTCGACA AGCCGCTGAT CGCCGACGGC GGCATCCGCA TCCTGCGCGG CAATTTGTCG CCGCGCGGCG CGGTGCTCAA GCCCTCGGCC GCATCGCCCG AACTGCTCAA GCACCGCGGC CGCGCGGTGG TGTTCGAGAA CCTGGAACAC TACAAGGAGC GCATCGTCGA CGAGAGCCTC GAGATCGACG CCAGCTCGGT GATGGTGATG AAGAACTGCG GCCCCAAGGG CTACCCCGGC ATGGCCGAGG TCGGCAACAT GGGCTTGCCG CCCAAGCTGC TGCGCCAGGG CGTGAAGGAC ATGGTGCGCA TCTCGGATGC ACGCATGAGC GGCACGGCCT ACGGCACGGT GGTGCTGCAC GTCGCGCCCG AGGCGGCCGA CGGCGGACCG CTTGCGGCGG TGCGCGACGG CGACTGGATC GAGCTCGACT GCGATGCGGG CCGCCTGCAC CTGGACATCA GCGACGAAGA GCTGGCCGCG CGCCTGGCGT CGCTCACCAG CACCGACGCG CAGCCCATGA GCACGCGCGG CGGCGGCTAC CAGAAGCTCT ACGTCAACCA CGTGCTGCAG GCCGACGAAG GCTGCGACTT CGATTTTCTG GTGGGCTGCA GAGGCTCCGC CGTTCCCCGC CATTCACACT GA
|
Protein sequence | MPDQTPKKKL RSTEWFGSAD KNGFMYRSWM KNQGIPDHEF DGRPIIGICN TWSELTPCNA HFRKIAEHVK RGISEAGGFP VEFPVFSNGE SNLRPTAMLT RNLASMDVEE AIRGNPVDAV VLLTGCDKTT PALLMGAASC DIPAIVVTGG PMLNGKLDGK DIGSGTAVWQ LHESLKAGEI NLHQFLSAEG GMSRSAGTCN TMGTASTMAC MAEALGTSLP HNAAIPAVDA RRYVLAQMSG MRAVEMAKEG LTLSKILTRE AFENAIRVNA AIGGSTNAVI HLKAIAGRIG VDLELEDWTR IGSNTPTIVD LLPSGRFLME EFYYAGGLPA VLRRLGENGL LPHPGALTVN GQSIWDNVRE APSLNDEVIR PLDKPLIADG GIRILRGNLS PRGAVLKPSA ASPELLKHRG RAVVFENLEH YKERIVDESL EIDASSVMVM KNCGPKGYPG MAEVGNMGLP PKLLRQGVKD MVRISDARMS GTAYGTVVLH VAPEAADGGP LAAVRDGDWI ELDCDAGRLH LDISDEELAA RLASLTSTDA QPMSTRGGGY QKLYVNHVLQ ADEGCDFDFL VGCRGSAVPR HSH
|
| |