Gene Vapar_1051 details

Gene Information

Locus tag: Vapar_1051
Symbol:
ID: 7972022
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012791
Strand:
Start bp: 1151004
End bp: 1152755
Gene Length: 1752 bp
Protein Length: 583 aa
Translation table: 11
GC content: 68%
IMG OID: 644791647
Product: dihydroxy-acid dehydratase
Protein accession: YP_002942968
Protein GI: 239814058
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
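
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1151004
end_bp = 1152755

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(f"gene length: {gene_length} bp")
print(f"protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1752 bp) and Protein Length (583 aa).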


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGGACC AAACCCCCAA GAAGAAGCTG CGCTCGACCG AATGGTTCGG CTCGGCCGAC 
AAGAACGGCT TCATGTACCG CAGCTGGATG AAGAACCAGG GCATACCGGA CCACGAGTTC
GACGGCCGGC CGATCATCGG CATCTGCAAC ACCTGGTCGG AGCTCACGCC CTGCAATGCG
CACTTCCGCA AGATCGCCGA GCATGTGAAG CGCGGCATCT CGGAGGCGGG CGGCTTCCCG
GTCGAGTTTC CGGTGTTCTC CAACGGCGAA TCGAACCTGC GGCCCACCGC CATGCTCACG
CGCAACCTTG CGAGCATGGA TGTCGAAGAG GCGATCCGCG GCAACCCGGT CGATGCGGTG
GTGCTGCTCA CGGGCTGCGA CAAGACCACG CCCGCATTGC TGATGGGCGC GGCCAGCTGC
GACATCCCCG CCATCGTGGT CACGGGCGGG CCCATGCTCA ACGGCAAGCT CGACGGCAAG
GACATCGGCT CGGGCACGGC CGTGTGGCAG CTGCACGAGT CGCTCAAGGC CGGCGAGATC
AACCTGCACC AGTTCCTCTC GGCCGAAGGC GGCATGTCGC GTTCGGCCGG CACTTGCAAC
ACCATGGGCA CGGCCTCGAC CATGGCCTGC ATGGCCGAGG CGCTGGGCAC CTCGCTGCCG
CACAACGCGG CCATTCCGGC GGTCGACGCG CGGCGCTACG TGCTCGCGCA GATGTCGGGC
ATGCGCGCGG TCGAGATGGC GAAGGAGGGG CTCACGCTGT CGAAGATCCT CACGCGCGAG
GCCTTCGAGA ATGCCATTCG CGTCAATGCC GCCATCGGCG GGTCGACCAA TGCGGTGATC
CACCTGAAGG CCATTGCGGG GCGCATCGGC GTCGACCTGG AACTGGAAGA CTGGACGCGC
ATCGGCAGCA ACACGCCGAC CATCGTCGAC CTGTTGCCCT CGGGCCGCTT CCTGATGGAG
GAGTTCTACT ACGCGGGCGG CCTGCCCGCG GTGCTGCGGC GCCTGGGCGA GAACGGTCTC
TTGCCGCACC CCGGCGCGCT CACCGTCAAC GGCCAGTCGA TCTGGGACAA CGTGCGCGAG
GCGCCGAGCC TCAACGACGA GGTGATCCGC CCGCTCGACA AGCCGCTGAT CGCCGACGGC
GGCATCCGCA TCCTGCGCGG CAATTTGTCG CCGCGCGGCG CGGTGCTCAA GCCCTCGGCC
GCATCGCCCG AACTGCTCAA GCACCGCGGC CGCGCGGTGG TGTTCGAGAA CCTGGAACAC
TACAAGGAGC GCATCGTCGA CGAGAGCCTC GAGATCGACG CCAGCTCGGT GATGGTGATG
AAGAACTGCG GCCCCAAGGG CTACCCCGGC ATGGCCGAGG TCGGCAACAT GGGCTTGCCG
CCCAAGCTGC TGCGCCAGGG CGTGAAGGAC ATGGTGCGCA TCTCGGATGC ACGCATGAGC
GGCACGGCCT ACGGCACGGT GGTGCTGCAC GTCGCGCCCG AGGCGGCCGA CGGCGGACCG
CTTGCGGCGG TGCGCGACGG CGACTGGATC GAGCTCGACT GCGATGCGGG CCGCCTGCAC
CTGGACATCA GCGACGAAGA GCTGGCCGCG CGCCTGGCGT CGCTCACCAG CACCGACGCG
CAGCCCATGA GCACGCGCGG CGGCGGCTAC CAGAAGCTCT ACGTCAACCA CGTGCTGCAG
GCCGACGAAG GCTGCGACTT CGATTTTCTG GTGGGCTGCA GAGGCTCCGC CGTTCCCCGC
CATTCACACT GA
 
Protein sequence
MPDQTPKKKL RSTEWFGSAD KNGFMYRSWM KNQGIPDHEF DGRPIIGICN TWSELTPCNA 
HFRKIAEHVK RGISEAGGFP VEFPVFSNGE SNLRPTAMLT RNLASMDVEE AIRGNPVDAV
VLLTGCDKTT PALLMGAASC DIPAIVVTGG PMLNGKLDGK DIGSGTAVWQ LHESLKAGEI
NLHQFLSAEG GMSRSAGTCN TMGTASTMAC MAEALGTSLP HNAAIPAVDA RRYVLAQMSG
MRAVEMAKEG LTLSKILTRE AFENAIRVNA AIGGSTNAVI HLKAIAGRIG VDLELEDWTR
IGSNTPTIVD LLPSGRFLME EFYYAGGLPA VLRRLGENGL LPHPGALTVN GQSIWDNVRE
APSLNDEVIR PLDKPLIADG GIRILRGNLS PRGAVLKPSA ASPELLKHRG RAVVFENLEH
YKERIVDESL EIDASSVMVM KNCGPKGYPG MAEVGNMGLP PKLLRQGVKD MVRISDARMS
GTAYGTVVLH VAPEAADGGP LAAVRDGDWI ELDCDAGRLH LDISDEELAA RLASLTSTDA
QPMSTRGGGY QKLYVNHVLQ ADEGCDFDFL VGCRGSAVPR HSH
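
The sequences above are laid out in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup with two basic sanity checks on the gene sequence. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, as laid out in the record.
first_line = "ATGCCGGACC AAACCCCCAA GAAGAAGCTG CGCTCGACCG AATGGTTCGG CTCGGCCGAC"
last_line = "CATTCACACT GA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

# Sanity checks: the gene starts with ATG and ends with a stop codon.
starts_with_atg = head.startswith("ATG")
ends_with_stop = tail[-3:] in {"TAA", "TAG", "TGA"}

print(starts_with_atg, ends_with_stop)
```

Passing the full cleaned gene sequence to `gc_fraction` is one way to compare against the GC content listed in the record.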