Gene Vapar_3689 details


Gene Information

Locus tag: Vapar_3689
Symbol:
ID: 7973922
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012791
Strand:
Start bp: 3889601
End bp: 3891337
Gene Length: 1737 bp
Protein Length: 578 aa
Translation table: 11
GC content: 67%
IMG OID: 644794273
Product: dihydroxy-acid dehydratase
Protein accession: YP_002945571
Protein GI: 239816661
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
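
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(3889601, 3891337))  # (1737, 578)
```

The result matches the listed Gene Length (1737 bp) and Protein Length (578 aa).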


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACTT CCCCCAAGAA AAAACCCGAA GACCTGCGCA GCCAGCAATG GTTCGGCCGC 
CACGACCGCG ATGGCTTCAT CTACCGCAGC TGGGTCAAGG GCAAGGGCGT GCCGCACGAC
CAGTTCGACG GGCGCCCGGT CATCGGCATC TGCAACACCT TCAGCGAGCT CACGCCCTGC
AACTCGCACT TCCGCACGCT CGCCGAGCAG GTGAAGATCG GCGTCTACGA AGCGGGCGGC
TTTCCGCTCG AATTCCCGGT GATGTCGCTC GGCGAAACGC TCTTGCGCCC CACCGCCATG
CTGTACCGCA ACCTCGCGAG CATGGACGTG GAAGAAAGCA TCCGCGGCAA TCCGCTGGAC
GGCGTGGTGC TGCTCATGGG CTGCGACAAG ACCACGCCCG CGCTCATGAT GGGTGCGGCC
AGCGTCGACC TGCCGACCAT CGGCGTCTCC GGCGGCCCGA TGCTTTCGGG CAAGTGGCGC
GGCCAGGAAC TGGGCTCGGG CACCGGCGTG TGGCAGATGA GCGAGCAGGT GCGCGCCGGC
ACGCTCAAGC TGCAGGACTT CTTCGAGGCC GAGAGCTGCA TGCACCGCAG CCACGGCCAC
TGCATGACCA TGGGCACCGC CAGCACCATG GCCAGCATGG TCGAGTCGCT GGGCATCGGC
CTGCCCGGCA ACGCCGCCTA CCCGGCGGTG GACGGCCGGC GCAACGTGCT CGCGCGCATG
GCGGGGCGGC GCATCGTCGA CATGGTCCAT GAAGACCTCC ACATGTCGAA GATCCTCACG
CGCCAGGCCA TCGAGAACGC CATCAAGGTC AACGCCGCCA TCGGCGGCTC CACCAACCTC
GTCATCCACC TGCTGGCCAT TGCGGGGCGC ATCGGCGTCG ATCTTTCGCT GGACGACTTC
GACCGCCTGG CCTCGGACCT CCCCTGCCTG GTCGACCTGC AGCCTTCGGG CCGCTTCCTG
ATGGAAGACT TCTGCTATGC GGGCGGGCTG CCGGTGGTCA TCAAGGAGAT CGCGCAGTAC
CTGCACAAGG ATGTGATCAC GGCCAACGGC CAGACACTGT GGGACAACGT GAAGGACGCC
GAGAACTACA ACCCGCAGGT GATCCGCCCG CTGGCCGAGC CCTTCAAGGA CAAGGCCGGC
ATCTGCGTGC TGCGCGGCAA TCTCGCGCCC AACGGCGCCA TCATCAAGCC CAGTGCCGCC
ACGCCCGAGC TGCTGGTGCA CAAGGGCCGC GCGGTGGTGT TCGAAAGCGC CGACGACCTG
CACAAGCGCA TCGACGACGA GAACCTCGAC ATCGACGAGC ACTGCGTGAT GGTGCTGAAG
AACTGCGGCC CGCGCGGCTA TCCGGGCATG GCCGAGTCGG GCAACATGCC GCTGCCGCCG
AAAGTGCTGC GCAAGGGCAT CACCGACATG GTCCGCATCA GCGACGCGCG CATGAGCGGC
ACGGCCTACG GCACGGTGGT GCTGCACACG GCGCCCGAGG CGGCCGCGGG CGGACCGCTC
GCGCTGGTGC AGGACGGCGA CATCGTCGAG CTGGACGTGC CCAACCGCAA ACTGCACCTG
CACGTGAGCG ACGAAGAGCT CGCCAGGCGG CTCGAGAAGT GGGTCGCGCC CAAGGCGCCG
CTCGATTCGG GTTACTGGAA GCTGTACGTC GACACGGTGC TGCAAGCCGA CCAGGGCGCC
GACCTGGCCT TCCTGCGTGG TCGCCGCGGG GCCTTCGTGC CGCGCGACAA TCACTGA
 
Protein sequence
MSTSPKKKPE DLRSQQWFGR HDRDGFIYRS WVKGKGVPHD QFDGRPVIGI CNTFSELTPC 
NSHFRTLAEQ VKIGVYEAGG FPLEFPVMSL GETLLRPTAM LYRNLASMDV EESIRGNPLD
GVVLLMGCDK TTPALMMGAA SVDLPTIGVS GGPMLSGKWR GQELGSGTGV WQMSEQVRAG
TLKLQDFFEA ESCMHRSHGH CMTMGTASTM ASMVESLGIG LPGNAAYPAV DGRRNVLARM
AGRRIVDMVH EDLHMSKILT RQAIENAIKV NAAIGGSTNL VIHLLAIAGR IGVDLSLDDF
DRLASDLPCL VDLQPSGRFL MEDFCYAGGL PVVIKEIAQY LHKDVITANG QTLWDNVKDA
ENYNPQVIRP LAEPFKDKAG ICVLRGNLAP NGAIIKPSAA TPELLVHKGR AVVFESADDL
HKRIDDENLD IDEHCVMVLK NCGPRGYPGM AESGNMPLPP KVLRKGITDM VRISDARMSG
TAYGTVVLHT APEAAAGGPL ALVQDGDIVE LDVPNRKLHL HVSDEELARR LEKWVAPKAP
LDSGYWKLYV DTVLQADQGA DLAFLRGRRG AFVPRDNH
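
The sequence blocks are printed in groups of ten characters, so they need to be stripped of whitespace before use. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. It assumes the standard codon assignments, which translation table 11 shares with the standard code (the two differ only in permitted start codons). The helper names `clean_sequence`, `translate`, and `gc_content` are hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments, with codons ordered by BASES at each position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
head = clean_sequence("ATGAGCACTT CCCCCAAGAA AAAACCCGAA")
print(translate(head))  # MSTSPKKKPE
```

The translated fragment agrees with the first ten residues of the protein sequence above. Passing the full gene sequence through the same functions would be one way to check the Protein Length and GC content fields.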