Gene Vapar_3939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_3939 
Symbol 
ID7970368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp4178674 
End bp4179732 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content67% 
IMG OID644794525 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_002945819 
Protein GI239816909 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000297127 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAAA CCAAGGCCTT GAGGTCCGCG TTCGTCACCG GCGCGACCGG CCTTCTCGGC 
AACAACCTGG TGCGCGAACT GGTCGCGCGC GGCGTCTCGG TCAAGGCCCT TGTCCGCTCG
AAGGCCAAGG GCCAGCAGCA GTTTGCGGGC GTGAAGGGCG TCGAACTGGT GCTGGGCGAC
ATGGCCGATG CGCCCGCCTT CGCCGGCGCG CTGCAAGGCT GCGACGTGGT GTTCCACACC
GCCGCGTTCT TCCGGGACAA CTTCAAGGGC GGCAGCCACT GGCAAGAGCT CAAGCGCATC
AATGTGGACG GCACGCGGCA GCTCATCGAG CAGGCCTACG GTGCGGGCAT CCGGCGCTTC
GTCCAGACCT CGTCCATCGC GGTGCTCAAC GGCGAGCCGG GCGTGCCCAT GGACGAGACC
TGCCTGCGCG AGCTGGCCGA CGCCGGCGAC GACTACTACC GCAGCAAGAT CATGGCCGAC
CAGGTCGTGT CGGCCTTTCT GGGCACGCAC CCGGACATGC ATGCGAGCTT CGTCCTGCCC
GGATGGATGT GGGGACCGGC CGATATCGGC CCCACCTCGT CAGGACAATT CGTCAATGAC
GTGGTGCTCG GAAAACTGCC CGGGCTGGTG CCCGGCAGTT TTTCCGTCGT CGACGCCCGC
GATGTGGCCA GGGCGCAGAT CTCGGCCGCG GAGCACGGGC AGCGCGGCGA ACGCTACCTG
GCGGCCGGCC GCCACATGAC GATGCAGGAA CTGGTTCCCC TGGTGGGAAA GATCGCAGGC
ATCAAGACGC CGACGCGCCA TTTGCCCTTT CCGCTCCTGT ATCTGCTGGC GGCGGTGCAG
GAGCTCTACG CGCGGACCAC CGGCAAGCCC ATCCTGCTCA GCCTGGCCAC CGTGCGGCTG
ATGCGCAAGG AGGCGGGGCG CAGCCATTTC AATCACACGA AGAGCGAGCA GAAGCTTCAG
CTGAAGTTTC GCCCGGTCGA GCAGACCGTT GCCGACACGC TCGCCTGGTA TCGCGGCAAT
GGCTGGCTGC CCGGTGTGCC GGCCCGAACC GAATCCTGA
 
Protein sequence
MEKTKALRSA FVTGATGLLG NNLVRELVAR GVSVKALVRS KAKGQQQFAG VKGVELVLGD 
MADAPAFAGA LQGCDVVFHT AAFFRDNFKG GSHWQELKRI NVDGTRQLIE QAYGAGIRRF
VQTSSIAVLN GEPGVPMDET CLRELADAGD DYYRSKIMAD QVVSAFLGTH PDMHASFVLP
GWMWGPADIG PTSSGQFVND VVLGKLPGLV PGSFSVVDAR DVARAQISAA EHGQRGERYL
AAGRHMTMQE LVPLVGKIAG IKTPTRHLPF PLLYLLAAVQ ELYARTTGKP ILLSLATVRL
MRKEAGRSHF NHTKSEQKLQ LKFRPVEQTV ADTLAWYRGN GWLPGVPART ES