Gene Vapar_0100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_0100 
Symbol 
ID7971636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp99370 
End bp100695 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content70% 
IMG OID644790703 
Productfumarylacetoacetase 
Protein accessionYP_002942029 
Protein GI239813119 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID[TIGR01266] fumarylacetoacetase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.56864 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCCG CGCTCAACCA CACGCACGAC ACCAACGCCA GGAGCTGGGT CGCCGCTGCC 
AACACGGAAG GCACCGACTT TCCGATCCAG AACCTGCCCT ACGCAGTGTT CCGTCGCACC
GGCAGCCCGC AGCCGTTTCG CGGCGGCGTG GCCATTGGCG ACCAGGTGCT CGACATGGCC
GCGCTCTCCG CGCGCCAGCT GCTCGACGGA CTCGCGCTCG ATGCGGCCCG TGCCGCCGCG
CTGCCCGCGC TCAACGATTT CTTCGCGCTC GGCGGCACCG CCTGGCGTGC GCTGCGCCAC
GCGGTGTTCG CCTTGCTGCG CGACGATGCA CCGGCCGCCA CGGCCAAGCT CGTGCGCGAA
TGCCTGGTGC CGCAGGCCGA GGTCGAATAC ACCTTGCCCG CGCGCATCGG CGACTACACC
GACTTCTACA CCTCCATCGA CCATGCGCTG AACATCAGCC GCCTGATGAA CCCCGAAGGC
GACGTGACGC CCAACTTCCG CTGGATTCCG ACGGCCTACC ACGGGCGCGT GTCGACCATC
GGCATCAGCG GCCAGCGCTT TCACCGCCCG ATGGGGCAGA CCATGGCGCC GGGCGCCAAG
GCGCCCACCT TCCACGCCTG CGCGCGGCTC GACTACGAGC TCGAGCTTGG CATCTGGATC
GGCGAAGGCA ATGCGGCCGG CGAACCGATT CCGCTCGAAC GCGCGGAAGA GCACATCTTC
GGCATCTGCC TGCTCAACGA CTGGTCGGCG CGCGACATCC AGTTCTGGGA GATGGCGCCG
CTCGGTCCCT TCCTTGCGAA GAATTTCGCG ACCACCATCT CGCCGTGGAT CGTGACGATG
GAAGCGCTCG CGCCCTACCG CCAGGCCTGG ACGCGGCCCG CCGACGAGCC CCAGCCGCTG
GCCTACCTCG AAAGCCGCGG CAACCGCGAA GGCGGCGCGA TCGACATCCG GCTGGAGGTC
TGGCTCGAAA GCGAGAGGGC GCGCAGCGAG GGGAGCGGGC CTTCGCGCCT GTCGCGCACC
AGCTTCAGGC ACCAGTACTG GAGCGTGGCG CAGATGGTGG CGCACCACAC GGTGGGCGGC
TGCAGCCTGA ACCCGGGCGA CCTGTTCGGC AGCGGCACCA TCTCGGGACC GGGGCCGGGC
GAGGCCGGCG CGATCATCGA GCTGACGCGC GCCGCGCAGG ACCCGGTCAC GCTGGCCAAC
GGCGAGCAGC GCGGCTTCCT GGAAGACGGC GATGCAGTGC TGCTGCGCGG ATGGTGCGAG
AAGCCGGGCC ATGCGCGCAT CGGCTTCGGC GAGAGCCGCG GGACGGTGCT GCCCGCCAAG
GGCTGA
 
Protein sequence
MNAALNHTHD TNARSWVAAA NTEGTDFPIQ NLPYAVFRRT GSPQPFRGGV AIGDQVLDMA 
ALSARQLLDG LALDAARAAA LPALNDFFAL GGTAWRALRH AVFALLRDDA PAATAKLVRE
CLVPQAEVEY TLPARIGDYT DFYTSIDHAL NISRLMNPEG DVTPNFRWIP TAYHGRVSTI
GISGQRFHRP MGQTMAPGAK APTFHACARL DYELELGIWI GEGNAAGEPI PLERAEEHIF
GICLLNDWSA RDIQFWEMAP LGPFLAKNFA TTISPWIVTM EALAPYRQAW TRPADEPQPL
AYLESRGNRE GGAIDIRLEV WLESERARSE GSGPSRLSRT SFRHQYWSVA QMVAHHTVGG
CSLNPGDLFG SGTISGPGPG EAGAIIELTR AAQDPVTLAN GEQRGFLEDG DAVLLRGWCE
KPGHARIGFG ESRGTVLPAK G