Gene Vapar_4783 details

Gene Information

Locus tag: Vapar_4783
Symbol:
ID: 7970253
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012791
Strand:
Start bp: 5095332
End bp: 5096402
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 68%
IMG OID: 644795378
Product: fumarylacetoacetate (FAA) hydrolase
Protein accession: YP_002946654
Protein GI: 239817744
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway)
TIGRFAM ID:
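
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5095332
end_bp = 5096402
gene_length_bp = 1071
protein_length_aa = 356


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)  # prints: True True
```

Under these assumptions, 5096402 - 5095332 + 1 = 1071 bp, and 1071 / 3 = 357 codons, of which one is the stop codon, leaving 356 amino acids.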


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.155683
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACTCG CCACCCTGAA GGACGGCTCG CGCGACGGCC AGCTCGTCGT CGTCTCGCGC 
GACCTCACGC TCGCCCACTA CGCCACTGGC ATCGCCAGCC GGCTGCAGCA GGTGCTGGAC
GACTGGGGCT TCATGAGCCC GCAGCTGCAG GACCTGTACG ACGCGGTCAA CACCGGCCGC
GCACGCCATT CCTTCCCGTT CGACCCCGCC CAGTGCATGG CGCCGCTGCC GCGCGCCTAC
CAGTGGGCCG ACGGCTCGGC CTACCTGAAT CACGTCGAGC TGGTGCGCAA GGCGCGCAAT
GCCGAAGTGC CCGAGAGCTT CTACCAGGAC CCGCTGATGT ACCAGGGCGG CAGCGACGAC
TTCCTCGGCC CGACAGACGA CGTGGTCGTG CCCAGCGAAG CCATGGGCAT CGACTTCGAG
GCCGAGATCG CGGTGATCAC CGGCGACGTG AAGATGGGCG CCACGCCCGA CCAGGCGCTC
GACGGCATCC GCCTGGTGAT GCTGGCCAAC GACGTGAGCC TGCGCAACCT GATCCCTGCC
GAGCTGGCCA AGGGCTTCGG CTTCTTCCAG AGCAAGCCGG CCACCGCCTT CAGCCCCGTG
GCCGTGACGC TCGACGAGAT CGGCGAAGCC TGGCAGCACG GCCGCGTGCA CCTCACGCTG
CAAAGCAGCT GGAACGGCCG CAAGGTCGGC ATGTGCGACG CCGGGCCCGA GATGACCTTC
CATTTCGGCC AGCTCATCGC CCACATCGCC AAGACGCGCA ACGTGCGCGC CGGCAGCATC
GTCGGCAGCG GCACCGTGAG CAACAAGGGC GTGGAAAAGA GCGGCCAGAT GGACTGGCCC
AAGGGCTATT CGTGCATTGC CGAGAAGCGC TGCATCGAAA CCATCCAGGG CGGCGAGCCC
GTGACCGAAT TCATGAAGTT CGGCGACACC ATCCGCATCG AGATGAAGGG GCTCGATGGC
CGCTCGCTGT TCGGCGCGAT CGACCAGGAA ATCGTGTCGG CGGCGGGGCG GGCGAAGGTG
GCGCCGGTGT CGCTGGCGAA CCCGCAGGAC GACGACGGCG CCGAAGGCTG A
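
The GC content reported for this gene (68%) can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as shown here (blocks of 10 bases separated by whitespace); `gc_percent` is an illustrative helper, not a database function.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence, used as a small worked example.
sample = "ATGAAACTCG CCACCCTGAA"
print(gc_percent(sample))  # prints: 50.0
```

Passing the full gene sequence to `gc_percent` and rounding the result gives a value to compare with the GC content field of this record.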
 
Protein sequence
MKLATLKDGS RDGQLVVVSR DLTLAHYATG IASRLQQVLD DWGFMSPQLQ DLYDAVNTGR 
ARHSFPFDPA QCMAPLPRAY QWADGSAYLN HVELVRKARN AEVPESFYQD PLMYQGGSDD
FLGPTDDVVV PSEAMGIDFE AEIAVITGDV KMGATPDQAL DGIRLVMLAN DVSLRNLIPA
ELAKGFGFFQ SKPATAFSPV AVTLDEIGEA WQHGRVHLTL QSSWNGRKVG MCDAGPEMTF
HFGQLIAHIA KTRNVRAGSI VGSGTVSNKG VEKSGQMDWP KGYSCIAEKR CIETIQGGEP
VTEFMKFGDT IRIEMKGLDG RSLFGAIDQE IVSAAGRAKV APVSLANPQD DDGAEG
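
The protein sequence can be checked against the gene sequence by translating the coding sequence codon by codon. The following is a minimal sketch, assuming the amino-acid assignments of translation table 11 match those of the standard genetic code (the table differs in its permitted start codons, which does not matter here because the gene starts with ATG); `translate` and `clean` are illustrative helpers.

```python
# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(sequence: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(sequence.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence and the matching start of the protein sequence.
first_line_dna = "ATGAAACTCG CCACCCTGAA GGACGGCTCG CGCGACGGCC AGCTCGTCGT CGTCTCGCGC"
protein_start = "MKLATLKDGS RDGQLVVVSR"
print(translate(first_line_dna) == clean(protein_start))  # prints: True
```

The same call on the full gene sequence yields a string that can be compared with the full protein sequence after `clean` is applied to both.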