Gene Vapar_5038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_5038 
Symbol 
ID7974156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp5358761 
End bp5361109 
Gene Length2349 bp 
Protein Length782 aa 
Translation table11 
GC content68% 
IMG OID644795631 
Productsalicylyl-CoA 5-hydroxylase 
Protein accessionYP_002946906 
Protein GI239817996 
COG category[C] Energy production and conversion 
COG ID[COG1902] NADH:flavin oxidoreductases, Old Yellow Enzyme family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCTTC AACGCATCCT GTGCATCGGC GGCGGCCCCG CGGGCCTGTA CTTCGCGCTG 
CTGATGAAGG CGCGCAACCC CGCGCTCGAG ATCACCGTGG TGGAGCGCAA CCGCCCCTTC
GACACCTTCG GCTGGGGCGT GGTGCTCAGC GACCAGACGC TCGCCAACCT GCGCGCGGCC
GACGCCGAGA CCGCGGCGCT GATCGGCAGC GAGTTCCACC ACTGGGACGA CATCCAGGTG
TTCTTCAAGG GCCGGCAGGT GCGCTCGGGC GGCCATGGCT TCTGCGGCAT CGGCCGCAAG
CGCCTCCTGA ACATCCTGCA GGAGCGCTGT CTCGCGCTCG GCGTGAAGCT GGTCTTCGAG
ACCGACGCCA CCGACGACCA GGCCATCGCC GCCCAGTACA ACGCCGACCT GGTGATTGCG
AGCGACGGCC TGAACAGCCG CATCCGCCAG CGCTATGCCG ACGTGTTCAA GCCCGACGTC
GACCTGCGCA ACTGCCGCTT CGTGTGGCTG GGCACGCACC AGACCTTCGA CGCCTTCACC
TTCGCCTTCG AACAGACAGA GCACGGCTGG TTCCAGGCCC ATGCCTACCA GTTCGACGAC
AAGACCTCGA CCTTCATCGT CGAGACGCCC GAGGCCGTGT GGAAAGCCCA CGGCCTCGAC
CAGATGGAGC AGCCCGAGGC CATCGCCTTC TGCGAGAAGC TGTTCGCCAA GTACCTGGGC
GGCCATTCGC TCATCAGCAA TGCCACGCAC CTGCGCGGCT CGGCCAACTG GATCCGTTTT
CCGCGCGTGG TGTGCGAGCG CTGGACCCAC CGCATCGACG TCGAGGGCCG CTCGGTGCCC
GTGGTGCTGA TGGGCGACGC GGCGCACACG GCGCACTTCT CGATCGGCTC GGGCACCAAG
CTCGCGCTGG AAGACGCGAT CGACCTGGCC AACGAGTTCG CGCAGGGCGG CACCGTCGAC
CACGTGCTCG AGGGCTACGA GGCACGCCGC AGCGTCGAGG TGCTCAAGAT CCAGAACGCC
GCGCGCAACT CCACCGAATG GTTCGAGAAC GTACCGCGCT ACACCGGCAT GCAGATCGAG
CAGTTCGCTT ATTCGCTGCT CACGCGCTCG CAGCGCATCA GCCACGAGAA CCTGCGTGTG
CGCGACACGC AATGGCTCGG CGGCTACGAG CAGTGGCTCG CCGGCGGCAA GCCCGTTGCG
CCCATGCTGA TGCCGCTGCA GGTGCGCGGC CTCACGCTGA AGAACCGCAT CCTGGTGTCG
CCGATGGCCA CCTACAGCGC GGTCGACGGC GTGCCGCAGG ACTTTTTGCT GGTGCACCTG
GGCGCCCGCG CGCTCGGCGG CGCCGCCATG GTGTTCGTCG AGATGACGAG CCCGACGGCC
GAAGGCCGCA TCACCCCCGG CTGCACGGGC CTGTACAACG ACGCGCAGCA GGCCGCGTTC
AAGCGCATCG TCGACTTCGT GCACACGCAG TCGAGCGCCA AGATCGCCAT GCAGCTCGGC
CACAGCGGCC CCAAGGGCTC GACCCGCGTG GGCTGGGAAG GCACCGACGA GCCGCTCGAA
AGCGGCAACT GGCCGCTGCT GGCCGCCAGC GCGGTGGCCT ACGGCGAACA GAACCAGCTG
CCGGCCGCGA TGACGCGCGA GCAGATGGAC GCCATGCGCG ACCAGTTCGT GGCGTCCACG
CGCCGCGCCG CCGAATGCGG CTTCGACTGG CTCGAGCTGC ACTGCGCGCA CGGCTACCTG
CTGTCGGCCT TCATCAGCCC GCTCACCAAC CTGCGCACCG ACGAATACGG CGGCGGCATC
GAAGCGCGCT GCCGCTATCC GCTCGAGGTG TTCCGCGCGA TGCGCGCCGC CTGGCCGCAA
GACAAGCCGA TGAGCGTGCG CATCTCGGCG CACGACTGGG CCCCTGGCGG CAACACCGAT
GCCGATGCCG TGGTGGTGGC GCGCCTCTTC AAGGAAGCCG GCGCCGATTT CATCGACGTG
TCCTCCGGCC AGACCACGCG CACCGCCAAG CCGGTGTACG GCCGCATGTA CCAGACGCCG
TTCTCGGACC GCATCCGCAA CGAGGTCGGC ATCTCGACCA TCGCGGTGGG CGCCATCACC
GACGCCGACC AGGCCAACAG CATCATCGCG GCCGGCCGCG CCGACCTCTG CGCCATCGCC
CGCCCGCACC TTGCCGATCC CGCATGGACC CTGCACGAGG CCGCCAAGCT GCAAAGCCGC
GACGTCGACT GGCCGAAGCA GTACCTGAGC GGGCGCGACC AGATGTACCG CGAGATCGCC
AAGCAGCAGC AGATCGCCGC GGCTGCGAAC GCCGCATTGG CTGCCTCCAA CAACGAGGAG
ACGCACTGA
 
Protein sequence
MDLQRILCIG GGPAGLYFAL LMKARNPALE ITVVERNRPF DTFGWGVVLS DQTLANLRAA 
DAETAALIGS EFHHWDDIQV FFKGRQVRSG GHGFCGIGRK RLLNILQERC LALGVKLVFE
TDATDDQAIA AQYNADLVIA SDGLNSRIRQ RYADVFKPDV DLRNCRFVWL GTHQTFDAFT
FAFEQTEHGW FQAHAYQFDD KTSTFIVETP EAVWKAHGLD QMEQPEAIAF CEKLFAKYLG
GHSLISNATH LRGSANWIRF PRVVCERWTH RIDVEGRSVP VVLMGDAAHT AHFSIGSGTK
LALEDAIDLA NEFAQGGTVD HVLEGYEARR SVEVLKIQNA ARNSTEWFEN VPRYTGMQIE
QFAYSLLTRS QRISHENLRV RDTQWLGGYE QWLAGGKPVA PMLMPLQVRG LTLKNRILVS
PMATYSAVDG VPQDFLLVHL GARALGGAAM VFVEMTSPTA EGRITPGCTG LYNDAQQAAF
KRIVDFVHTQ SSAKIAMQLG HSGPKGSTRV GWEGTDEPLE SGNWPLLAAS AVAYGEQNQL
PAAMTREQMD AMRDQFVAST RRAAECGFDW LELHCAHGYL LSAFISPLTN LRTDEYGGGI
EARCRYPLEV FRAMRAAWPQ DKPMSVRISA HDWAPGGNTD ADAVVVARLF KEAGADFIDV
SSGQTTRTAK PVYGRMYQTP FSDRIRNEVG ISTIAVGAIT DADQANSIIA AGRADLCAIA
RPHLADPAWT LHEAAKLQSR DVDWPKQYLS GRDQMYREIA KQQQIAAAAN AALAASNNEE
TH