Gene Vapar_4766 details

Gene Information

Locus tag: Vapar_4766
Symbol:
ID: 7971776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012791
Strand:
Start bp: 5063741
End bp: 5064868
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 64%
IMG OID: 644795351
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_002946637
Protein GI: 239817727
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
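
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete, unspliced, and ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(5063741, 5064868)
protein = expected_protein_length_aa(length)
print(length, protein)  # prints: 1128 375
```

Under these assumptions the computed values agree with the listed Gene Length (1128 bp) and Protein Length (375 aa).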


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.578295
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCACA CCGACGCGCC CCCCTTCACG CCCTGGGAGA ACCCCATGGG AACCGACGGC 
TTCGAATTCA TCGAATACGC GGCGCCGGAT CCGGTCGCGA TGGGCAAGGT GTTCGAGCGC
ATGGGCTTCA AGGCCGTGGC GCGGCACCGC CACAAGAACG TGCTGCTGTA CCGCCAGGGC
ACGATCAATT TCATCGTCAA CGCCGAGCCC GACTCCTTCG CGCAGCGCTT TGCGCGCGAA
CACGGCCCGA GCGTCTGCGC CATCGCCTTC CGCGTGCAGG ACGCCAAGCA GGCCTACGAA
CGCGCGATCT CGCTCGGCGC ATGGGGCTTT GCCGACAAGG CGGGTCCGGG CGAATTGAAC
ATTCCCGCCA TCAAGGGCAT CGGCGACAGC CTGATCTACC TGGTCGACCG CTGGCCCGGC
AAGAACGGCG CGAAGCCGGG CGACATCGGC AACATCGGCT TCTACGACGT CGACTTCGAA
CCGCTGCCGG GCGTGGCCTC GCAGGATGCG CTGGCGCCCA AGGGCAACGG CCTGACCTAC
ATCGACCATC TCACGCACAA TGTGTACCGC GGCCGCATGA ACGTCTGGGC CGGCTTCTAC
GAGAAGCTCT TCAACTTCCG CGAGATCAAG TACTTCGACA TCGAAGGCCA GGTCACGGGC
GTGAAGAGCA AGGCCATGAC CAGCCCCTGC GGCAAGATCC GCATCCCCAT CAATGAAGAG
GGCAAGGAGC AGGCCGGCCA GATCCAGGAG TACCTGGACA TGTACCGCGG CGAAGGCATC
CAGCACATCG CGATGGGCTC GGACAACCTG TACGAGACCG TCGACGCGCT GCGCGCCAAC
GGCGTCACCC TGCTCGACAC CATCGACACC TACTACGAGC TGGTCGACAA GCGCATTCCC
GGCCATGGCG AAAGCGTGGC CGAGCTGCAG AAGCGCAAGA TCCTGATCGA CGGCAAGAAG
GACGCGCTGC TGCTGCAGAT CTTCAGCGAG AACCAGCTCG GCCCGATCTT CTTCGAGTTC
ATCCAGCGCA AGGGCGACGA CGGTTTCGGC AACGGCAACT TCAAGGCGCT GTTCGAAAGC
ATCGAGCTCG ACCAGATGCG CCGCGGCGTG CTGTCGGCCG CAAAATAA
 
Protein sequence
MSHTDAPPFT PWENPMGTDG FEFIEYAAPD PVAMGKVFER MGFKAVARHR HKNVLLYRQG 
TINFIVNAEP DSFAQRFARE HGPSVCAIAF RVQDAKQAYE RAISLGAWGF ADKAGPGELN
IPAIKGIGDS LIYLVDRWPG KNGAKPGDIG NIGFYDVDFE PLPGVASQDA LAPKGNGLTY
IDHLTHNVYR GRMNVWAGFY EKLFNFREIK YFDIEGQVTG VKSKAMTSPC GKIRIPINEE
GKEQAGQIQE YLDMYRGEGI QHIAMGSDNL YETVDALRAN GVTLLDTIDT YYELVDKRIP
GHGESVAELQ KRKILIDGKK DALLLQIFSE NQLGPIFFEF IQRKGDDGFG NGNFKALFES
IELDQMRRGV LSAAK
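
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction for comparison with the listed GC content; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from the record, used only as sample input.
first_line = "ATGAGCCACA CCGACGCGCC CCCCTTCACG CCCTGGGAGA ACCCCATGGG AACCGACGGC"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.2%}")
```

Passing the full gene sequence text to `clean_sequence` in the same way would give the whole-gene GC fraction.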