Gene Vapar_3320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_3320 
Symbol 
ID7970558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp3499187 
End bp3501079 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content70% 
IMG OID644793905 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_002945204 
Protein GI239816294 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATCGCT CCATTGCCAC CGTCTCCCTC AGCGGCACGC TTCGCCAGAA GCTCGAAGCC 
GTTTCGGCCG CAGGCTTCGA CGGCATCGAG CTGTTCGAGG CCGACTTCGT CAACTTCAAG
GGCAGCGCCG CCGAGCTGGG CCGCATCGCC TCGGACCTCG GCCTCTCGAT CGACCTGTAC
CAGCCGTTCC GCGATTTCGA GGGCATGCCC GAGGCGCAGT TCCGCCGCAG CCTGGAGCGC
GCCGAGCGCA AGTTCGACCT GATGGAGGCC ATGGGCGCGC CGCTGATGCT GTGCTGCTCC
AACACCTCGC CGCCCTCGGT GAACGACCCG GCGCTGGCCG CCGCGCAGCT GCACGAGCTC
GCGGAACGCG CCGCGCGCCG CAACCTTCGC GTGGGCTTCG AGGCGCTGGC CTGGGGCCGC
CACACCTCGC TGTACGGCCA GGCCTGGAAC ATCGTGAAGC AGGCCGACCA CCCGCACCTT
GGTTTGATCC TCGACAGCTT CCACACGCTG TCGCTGAAGG ACGATGCCAC GGGCATTGCC
GCCATTCCGG GCGACAAGAT CTTCTTCCTG CAGATGGCCG ACGCGCCGCT GCTGTCGATG
GACGTGCTGC AGTGGGCGCG CCACCACCGC TCGTTCCCGG GGCAGGGCGA TCTCGACGTG
ATCGGCTTCT TCGAGCAGGT GCTGCGCGCG GGCTACACCG GCGCGCTGTC GCTGGAAATC
TTCAACGACC TGTTCCGCGA AACGCCCAAC CGGCGCACCG CGGTGGACGC CATGCGCTCG
CTGCTGTACC TCGAAAGCGA GGCGCGGCAG CGGCTGGCTT CCGCTGCCGA TGCGCCGGCA
CTGAAGCCGC AGGTCGAGCT GTTCAGCCCG CCGCCCGTGC CGGCGCTCTC GGGCCTGTCG
TTCATCGAAT TCGCGGCCGA CGAGGCCTCG GCCGGCACGC TCGGCGCGCT GCTGGAGCAG
CTCGGCTTCC GCCGCGTCGG CCGGCACCGC TCCAAGGCCG TGGCGCTGTA CCGCCAGGGC
GAAATCAACC TGATCGTCAA TGCGCAGCCG GATTCGTTCG CGCGCAGCCG CTTCGAGGCG
CACGGCACCT CGGTGTGCGC GCTGGGCGTG CGTTGCGCCG ATCCGCAGGC CGCGGTCGAG
CGCGCCACTG CCATGCGCTC GCAGCGCCAC GACAGCCCGG TCGGCCCGAA CGAATTGCGC
GTGCCGGCCA TCGTGGCGCC GGGCGGCAAC CTGATCCACT TCGTGCCCGA GGCCCTGGGC
ACCAACGGCC TGTACGAGGC CGACTTCATC CTCGAGCAAG ACGCCGCGGC CGATGTCCGC
GGCGCCGGGC TCGCGCAGGT CGACCACGTG GCGCTCGGCC TTGCGCTCGA CCAGCTCGAC
ACCTGGGTGC TGTTCACGCG CGCCGTGCTG GGGCTGGAGC CCGGCGAAAG CCTGGAGCTG
GCCGACCCCT TCGGCCTGAT CCGCAGCCGC GGCGTGGCCA ACGCCGATCG CAGCGTGCGG
CTGGTGCTCA ACGTGTCGCT GAGCCAGCGC ACCCGCACCG CGCGCACGCT GAGCCTGACC
GGCGGCGGCG CGGTGCACCA CATTGCGCTG CGCTGCGACG ACATCTTCGA GAGCGTGGCG
CGGCTGCGCG CCGCGGGCAC GCGCTTCGTG CCCATTTCGG ACAACTACTA CGACGACCTG
GCCACGCGCA TCGACCTGGA CCCTGTGCTG CTCGCAAAGC TGCGCGCCGC GGGCGTGCTG
TTCGACCGCT CCCCCACGGG CGACTACCTG CACATCTACA CCGAGAACAT CGAAGGCGGA
CTCTTCTTCG AACTGGCACA GCGCACCGCG GGCTACGACG CCTACGGCGC GCTCAACGCA
CCCGCGCGCA TGGCATCGCA GGCGCAGCAA TAA
 
Protein sequence
MHRSIATVSL SGTLRQKLEA VSAAGFDGIE LFEADFVNFK GSAAELGRIA SDLGLSIDLY 
QPFRDFEGMP EAQFRRSLER AERKFDLMEA MGAPLMLCCS NTSPPSVNDP ALAAAQLHEL
AERAARRNLR VGFEALAWGR HTSLYGQAWN IVKQADHPHL GLILDSFHTL SLKDDATGIA
AIPGDKIFFL QMADAPLLSM DVLQWARHHR SFPGQGDLDV IGFFEQVLRA GYTGALSLEI
FNDLFRETPN RRTAVDAMRS LLYLESEARQ RLASAADAPA LKPQVELFSP PPVPALSGLS
FIEFAADEAS AGTLGALLEQ LGFRRVGRHR SKAVALYRQG EINLIVNAQP DSFARSRFEA
HGTSVCALGV RCADPQAAVE RATAMRSQRH DSPVGPNELR VPAIVAPGGN LIHFVPEALG
TNGLYEADFI LEQDAAADVR GAGLAQVDHV ALGLALDQLD TWVLFTRAVL GLEPGESLEL
ADPFGLIRSR GVANADRSVR LVLNVSLSQR TRTARTLSLT GGGAVHHIAL RCDDIFESVA
RLRAAGTRFV PISDNYYDDL ATRIDLDPVL LAKLRAAGVL FDRSPTGDYL HIYTENIEGG
LFFELAQRTA GYDAYGALNA PARMASQAQQ