Gene Avi_5449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_5449 
SymbolhppD 
ID7381544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011988 
Strand
Start bp450460 
End bp452307 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content60% 
IMG OID643649051 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_002547288 
Protein GI222106497 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG1082] Sugar phosphate isomerases/epimerases
[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTGGCGGCCA TTGCCAAGGC TGGCTTCAGC GGCGTTGAAA TTTTCGAGAA CGACTTCCTG 
ACCTATGACG CCTCCCCCCG GGATGTGGCG AAAATGGTTG CCGACCACGG TCTCGACATC
ACCCTGTTCC AGCCCTTCCG CGATTTCGAA GGCATGCCGG AACTGCACCG CGCCCGCGCT
TTTGAGCGGG CCGAGCGCAA ATTCGAGATC ATGGATGAGC TTGGCACCGA CCTGATGCTG
ATCTGCTCAA ATGTCTCTCC CATCTCGCTG GGCGGCATCG ACCGGGCCGC CGCCGATTTC
CAGGAGCTGG GCGAACGCGC CGCAAAGCAT GGCGTCCGCG TCGGCTACGA GGCACTTGCC
TGGGGCCGCC ACGTCAACGA CCACAGGGAT GCCTGGGAAG TCGTCCGCCG GGCAAACCAT
GCCAATGTCG GCCTGATCCT TGATAGTTTT CACACCCTGT CGCGCAAGAT CGATCCAAAC
TCGATCCGTT CCATTCCCGG CGACAAGATC TTCATCGTCC AACTGGCCGA TGCGCCGCTT
TTCGACATGG ATCTGCTCTA CTGGAGCCGC CATTTCCGCA ACATGCCCTG CGAAGGCGAC
TTGCCGGTGG TCGATTTCAT GCGCGCTGTG GCCGCCACAG GCTATACCGG GCCGCTATCC
CTGGAGATTT TCAACGACCA GTTCCGGGGA GGCTCACCGC GGGCCATTGC CGAGGACGGC
CACCGCTCGC TGGTCTACCT CATGGACCAA GTCCAACGCC TCGAACCCGA TATCCGGCTC
AGCGCCCCGG CCATGCCAGC CCCTGTCGAA ACCCAGGGCG TCGAATTCGT GGAATTTGCG
ACGTCGGTCG AGGAAAAACA GGATCTGGCA GCATTTTTAG CGACGCTCGG CTTTTCGAAA
ACCGCGACCC ATCGCAACAG GGATCTTGAC CTTTACACCC AGGGCGACAT CCGCATTCTC
ATCAATACCG ATACGACAAA CAACAGTTTT GCCGGCGCCT CCTATGCAAT CCACGGCACA
AGTGCCTACG CCTTCGGCAT GAAGGTGGGG CACGCCGAAG ACGCCTTGAA GCGCGCCACG
GCGCTGGGGG CAACGAGCTT CTCAGAACCG CGCAAACCGG GCGAAGTACC CGTGCCCGCC
ATTCAAGGCG TGAGCAATGG CGTCATTTAT TTCCTTGATG ACACGCCTGC CCTGTCCGGC
ATCTGGAAAC AGGAATTCAA AGACGTAGAC GCCGATCAGG CTCCGGCAAA TACCCGCCTG
ACCCGTATCG ACCATCTCGC CCAGACAACC CGCTATGACG AGATGCTGAC ATGCCTGCTG
TTTTACGGCT CGATCTTCGC CACGCGGCGC ACGCCCATGG TCGATGTGGT CGATCCGGGC
GGGCTGGTGC GCAGCCAAGC GATCGAAAGC AAACCAGATC CTCGTTTCAG GGTGACGTTG
AACGGCGCCG ATAACCGGAA AACCGTCGCC GGAAAGTTTC TCGAAGAAGG CTTCGGCACC
AGCATCCAGC ATATCGCCCT GGCGACCGAC GATATCTTCG CGACGGCGCA GGCGCTATCG
GCCTGCGGCT TCCAGGCGCT GACTATCTCG CGCAACTATT ATGACGATTT GGAAGCCCGC
TTCGGTCTGG AACCGGATTT TGCCGATGCA CTGCGTTCGG CCAGTATCCT TTACGACCGC
GACGATAATG GCGAGTATTT CCAAATCTAC AGCCGGACCT TCGGTGAGGG CTTTTTCTTC
GAAATCGTCG AGAGGCGCGG CGCCTATGGT GGTTATGGTG CGATGAACGC CCCGTTCCGT
ATAGCAGCAC AAAGACGGCA ACTGCGCCCG GATGGCGTTC CGAGATAA
 
Protein sequence
MAAIAKAGFS GVEIFENDFL TYDASPRDVA KMVADHGLDI TLFQPFRDFE GMPELHRARA 
FERAERKFEI MDELGTDLML ICSNVSPISL GGIDRAAADF QELGERAAKH GVRVGYEALA
WGRHVNDHRD AWEVVRRANH ANVGLILDSF HTLSRKIDPN SIRSIPGDKI FIVQLADAPL
FDMDLLYWSR HFRNMPCEGD LPVVDFMRAV AATGYTGPLS LEIFNDQFRG GSPRAIAEDG
HRSLVYLMDQ VQRLEPDIRL SAPAMPAPVE TQGVEFVEFA TSVEEKQDLA AFLATLGFSK
TATHRNRDLD LYTQGDIRIL INTDTTNNSF AGASYAIHGT SAYAFGMKVG HAEDALKRAT
ALGATSFSEP RKPGEVPVPA IQGVSNGVIY FLDDTPALSG IWKQEFKDVD ADQAPANTRL
TRIDHLAQTT RYDEMLTCLL FYGSIFATRR TPMVDVVDPG GLVRSQAIES KPDPRFRVTL
NGADNRKTVA GKFLEEGFGT SIQHIALATD DIFATAQALS ACGFQALTIS RNYYDDLEAR
FGLEPDFADA LRSASILYDR DDNGEYFQIY SRTFGEGFFF EIVERRGAYG GYGAMNAPFR
IAAQRRQLRP DGVPR