Gene Avin_34750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_34750 
SymbolhppD 
ID7762370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3547995 
End bp3549074 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content64% 
IMG OID643806341 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_002800599 
Protein GI226945526 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCCA TACCGGAAAG CCCAGCCTTC AACCCCATCG GCACCGACGG CTTCGAGTTC 
GTCGAATTCA CCGCGCCGGA CGCCGAGGGC ATCGCCCGGC TGCGCGGGCT GTTCGTCGCC
ATGGGTTTCA CCGAGACCGC CCGACACCGT TCCAAGGAAG TCTTCCTGTT CCAGCAGAAC
GACATCAATT TCGTCCTCAA CGGCAGTCCC GACGGGCCGG TGCGGGCTTT CGCCGAGGAA
CACGGACCGA GCGCCTGCGC CATGGCCTTC CGGGTCGGCA ACGCCGTCCA GGCCGCCGAC
TACGCGCAGC GCCAGGGGGC GAGGCTCCTG GGCAGCCACG CCAATTTCGG CGAACTGAAC
ATCCCCTGCA TCGAAGGCAT CGGCGGTTCG TTGCTCTACC TGGTGGATCG CTACGGCGAA
CGCAGCATCT ACGATGTCGA CTTCGACTTC ATCGCGGGGC GCACGGCGCG GGACAACGCG
GTCGGCCTGA GCGTCATCGA CCATCTGACC CATAACGTCG CACGTGGGCA GATGGATGTC
TGGGCCGGTT TCTACGAGCG CATCGCCGGC TTCCGCGAGA CGCGCTACTT CGATATCGAG
GGCAGGCATA CGGGCCTTCT GTCCCGCGCC ATGACCGCGC CCTGCGGGAA GATCCGCATC
CCGATCAACG AGTCGGCCGA CGACCATTCG CAGATCGCCG AATTCATCCG CGATTACCAT
GGCGAGGGCA TCCAGCACAT CGCCCTGGCC ACCGACGACA TCTACGCCAC GGTGCGCGCG
CTGCGCGCCA GGGACGTGGC CTTCATGCAG ACCCCGGATA CCTACTACGA GAAGGTCGAT
ACCCGGGTTC CCGGCCATGG CGAGCTACTG GAGTCGCTGC GCGAGCTGAA CATCCTGATC
GACGGCCGTG TCGGCCGCGA AGGGCTGTTG CTGCAGATCT TCACCCGCCC GCTGATCGGC
CCGATCTTCT TCGAGATCAT TCAGCGCAAG GGCAACCAGG GCTTCGGCGA AGGCAACTTC
CGGGCTCTGT TCGAATCCAT CGAGGAGGAC CAGATCCGCC GTGGAGTGTT GAAGGGCTAG
 
Protein sequence
MNAIPESPAF NPIGTDGFEF VEFTAPDAEG IARLRGLFVA MGFTETARHR SKEVFLFQQN 
DINFVLNGSP DGPVRAFAEE HGPSACAMAF RVGNAVQAAD YAQRQGARLL GSHANFGELN
IPCIEGIGGS LLYLVDRYGE RSIYDVDFDF IAGRTARDNA VGLSVIDHLT HNVARGQMDV
WAGFYERIAG FRETRYFDIE GRHTGLLSRA MTAPCGKIRI PINESADDHS QIAEFIRDYH
GEGIQHIALA TDDIYATVRA LRARDVAFMQ TPDTYYEKVD TRVPGHGELL ESLRELNILI
DGRVGREGLL LQIFTRPLIG PIFFEIIQRK GNQGFGEGNF RALFESIEED QIRRGVLKG