Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_34750 |
Symbol | hppD |
ID | 7762370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 3547995 |
End bp | 3549074 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643806341 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_002800599 |
Protein GI | 226945526 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGCCA TACCGGAAAG CCCAGCCTTC AACCCCATCG GCACCGACGG CTTCGAGTTC GTCGAATTCA CCGCGCCGGA CGCCGAGGGC ATCGCCCGGC TGCGCGGGCT GTTCGTCGCC ATGGGTTTCA CCGAGACCGC CCGACACCGT TCCAAGGAAG TCTTCCTGTT CCAGCAGAAC GACATCAATT TCGTCCTCAA CGGCAGTCCC GACGGGCCGG TGCGGGCTTT CGCCGAGGAA CACGGACCGA GCGCCTGCGC CATGGCCTTC CGGGTCGGCA ACGCCGTCCA GGCCGCCGAC TACGCGCAGC GCCAGGGGGC GAGGCTCCTG GGCAGCCACG CCAATTTCGG CGAACTGAAC ATCCCCTGCA TCGAAGGCAT CGGCGGTTCG TTGCTCTACC TGGTGGATCG CTACGGCGAA CGCAGCATCT ACGATGTCGA CTTCGACTTC ATCGCGGGGC GCACGGCGCG GGACAACGCG GTCGGCCTGA GCGTCATCGA CCATCTGACC CATAACGTCG CACGTGGGCA GATGGATGTC TGGGCCGGTT TCTACGAGCG CATCGCCGGC TTCCGCGAGA CGCGCTACTT CGATATCGAG GGCAGGCATA CGGGCCTTCT GTCCCGCGCC ATGACCGCGC CCTGCGGGAA GATCCGCATC CCGATCAACG AGTCGGCCGA CGACCATTCG CAGATCGCCG AATTCATCCG CGATTACCAT GGCGAGGGCA TCCAGCACAT CGCCCTGGCC ACCGACGACA TCTACGCCAC GGTGCGCGCG CTGCGCGCCA GGGACGTGGC CTTCATGCAG ACCCCGGATA CCTACTACGA GAAGGTCGAT ACCCGGGTTC CCGGCCATGG CGAGCTACTG GAGTCGCTGC GCGAGCTGAA CATCCTGATC GACGGCCGTG TCGGCCGCGA AGGGCTGTTG CTGCAGATCT TCACCCGCCC GCTGATCGGC CCGATCTTCT TCGAGATCAT TCAGCGCAAG GGCAACCAGG GCTTCGGCGA AGGCAACTTC CGGGCTCTGT TCGAATCCAT CGAGGAGGAC CAGATCCGCC GTGGAGTGTT GAAGGGCTAG
|
Protein sequence | MNAIPESPAF NPIGTDGFEF VEFTAPDAEG IARLRGLFVA MGFTETARHR SKEVFLFQQN DINFVLNGSP DGPVRAFAEE HGPSACAMAF RVGNAVQAAD YAQRQGARLL GSHANFGELN IPCIEGIGGS LLYLVDRYGE RSIYDVDFDF IAGRTARDNA VGLSVIDHLT HNVARGQMDV WAGFYERIAG FRETRYFDIE GRHTGLLSRA MTAPCGKIRI PINESADDHS QIAEFIRDYH GEGIQHIALA TDDIYATVRA LRARDVAFMQ TPDTYYEKVD TRVPGHGELL ESLRELNILI DGRVGREGLL LQIFTRPLIG PIFFEIIQRK GNQGFGEGNF RALFESIEED QIRRGVLKG
|
| |