Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_50170 |
Symbol | hppD |
ID | 7763868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 5085206 |
End bp | 5086243 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643807848 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_002802082 |
Protein GI | 226947009 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACA AGAACAATCC CATCGGCCTG CGCGGCATCG AGTTCACCGA ATTCACCGGC GGGACGCTCG AACAGCTCGA CGCCCTGTTC TGGGCCTTCG GCTTCTCGAA GAAGTATCGT CACCCGAAGC TGGACATCAG CATCTACAAC CAGAACGGCA TCAACTTCCT GCTGAACGGC GAGCGTGAAG GCTTCTCCGG TCATTTCGCC AAACTGCACG GCCCGTCGAT CAGCTCCATG GGCTGGCGTG TCGACGATGC CGCATTCGCC AGACAGGAAG CCGTCCGTCG CGGCGCGCGC GCGGCCGATC CCAAGGACTG CGACCTGCCC TACCCGGCCA TCTACGGCAT CGGCGACAGC CTGATCTATT TCATCGAGCG TTTCGGCGCC AGGGGCTCGA TCTACGCAAC CGACTTCGTT CCCCACGAGC AGGCCCGGCT CCAGCCGGAC AAAGGCTTCC TGGAGATCGA CCACCTGACC AACAACGTGC CGCAAGGCCA GATGGAGCAG TGGGGGGCCT TCTACAAGGA GATCTTCGGT TTCACCGAAG TGCGCTACTT CGACATCAAG GGCGTGAAGA CCGGCCTGAC CAGCTATGCG CTGCGCTCGC CGGACGGCAG CTTCTGCATC CCGATCAACC AGCCGAAGGA CGACAAGGAC CAGATCTCCG AGTACCTGGC CGAGTACAAC GGCCCGGGCG TACAGCACCT GGCCTTCAGC ACCAACGACA TCCTCGCCTC GCTGGACGCC ATGAAGGGCG GCCCGATCGA GATGCTCGAC ATCGACGCGA ACTACTACGA CAACGTGTTC CAGCGCCTGC CCAACGTGCG CGAGGACAAG GAGCGCATCC GCGCCCACCA CGTACTGGTC GACGGCGACC AGGACGGCTA TCTGCTACAG ATATTCACCA AGAACATCAT CGGCCCGATC TTCGTCGAGA TCATCCAGCG CGAGAACAAC CTGAGCTTCG GCGAAGGCAA CTTCGGCGCC CTGTTCCGCT CCATCGAGAA GGACCAGGAA CGTCGCGGCG TAATCTGA
|
Protein sequence | MSDKNNPIGL RGIEFTEFTG GTLEQLDALF WAFGFSKKYR HPKLDISIYN QNGINFLLNG EREGFSGHFA KLHGPSISSM GWRVDDAAFA RQEAVRRGAR AADPKDCDLP YPAIYGIGDS LIYFIERFGA RGSIYATDFV PHEQARLQPD KGFLEIDHLT NNVPQGQMEQ WGAFYKEIFG FTEVRYFDIK GVKTGLTSYA LRSPDGSFCI PINQPKDDKD QISEYLAEYN GPGVQHLAFS TNDILASLDA MKGGPIEMLD IDANYYDNVF QRLPNVREDK ERIRAHHVLV DGDQDGYLLQ IFTKNIIGPI FVEIIQRENN LSFGEGNFGA LFRSIEKDQE RRGVI
|
| |