Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_0570 |
Symbol | hpd |
ID | 5154636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 573876 |
End bp | 574994 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640555581 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_001236754 |
Protein GI | 148252169 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.150447 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCCGT TTCCGCACGA TGCGCCGCCT GCCACCCTCT CTGCCGACAA TCCGATGGGC ACCGACGGCT TCGAGTTTGT CGAATACGCC CATCCCGATG CGGCTCAGCT GCACGCGCTG TTCAAGCTGA TGGGCTTCGC TCCGGTCGCC CGCCACAAGA CCAAGGCGAT CACGGTCTAC CGCCAGGGCG ACATCAACTA TCTCGTCAAC GAGGAGCCCG GCAGCCACGG CCACGACTTC GTCGCCGCGC ACGGTCCCTG CGCACCGTCG ATGGCGTTCC GCGTGGTCGA TGCGAAGAAA GCCTATGCGC GGGCGGTCGC GCTCGGGGCT GAGCCGGCGG ATCTGCAGCC TTCGCAGAAG GCGCTCGACG TTCCCGCGAT CAAGGGGATC GGCGGCAGCG TGCTGTATCT GGTCGACCGC TACGGCGCCA AGGGCTCGGC CTATGATCTC GAATTCGACT GGCTCGGGGC CCGCGATCCG CGGCCGGCGG GCTCGGGCCT CTATTACATC GACCATCTGA CTCACAACGT GCGTCGCGGC CGCATGAATG TGTGGACCGG CTTCTATGAG AGACTGTTCA ACTTCCGGCA GATCCGCTTC TTCGACATCG AGGGTCGCGC CTCCGGCCTG TTCTCGCGTG CGTTGACCAG CCCTGACGGC AAGATCCGCA TCCCGATCAA CGAGGACGCG GGCGACTCCG GCCAGATCGA GGAATATCTG AAGGTCTATC GCGGCGAGGG CATCCAGCAC ATCGCCTGCG GCGCCCGCGA CATCTACGCC ACGGTCGAAG GCCTGCGCGC GTCAGGCCTG CCGTTCATGC CGTCGCCGCC CGACACCTAT TTCGAGCGGA TCGATGCGCG ACTGCCCGGC CATGGCGAAG ACATCGCGCG GCTGAAGACG AACGGCATCC TGATCGACGG CGAAGGCGTC GTCGATGGTG GCCACACCAA GGTGCTCTTG CAGATCTTCT CGGCCAACGC GATCGGGCCG ATCTTCTTCG AGTTCATCCA GCGCAAGGGC GACGACGGCT TTGGCGAAGG CAACTTCAAG GCGCTGTTCG AGTCGATCGA GGAGGACCAG ATCCGTCGCG GTGTGCTGAA GGTGGAGGCG GCGGAGTAG
|
Protein sequence | MGPFPHDAPP ATLSADNPMG TDGFEFVEYA HPDAAQLHAL FKLMGFAPVA RHKTKAITVY RQGDINYLVN EEPGSHGHDF VAAHGPCAPS MAFRVVDAKK AYARAVALGA EPADLQPSQK ALDVPAIKGI GGSVLYLVDR YGAKGSAYDL EFDWLGARDP RPAGSGLYYI DHLTHNVRRG RMNVWTGFYE RLFNFRQIRF FDIEGRASGL FSRALTSPDG KIRIPINEDA GDSGQIEEYL KVYRGEGIQH IACGARDIYA TVEGLRASGL PFMPSPPDTY FERIDARLPG HGEDIARLKT NGILIDGEGV VDGGHTKVLL QIFSANAIGP IFFEFIQRKG DDGFGEGNFK ALFESIEEDQ IRRGVLKVEA AE
|
| |