Gene Information |
Locus tag | Haur_2764 |
Symbol | |
ID | 5734645 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3519540 |
End bp | 3520661 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279907 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_001545530 |
Protein GI | 159899283 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
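The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length. The record does not state either assumption, although both agree with the values it reports. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3519540
end_bp = 3520661

# Assumption: coordinates are 1-based and inclusive,
# so both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # 1122 bp
print(protein_length, "aa")   # 373 aa
```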
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACGATGG CAGAACAAGA AACCACGAAT GCCCATGATC CTTTAGCATT ACGTGGCATC GATTATGTCG AAATGTATGT TGGTAATGCC CGCCAAGCAG CTCATTACTA TCGGACGGCC TTTGGTTTTA CGCCAGTCGC TTATGCTGGC TTGGAAACAG GCACGCGCGA CCGCGTTTCG TTTGTGATGC AACAGCGCAA CATCCGCTTG GTTCTGACTG GGGCACTCAA TCCTGATTCG CCAATTGCTG AGCATGTTAA ATTGCATGGT GATGGGGTTA AGGATATTGC GCTCGAAGTT GAAAACGCCA CTGCTGCGTT CGAAGCAGCA CTTGCCCGCG GCGCAACCGC AGTGCTTGAG CCAACCGTGC TGGAAAGCAA ATGGGGCAAA GTGGTTAAAG CGACCATTCG TACCTATGGT CATACGGTGC ATACGTTTGT TGAGCGCGAT GGCTATACTG GTACGTTTAT GCCAGGCTAC AACAAGGTTA AAAATCCGGC CAAAGCTGAG CCAACTGGTT TAGCCGCGGT TGATCATATT GTGGGTAACG TTGAGCTAGG CAAAATGGAT GAATGGGTCA ATTTCTATGC CCGCATCCTT GGGTTTAGCC AACTCCAACA ATTTACCGAC GACGATATTT CAACCGAATA TAGCGCCTTG ATGTCGAAAG TCGTGCAAAA TGGCACAGGC CGGATCAAAT TCCCAATTAA TGAGCCAGCC GAAGGTCGCA AAAAATCACA AATCGATGAA TATCTCGATT ATTACCGTGG CCCCGGTGCT CAACACATCG CCTTGATCAC GCCTGACATT ATCAAAACTG TGCAGCAGCT GCGTGATAAT GGCGTGGAAT TTTTGCGCAC GCCGGATACA TATTACTCGG CCTTGGCCGG TCGGGTTGGC CATATCGACG AGGACTACAA TACCCTGCAA CAATTGGGTA TTTTAGTCGA TCGTGACGAT GAAGGCTATC TCTTGCAAAT CTTCACCAAA CCAGTTGGCG ATCGACCAAC CGTGTTCTAC GAGATTATTC AACGCAAGGG CAGCCGTGGC TTTGGTGCTG GAAATTTCAA AGCCCTATTT GAAGCGATTG AGCGCGAACA AGCCAAACGT GGCAATTTGT AA
Protein sequence | MTMAEQETTN AHDPLALRGI DYVEMYVGNA RQAAHYYRTA FGFTPVAYAG LETGTRDRVS FVMQQRNIRL VLTGALNPDS PIAEHVKLHG DGVKDIALEV ENATAAFEAA LARGATAVLE PTVLESKWGK VVKATIRTYG HTVHTFVERD GYTGTFMPGY NKVKNPAKAE PTGLAAVDHI VGNVELGKMD EWVNFYARIL GFSQLQQFTD DDISTEYSAL MSKVVQNGTG RIKFPINEPA EGRKKSQIDE YLDYYRGPGA QHIALITPDI IKTVQQLRDN GVEFLRTPDT YYSALAGRVG HIDEDYNTLQ QLGILVDRDD EGYLLQIFTK PVGDRPTVFY EIIQRKGSRG FGAGNFKALF EAIEREQAKR GNL
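The relationship between the gene sequence and the protein sequence can be illustrated with a short script. This is a minimal sketch, not the database's own pipeline. It strips the spaces that separate the 10-character blocks, then translates codon by codon until the first stop codon. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, whose sense-codon assignments are shared by translation table 11. Table 11 differs in which codons may serve as alternative starts, and this sketch does not handle those: a non-ATG start codon would be translated literally.

```python
from itertools import product

# Standard codon table in TCAG order; translation table 11 uses the
# same amino acid assignments for sense and stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGACGATGG CAGAACAAGA AACCACGAAT"
print(translate(first_blocks))  # MTMAEQETTN
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` should give results that can be compared against the Protein sequence and GC content fields.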