Gene Haur_2764 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2764 
Symbol 
ID5734645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3519540 
End bp3520661 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content49% 
IMG OID641279907 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_001545530 
Protein GI159899283 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATGG CAGAACAAGA AACCACGAAT GCCCATGATC CTTTAGCATT ACGTGGCATC 
GATTATGTCG AAATGTATGT TGGTAATGCC CGCCAAGCAG CTCATTACTA TCGGACGGCC
TTTGGTTTTA CGCCAGTCGC TTATGCTGGC TTGGAAACAG GCACGCGCGA CCGCGTTTCG
TTTGTGATGC AACAGCGCAA CATCCGCTTG GTTCTGACTG GGGCACTCAA TCCTGATTCG
CCAATTGCTG AGCATGTTAA ATTGCATGGT GATGGGGTTA AGGATATTGC GCTCGAAGTT
GAAAACGCCA CTGCTGCGTT CGAAGCAGCA CTTGCCCGCG GCGCAACCGC AGTGCTTGAG
CCAACCGTGC TGGAAAGCAA ATGGGGCAAA GTGGTTAAAG CGACCATTCG TACCTATGGT
CATACGGTGC ATACGTTTGT TGAGCGCGAT GGCTATACTG GTACGTTTAT GCCAGGCTAC
AACAAGGTTA AAAATCCGGC CAAAGCTGAG CCAACTGGTT TAGCCGCGGT TGATCATATT
GTGGGTAACG TTGAGCTAGG CAAAATGGAT GAATGGGTCA ATTTCTATGC CCGCATCCTT
GGGTTTAGCC AACTCCAACA ATTTACCGAC GACGATATTT CAACCGAATA TAGCGCCTTG
ATGTCGAAAG TCGTGCAAAA TGGCACAGGC CGGATCAAAT TCCCAATTAA TGAGCCAGCC
GAAGGTCGCA AAAAATCACA AATCGATGAA TATCTCGATT ATTACCGTGG CCCCGGTGCT
CAACACATCG CCTTGATCAC GCCTGACATT ATCAAAACTG TGCAGCAGCT GCGTGATAAT
GGCGTGGAAT TTTTGCGCAC GCCGGATACA TATTACTCGG CCTTGGCCGG TCGGGTTGGC
CATATCGACG AGGACTACAA TACCCTGCAA CAATTGGGTA TTTTAGTCGA TCGTGACGAT
GAAGGCTATC TCTTGCAAAT CTTCACCAAA CCAGTTGGCG ATCGACCAAC CGTGTTCTAC
GAGATTATTC AACGCAAGGG CAGCCGTGGC TTTGGTGCTG GAAATTTCAA AGCCCTATTT
GAAGCGATTG AGCGCGAACA AGCCAAACGT GGCAATTTGT AA
 
Protein sequence
MTMAEQETTN AHDPLALRGI DYVEMYVGNA RQAAHYYRTA FGFTPVAYAG LETGTRDRVS 
FVMQQRNIRL VLTGALNPDS PIAEHVKLHG DGVKDIALEV ENATAAFEAA LARGATAVLE
PTVLESKWGK VVKATIRTYG HTVHTFVERD GYTGTFMPGY NKVKNPAKAE PTGLAAVDHI
VGNVELGKMD EWVNFYARIL GFSQLQQFTD DDISTEYSAL MSKVVQNGTG RIKFPINEPA
EGRKKSQIDE YLDYYRGPGA QHIALITPDI IKTVQQLRDN GVEFLRTPDT YYSALAGRVG
HIDEDYNTLQ QLGILVDRDD EGYLLQIFTK PVGDRPTVFY EIIQRKGSRG FGAGNFKALF
EAIEREQAKR GNL