Gene Hoch_6553 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_6553 
Symbol 
ID8548970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp8997448 
End bp8998596 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content66% 
IMG OID646391215 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_003270914 
Protein GI262199705 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0196689 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.290688 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGATA AGGAATCTCT GGGAATTCAG CAGCTCGAGG CAATCCACTA CTACGTGCAC 
GACATCGAGC GCAGCCGGCG TTTCTACACC GAGCTGATGG GATTTGCCGA GCTGGGGCGC
TCGGGCGACG CGCTCGAGAA GCAGGGCAAT CAGACCTCGC GTCTGTTCGC GGCCAACGAC
TGCCGGGTGC TGGTGTCGGC GCCGCTCAAT GAGCGCTCGC GCGCGGCCCG CTTTCTGCGC
AAGCACCCCG ACGGCGTGGG CACGCTGATC TTCCGCGTCG AGGACATCGA GCGCACCTTC
AAGCTGCTCG ACGAGCGCGG CGGCACGCCG ATCGACGAGA TCCACCACTG CGAGGACGGC
AGCATGTCGT TCTTCTCGAT CACCACACCC TTTGGCGACA CCACCTTCCG CTTCGTGCAG
CGCAAAAAAG ACGGCCCCTT CCTGCCCGGC TTCGAGCACT ACGAGACGCC GCGCGGCGGT
GACAACCCCT ACCGCTTCTC GCACATCGAC CACATCACCT CGAACTTCCA GACCATGAGC
CCGGCGCTGC TGTGGATGGA GCACGTGCTC GGCTTCGAGC GCTACTGGAA GGTCGAGTTC
CACACCCGCG ACGTCGCCCA GGACGAGGGC CACACCGGCT CGGGCCTGCG CTCGGTGGTC
ATGTGGGACC CGAAGTCGGG CGTGAAATTC GCCAACAACG AGCCCATGCG GCCGTTCTTC
AAGAACTCGC AGATCAACCT GTTCAACGAG GACCACCGCG GCGACGGCGT GCAGCACGCG
GCCCTGGCGG TCGAGGACAT CGTCCCCTGC GTGCGCGGCC TGCGCGCCGC TGGCGTGCAG
TTCATGCCGA CCCCGGGCGC CTACTACGAC GCCCTGCCCG AGCGCATCCA GAGCAGCGGT
ATCGGACACA TCGACGAAGA CGTCTCGCTG CTGCGCGAGC TCGAGATCCT CATCGACGGC
GAGGCCGATC ACGCCTACCT GCTGCAGATC TTCCTGGCCG ATTCGGCGTC GATGTACAAG
GAGCCCGAGA GCGGCGCGTT CTTCTTCGAG ATCATCCAGC GCAAGGGCGA CGACGGCTTC
GGCGCCGGCA ACTTCCGCGC CCTGTTCGAG AGCATCGAGC GCCAGCAGAT CGCCGAGAGG
GCTCTGTGA
 
Protein sequence
MADKESLGIQ QLEAIHYYVH DIERSRRFYT ELMGFAELGR SGDALEKQGN QTSRLFAAND 
CRVLVSAPLN ERSRAARFLR KHPDGVGTLI FRVEDIERTF KLLDERGGTP IDEIHHCEDG
SMSFFSITTP FGDTTFRFVQ RKKDGPFLPG FEHYETPRGG DNPYRFSHID HITSNFQTMS
PALLWMEHVL GFERYWKVEF HTRDVAQDEG HTGSGLRSVV MWDPKSGVKF ANNEPMRPFF
KNSQINLFNE DHRGDGVQHA ALAVEDIVPC VRGLRAAGVQ FMPTPGAYYD ALPERIQSSG
IGHIDEDVSL LRELEILIDG EADHAYLLQI FLADSASMYK EPESGAFFFE IIQRKGDDGF
GAGNFRALFE SIERQQIAER AL