Gene Anae109_3484 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_3484 
Symbol 
ID5374548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp4090668 
End bp4091843 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content72% 
IMG OID640845008 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_001380651 
Protein GI153006326 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.37161 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACGA CCAGGCTGGA GCCGCTGGGG ATCCGCCGCA TCGAGGCGCT GCACTACTAC 
GTCCACGACC TGGAGCGGAG CCGCCGCTTC TACGTCGAGC GCCTGGACTT CTCCGAGTCC
GGCGCCTCCA CGCCCGAGCT CGAGCGCGAG GGGCGGCAGC GCTCCGCGGC GTTCGAGGCC
GGCGACGTGC GCATCCTCTG CTCCCAGCCG GCCGGCGAGG GCGGGCGCGC CTGGCGCTAC
CTGCGCAAGC ACCCCGACGG CGTCGGCGCG GTGATCTTCG AGGTGGAGGA CGCCGAGCGC
GCCTTCCGCC TGCTCGAGGA GCGCGGCGGG ACCCCCATCA CCGACCTCCA GGTCCACGAG
GACGACGGGG GGACGCTCCG CACCTTCAAC ATCACGACGC CGCTCGGCGA CACCACCTTC
CGGTTCGTGG AGCGCCGCGG GTACCGCGGG CTCTACCCCG GCGTCGCGCG CCACGCGGCG
CCGAAGGGCG GGGCGAACGC GTTCGGCTTC GGCTACGTCG ACCACCTCAC CTCGAACTTC
CAGACCATGA AGCCGGCGCT CCTCTGGATG GAGCACGTGC TCGGCCTGGA GGAGTTCTGG
GAGGTCCAGT TCCACACGAA GGACGCCGCC GAGGCGAAGC GGGCGGCGAT CCAGGCGCAG
AAGGGGTCGG GGCTGCGCTC GGTGGTGATG AAGGACCCCG CCTCGGGCGT GAAGTTCGCG
AACAACGAGC CGTGGCGGCC CGCGTTCAAG TCCTCCCAGA TCAACGTGTT CAACGAGGAT
CACCGCGGCG ACGGCATCCA GCACGCCGCG CTCACCGTCG CGGACATCCT CTCCGCCGTG
CGCGGCCTGC GCGCGCGCGG GGTCGAGTTC ATGCCGACGC CGGCGTCCTA CTACGAGGCG
CTCCCCGAGC GGATCCGGCG CACCGGGATC GGCCGGATCG ACGAGCAGGT GGAGACGCTG
CGGGAGCTCG AGATCCTGGT GGACGGCGCC GGCCAGGGCT CTTACCTGCT CCAGATCTTC
CTGCGCGACG CGGCGGGGCT GTACCACGAG CCCGCGGCCG GCCCCTTCTT CTTCGAGATC
ATCCAGCGCA AGGGCGATCA GGGGTTCGGC GCGGGGAACT TCCGCGCGCT CTTCGAGTCG
ATCGAGCGCG AGCAGCAGCG CGAGGGGCGC ACCTAG
 
Protein sequence
MTTTRLEPLG IRRIEALHYY VHDLERSRRF YVERLDFSES GASTPELERE GRQRSAAFEA 
GDVRILCSQP AGEGGRAWRY LRKHPDGVGA VIFEVEDAER AFRLLEERGG TPITDLQVHE
DDGGTLRTFN ITTPLGDTTF RFVERRGYRG LYPGVARHAA PKGGANAFGF GYVDHLTSNF
QTMKPALLWM EHVLGLEEFW EVQFHTKDAA EAKRAAIQAQ KGSGLRSVVM KDPASGVKFA
NNEPWRPAFK SSQINVFNED HRGDGIQHAA LTVADILSAV RGLRARGVEF MPTPASYYEA
LPERIRRTGI GRIDEQVETL RELEILVDGA GQGSYLLQIF LRDAAGLYHE PAAGPFFFEI
IQRKGDQGFG AGNFRALFES IEREQQREGR T