Gene A2cp1_3571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA2cp1_3571 
Symbol 
ID7299621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter dehalogenans 2CP-1 
KingdomBacteria 
Replicon accessionNC_011891 
Strand
Start bp3990977 
End bp3992164 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content71% 
IMG OID643596384 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_002493967 
Protein GI220918663 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.665033 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCTCGC ACTCGCAGAA GACGCAGCTC GAGCCGCTCG GCATCGTCCG CATCGAGGGG 
CTGCATTACT ACGTGCACGA CCTCGAGCGC AGCCGCCGCT TCTACACGCA GAAGATGGAC
TTCGCGGAGG TGGCCCGCAG CGCGCCCGCG CTGGAGCGGG AGGGCCGGCA GCGCTCGGCG
GTGTTCGAGG CGGGCGACGT CCGGGTGGTG TGCTCGGAGC CGGTGGGCGA GGGCGGCCGC
GCCTGGCGCT GGCTGCGCAA GCACCCCGAC GGCGTGGGCA CGGTGGTGTT CCAGGTGGAG
GACGCGGACC GCTGCTTCCG GCTGCTGGAG GAGCGCGGGG CGACGCCCAT CACCGACGTG
CAGGAGCACC GCGACGACGG AGGGACGCTG CGCACGTTCA ACATCACCAC CCCGCTCGGC
GACACCACCT TCCGCTTCGT GGAGCGCCGC GGCTACCGCG CCGTCTACCC GGGCATCGAG
CCGCTCGCCG CGCCGGAGGG CGGGCGCAAC GCGTTCGGCT TCGGCCACGT GGACCACCTC
ACCAACAACT TCCAGACCAT GAAGCCGGCG CTCCTGTGGA TGGAGCACGT CATGGGGATG
GAGGAGTTCT GGGAGGTGGA GTTCCACACC AAGGACGCGG CCGGCGCGCG CCGGGCCGCG
CTCGAGGCGC AGAAGGGCTC GGGCCTGCGC TCGGTGGTGA TGCGCGAGCC GCGCTCCGGC
GTGAAGTTCG CGAACAACGA GCCGTGGCGC CCCGCGTTCA AGTCCTCGCA GATCAACGTC
TTCAACGAGG ACCACCGCGG CGACGGCGTG CAGCACGCCG CGCTGACGGT GCAGGACATC
CTCTCCTCGG TGCGCGGCAT GCGCGCCCGC GGGGTGGAGT TCATGCCCAC GCCGGCGACG
TACTACGAGG CGCTGCCGGA GCGGATCCGC AGCACCGGCA TCGGCCGGAT CGACGAGGAC
CCGCGCGTGC TGCAGGAGCT CGAGATCCTG GTGGACGGCG CCGGCGACCA CTCCTACCTG
CTGCAGATCT TCCTGCGCGA CGCGGCCGGC CTGTACCACG AGCCCGACGC CGGGCCGTTC
TTCTTCGAGA TCATCCAGCG CAAGGGCGAC CAGGGCTTCG GCGCGGGCAA CTTCCGCGCG
CTGTTCGAGT CCATCGAGCG CGAGCAGGTG AAGGAAGGGC GGGCCTGA
 
Protein sequence
MTSHSQKTQL EPLGIVRIEG LHYYVHDLER SRRFYTQKMD FAEVARSAPA LEREGRQRSA 
VFEAGDVRVV CSEPVGEGGR AWRWLRKHPD GVGTVVFQVE DADRCFRLLE ERGATPITDV
QEHRDDGGTL RTFNITTPLG DTTFRFVERR GYRAVYPGIE PLAAPEGGRN AFGFGHVDHL
TNNFQTMKPA LLWMEHVMGM EEFWEVEFHT KDAAGARRAA LEAQKGSGLR SVVMREPRSG
VKFANNEPWR PAFKSSQINV FNEDHRGDGV QHAALTVQDI LSSVRGMRAR GVEFMPTPAT
YYEALPERIR STGIGRIDED PRVLQELEIL VDGAGDHSYL LQIFLRDAAG LYHEPDAGPF
FFEIIQRKGD QGFGAGNFRA LFESIEREQV KEGRA