Gene EcHS_A0411 details


Gene Information

Locus tag: EcHS_A0411
Symbol: mhpA
ID: 5591963
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 433117
End bp: 434781
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 57%
IMG OID: 640919596
Product: 3-(3-hydroxyphenyl)propionate hydroxylase
Protein accession: YP_001457181
Protein GI: 157159863
COG category: [C] Energy production and conversion; [H] Coenzyme transport and metabolism
COG ID: [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases
TIGRFAM ID:
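
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(433117, 434781)
print(length)                  # 1665
print(protein_length(length))  # 554
```

Both results match the Gene Length (1665 bp) and Protein Length (554 aa) fields of this record.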


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value: 0.0836277
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAATAC AACACCCTGA CATCCAGCCT GCTGTTAACC ATAGCGTTCA GGTGGCGATC 
GCTGGTGCCG GTCCGGTTGG GCTGATGATG GCGAACTATC TCGGTCAGAT GGGCATTGAC
GTGCTGGTGG TGGAGAAACT CGATAAGTTG ATCGACTACC CGCGTGCGAT TGGTATTGAT
GACGAGGCGC TGCGCACCAT GCAGTCGGTC GGCCTGGTCG AGAATGTTCT GCCGCACACT
ACACCGTGGC ACGCGATGCG TTTTCTCACC CCAAAAGGCC GCTGTTTTGC TGATATTCAG
CCAATGACCG ATGAATTTGG CTGGCCGCGC CGTAACGCCT TTATTCAGCC ACAGGTCGAT
GCGGTGATGC TGGAAGGGTT GTCGCGTTTT CCGAATGTGC GCTGCTTGTT TGCCCGCGAG
CTGGAGGCCT TCAGCCAGCA AAATGACGAA GTGACCTTGC ACCTGAAAAC GGCAGAAGGG
CAGCGGGAAA CGGTCAAAGC CCAGTGGCTG GTAGCCTGTG ACGGTGGAGC AAGTTTTGTC
CGTCGCACTC TGAATGTGCC GTTTGAAGGT AAAACTGCGC CAAATCAGTG GATTGTGGTA
GATATCGCCA ACGATCCGTT AAGTACGCCG CATATCTATT TGTGTTGCGA TCCGGTGCGC
CCGTATGTTT CTGCCGCGCT GCCTCATGCG GTACGTCGCT TTGAATTTAT GGTGATGCCG
GGAGAAACCG AAGAGCAGCT GCGTGAGCCG CAAAATATGC GCAAGCTGTT AAGCAAAGTG
CTGCCTAATC CGGACAATGT TGAATTGATT CGCCAGCGTG TCTACACCCA CAACGCGCGA
CTGGCGCAAC GTTTCCGTAT TGATCGCGTA CTGCTGGCGG GCGATGCCGC GCACATCATG
CCGGTATGGC AGGGGCAGGG CTATAACAGT GGTATGCGCG ACGCCTTTAA CCTCGCATGG
AAACTGGCGT TGGTTATCCA GGGGAAAGCC CGCGATGCGC TGCTCGATAC CTATCAACAA
GAACGTCGCG ATCACGCCAA AGCGATGATT GACCTGTCCG TGACGGCGGG CAACGTGCTG
GCTCCGCCGA AACGCTGGCA GGGTACGTTA CGTGACGGCG TTTCCTGGCT GTTGAATTAT
CTGCCGCCAG TAAAACGCTA CTTCCTCGAA ATGCGCTTCA AGCCGATGCC GCAATATTAC
GGCGGTGCGC TGATGCGTGA GGGCGAAGCG AAGCACTCTC CGGTCGGCAA GATGTTTATT
CAGCCGAAAG TCACGCTGGA AAACGGCGAC GTGACGCTGC TCGATAACGC GATCGGCGCG
AACTTCGCGG TAATTGGCTG GGGATGCAAT CCACTGTGGG GGATGAGCGA CGAGCAAATC
CAGCAGTGGC GCGCGTTGGG CACACGCTTC ATTCAGGTGG TGCCGGAAGT GCAAATTCAT
ACCGCACAGG ATAACCACGA CGGCGTACTA CGCGTGGGCG ATACGCAAGG TCGCCTGCGT
AGCTGGTTCG CGCAACACAA TGCTTCGCTG GTGGTGATGC GCCCGGATCG CTTTGTTGCC
GCCACCGCCA TTCCGCAAAC CCTGGGCAAG ACCCTGAATA AACTGGCGTC GGTGATGACG
CTGACCCGCC CTGATGCCGA CGTTTCTGTC GAAAAGGTAG CCTGA
 
Protein sequence
MAIQHPDIQP AVNHSVQVAI AGAGPVGLMM ANYLGQMGID VLVVEKLDKL IDYPRAIGID 
DEALRTMQSV GLVENVLPHT TPWHAMRFLT PKGRCFADIQ PMTDEFGWPR RNAFIQPQVD
AVMLEGLSRF PNVRCLFARE LEAFSQQNDE VTLHLKTAEG QRETVKAQWL VACDGGASFV
RRTLNVPFEG KTAPNQWIVV DIANDPLSTP HIYLCCDPVR PYVSAALPHA VRRFEFMVMP
GETEEQLREP QNMRKLLSKV LPNPDNVELI RQRVYTHNAR LAQRFRIDRV LLAGDAAHIM
PVWQGQGYNS GMRDAFNLAW KLALVIQGKA RDALLDTYQQ ERRDHAKAMI DLSVTAGNVL
APPKRWQGTL RDGVSWLLNY LPPVKRYFLE MRFKPMPQYY GGALMREGEA KHSPVGKMFI
QPKVTLENGD VTLLDNAIGA NFAVIGWGCN PLWGMSDEQI QQWRALGTRF IQVVPEVQIH
TAQDNHDGVL RVGDTQGRLR SWFAQHNASL VVMRPDRFVA ATAIPQTLGK TLNKLASVMT
LTRPDADVSV EKVA
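
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch rather than the database's own pipeline. It assumes the 10-base grouping and line breaks are purely cosmetic, uses the standard codon assignments (which table 11 shares; table 11 differs only in its permitted start codons), and does not handle alternative start codons. The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
# Standard codon assignments in NCBI order (first, second, third base over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGGCAATAC AACACCCTGA CATCCAGCCT"))  # MAIQHPDIQP
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_content` applied to the same string gives the value to compare against the reported 57%.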