Gene ECD_00301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_00301 
SymbolmhpA 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp338146 
End bp339810 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content57% 
IMG OID 
Product3-(3-hydroxyphenyl)propionate hydroxylase 
Protein accessionACT42200 
Protein GI253976530 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00155662 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAATAC AACACCCTGA CATCCAGCCT GCTGTTAACC ATAGCGTTCA GGTGGCGATC 
GCTGGTGCCG GCCCGGTTGG GCTGATGATG GCGAACTATC TCGGCCAGAT GGGCATTGAC
GTGCTGGTGG TGGAGAAACT CGATAAGTTG ATCGACTACC CGCGTGCGAT TGGTATTGAT
GACGAGGCGC TGCGCACCAT GCAGTCGGTC GGCCTGGTCG ATGATGTTCT GCCGCACACT
ACGCCGTGGC ACGCGATGCG TTTTCTCACC CCGAAAGGCC GCTGTTTTGC TGATATTCAG
CCAATGACCG ATGAATTTGG CTGGCCGCGC CGTAACGCCT TTATTCAGCC GCAGGTCGAT
GCGGTGATGC TGGAAGGGGT GTCGCGTTTT CCGAATGTGC GCTGCTTGTT TTCCCGCGAG
CTGGAGGCCT TCAGTCAGCA AGATGACGAA GTGACCTTGC ACCTGAAAAC GGCAGAAGGG
CAGCGGGAAA TAGTCAAAGC CCAGTGGCTG GTGGCCTGTG ATGGTGGGGC AAGTTTTGTC
CGTCGCACCC TGAATGTGCC GTTTGAAGGT AAAACTGCGC CAAATCAGTG GATTGTGGTA
GATATCGCCA ACGATCCGTT AAGTACGCCG CATATCTATT TGTGTTGCGA TCCGGTGCGC
CCGTATGTTT CTGCCGCGCT ACCTCATGCG GTACGTCGCT TTGAATTTAT GGTGATGCCA
GGAGAAACCG AAGAACAGCT GCGTGAGCCG CAAAATATGC GCAAGCTGTT AAGCAAAGTG
CTGCCTAATC CGGACAATGT TGAATTGATT CGCCAGCGTG TCTACACCCA CAACGCGCGA
CTGGCGCAAC GTTTCCGTAT TGATCGCGTA CTGCTGGCGG GCGATGCCGC GCACATCATG
CCGGTATGGC AGGGGCAGGG CTATAACAGC GGTATGCGCG ACGCCTTTAA CCTCGCCTGG
AAACTGGCGT TGGTTATCCA GGGGAAAGCC CGTGATGCGC TGCTCGATAC CTATCAACAA
GAACGACGCG ATCACGCCAA AGCGATGATT GACCTGTCCG TGACGGCGGG CAACGTGCTG
GCTCCGCCGA AACGCTGGCA GGGTACGTTA CGTGACGGCG TTTCCTGGCT GTTGAATTAT
CTGCCGCCAG TAAAACGCTA CTTCCTCGAA ATGCGCTTCA AGCCGATGCC GCAATATTAC
GGCGGTGCGC TGGTGCGAGA GGGCGAAGCG AAGCACTCTC CGGTCGGCAA GATGTTTATT
CAGCCGAAAG TCACGCTGGA AAACGGCGAC GTGACGCTGC TCGATAACGC GATCGGCGCG
AACTTCGCGG TAATTGGCTG GGGATGCAAT CCACTGTGGG GGATGAGCGA CGAGCAAATC
CAGCAGTGGC GCGCGTTGGG CACCCGCTTC ATTCAGGTGG TGCCGGAAGT GCAAATTCAT
ACCGCACAGG ATAACCACGA CGGCGTACTA CGCGTGGGGG ATACGCAAGG TCGCCTGCGT
AGCTGGTTCG CACAACATAA TGCTTCGCTG GTGGTGATGC GCCCGGATCG CTTTGTTGCC
GCCACCGCCA TTCCGCAAAC CCTGGGTAAT ACGCTGAATA AACTGGCGTC GGTGATGACG
CTGACCCGCC CTGATGCCGA CGTTTCTGTC GAAAAGGTAG CCTGA
 
Protein sequence
MAIQHPDIQP AVNHSVQVAI AGAGPVGLMM ANYLGQMGID VLVVEKLDKL IDYPRAIGID 
DEALRTMQSV GLVDDVLPHT TPWHAMRFLT PKGRCFADIQ PMTDEFGWPR RNAFIQPQVD
AVMLEGVSRF PNVRCLFSRE LEAFSQQDDE VTLHLKTAEG QREIVKAQWL VACDGGASFV
RRTLNVPFEG KTAPNQWIVV DIANDPLSTP HIYLCCDPVR PYVSAALPHA VRRFEFMVMP
GETEEQLREP QNMRKLLSKV LPNPDNVELI RQRVYTHNAR LAQRFRIDRV LLAGDAAHIM
PVWQGQGYNS GMRDAFNLAW KLALVIQGKA RDALLDTYQQ ERRDHAKAMI DLSVTAGNVL
APPKRWQGTL RDGVSWLLNY LPPVKRYFLE MRFKPMPQYY GGALVREGEA KHSPVGKMFI
QPKVTLENGD VTLLDNAIGA NFAVIGWGCN PLWGMSDEQI QQWRALGTRF IQVVPEVQIH
TAQDNHDGVL RVGDTQGRLR SWFAQHNASL VVMRPDRFVA ATAIPQTLGN TLNKLASVMT
LTRPDADVSV EKVA