Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4747 |
Symbol | mhpA |
ID | 5673089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5668474 |
End bp | 5670198 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243604 |
Product | 3-(3-hydroxyphenyl)propionate hydroxylase |
Protein accession | YP_001509020 |
Protein GI | 158316512 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.649883 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACGG ACCTGCCCGG AGCCGGCACC GCACCCGGAG CCGTCGCCGC CGCGCCGGCC GTCGTGCTGA TCGGCGCCGG CCCGGTGGGG CTGACGCTCG CGAACCTGCT CGGCGGCTAC GGCGTGCGCA CCCTGGTGGT CGAGGAGCGG GAGACGCTGA TCGACTACCC GCGCGGGGTC GGCCTGGACG ACGAGTCGCT GCGCACCTTC CAGAGCACGG GCCTGGTCGA CCGGATCCTG CCGCACACGA ACCCCAACCA GGTGATGCGG TTCGTCGACG CGAAGCAGCG CGTGCTCGCC GAGATCGCGC CGACCAGCGA GCCGTTCGGC TGGCCGAAGC GCAACGGCTT CGTCCAGCCG TTGGTGGACG CCGAGCTGCT GGCGGGCCTG CGGCGGTACG AGCACGTCGA GGTCCGCTGG GGCCACCGGA TGGAGTCGCT GGAGCAGGCC GCCGGCGGGG TGACGGTAGG GCTGTCCGGG CCCGCCGGCC CCTCGACCGT CACGGCCGGC CACGTCGTCG GCTGCGACGG CGGGCGCAGC GCCACCCGGC AGCTCATGGG GATGTCGTTC GAGGGCACGA CGTCGCCGAC GCGCTGGCTC GTGGTCGACA TCCGCCCCGA CCCGCTCGGC CGGCCCAACG TGGATGTCGG CGCGGACCCG GCGCGCCCCT ACGTGTCGGT GTCCATCGCG CACGGCATCC GCCGCTTCGA GTTCATGCTG CACGCGGACG AGGCCGACGA GGCGGCGGAG GACCCGGAGT TCATCGCCAC GATGCTCGCC CGGTTCGTCC CGCGCCCGGA CCGCGTCAAC ATCATCCGGC GCCGGGTGTA CACGCACCAC TCGCGGATCG CCGGCTCCTT CCGCGACGGG CGGGTCCTGA TCGCCGGCGA CGCCGCCCAC CTGATGCCGG TCTGGCAGGG CCAGGGCTAC AACAGCGGCA TCCGGGACGC GGCCAACCTC GGCTGGAAGC TCGCCGCGGT GGCGAACGGC CTCGCCGGCG ACGCCCTGCT CGACACCTAC GACGTCGAGC GCCGCCGGCA CGCGCAGTCG ATGATCGACC TGTCGACGAC CGTCGGGCGC ATCATCTCCC CGACCAACCG GCGGGTCGCC ACCGTGCGGG ACTGGATCGC CCGGACGGCC TCCGCCGTGC CCGCGCTCAA GCAGTACGTC GTGGAGATGC GGTTCAAACC GATGCCGCGA TACGTCGAGG GCGCGGTCGT GCACACGGAG CCGATCCGGG CGGACTCGCC GGTCGGGACC CTGTTCATCC AGCCGCGGGT CGACACCCGC GAGCGCGAGA ACGTCCGGCT CGACGACGTG CTCGGGCCCT GGTTCGCCGT GCTGTGCTGG AACAACGACC CGTACGAGCT GCTCGGCCCC GAGACGTTCG CGAGGTGGAA GGCGCTCGGC GCGACCTTCG TCGCGCTGCG GCCGGCCACC CAGCTGCACT GGGCGGACCA GGACCACCCG GACGTCGTGG TGGTCGGGGA CCGGACCGGC GCGCTCAAGT CCTGGTTCGA CGCGCACGAG GAGTCCGTGC TCTTCCTGCG GCCCGACCGG TGTGTGTCCG GCGCCTGCGT CGCTCAGCTG ACATCCGAAC TGGCCGCGTC GCTCACCGGC GCGCTGTCCC TCACCCCGGG AGCCGAGGAT GGCGCTCGCC CTCTGCTGCA TGTCACACAG CCCGCTGCTC GGCCTGCCCG GCCCGCCGCC GTCTCTCCTG GCTGA
|
Protein sequence | MATDLPGAGT APGAVAAAPA VVLIGAGPVG LTLANLLGGY GVRTLVVEER ETLIDYPRGV GLDDESLRTF QSTGLVDRIL PHTNPNQVMR FVDAKQRVLA EIAPTSEPFG WPKRNGFVQP LVDAELLAGL RRYEHVEVRW GHRMESLEQA AGGVTVGLSG PAGPSTVTAG HVVGCDGGRS ATRQLMGMSF EGTTSPTRWL VVDIRPDPLG RPNVDVGADP ARPYVSVSIA HGIRRFEFML HADEADEAAE DPEFIATMLA RFVPRPDRVN IIRRRVYTHH SRIAGSFRDG RVLIAGDAAH LMPVWQGQGY NSGIRDAANL GWKLAAVANG LAGDALLDTY DVERRRHAQS MIDLSTTVGR IISPTNRRVA TVRDWIARTA SAVPALKQYV VEMRFKPMPR YVEGAVVHTE PIRADSPVGT LFIQPRVDTR ERENVRLDDV LGPWFAVLCW NNDPYELLGP ETFARWKALG ATFVALRPAT QLHWADQDHP DVVVVGDRTG ALKSWFDAHE ESVLFLRPDR CVSGACVAQL TSELAASLTG ALSLTPGAED GARPLLHVTQ PAARPARPAA VSPG
|
| |