Gene Franean1_4747 details


Gene Information

Locus tag: Franean1_4747
Symbol: mhpA
ID: 5673089
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5668474
End bp: 5670198
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 73%
IMG OID: 641243604
Product: 3-(3-hydroxyphenyl)propionate hydroxylase
Protein accession: YP_001509020
Protein GI: 158316512
COG category: [C] Energy production and conversion; [H] Coenzyme transport and metabolism
COG ID: [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases
TIGRFAM ID:
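
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length_bp = gene_length(5668474, 5670198)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1725 574
```

Both values agree with the Gene Length and Protein Length fields listed in the record.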


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.649883
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCACGG ACCTGCCCGG AGCCGGCACC GCACCCGGAG CCGTCGCCGC CGCGCCGGCC 
GTCGTGCTGA TCGGCGCCGG CCCGGTGGGG CTGACGCTCG CGAACCTGCT CGGCGGCTAC
GGCGTGCGCA CCCTGGTGGT CGAGGAGCGG GAGACGCTGA TCGACTACCC GCGCGGGGTC
GGCCTGGACG ACGAGTCGCT GCGCACCTTC CAGAGCACGG GCCTGGTCGA CCGGATCCTG
CCGCACACGA ACCCCAACCA GGTGATGCGG TTCGTCGACG CGAAGCAGCG CGTGCTCGCC
GAGATCGCGC CGACCAGCGA GCCGTTCGGC TGGCCGAAGC GCAACGGCTT CGTCCAGCCG
TTGGTGGACG CCGAGCTGCT GGCGGGCCTG CGGCGGTACG AGCACGTCGA GGTCCGCTGG
GGCCACCGGA TGGAGTCGCT GGAGCAGGCC GCCGGCGGGG TGACGGTAGG GCTGTCCGGG
CCCGCCGGCC CCTCGACCGT CACGGCCGGC CACGTCGTCG GCTGCGACGG CGGGCGCAGC
GCCACCCGGC AGCTCATGGG GATGTCGTTC GAGGGCACGA CGTCGCCGAC GCGCTGGCTC
GTGGTCGACA TCCGCCCCGA CCCGCTCGGC CGGCCCAACG TGGATGTCGG CGCGGACCCG
GCGCGCCCCT ACGTGTCGGT GTCCATCGCG CACGGCATCC GCCGCTTCGA GTTCATGCTG
CACGCGGACG AGGCCGACGA GGCGGCGGAG GACCCGGAGT TCATCGCCAC GATGCTCGCC
CGGTTCGTCC CGCGCCCGGA CCGCGTCAAC ATCATCCGGC GCCGGGTGTA CACGCACCAC
TCGCGGATCG CCGGCTCCTT CCGCGACGGG CGGGTCCTGA TCGCCGGCGA CGCCGCCCAC
CTGATGCCGG TCTGGCAGGG CCAGGGCTAC AACAGCGGCA TCCGGGACGC GGCCAACCTC
GGCTGGAAGC TCGCCGCGGT GGCGAACGGC CTCGCCGGCG ACGCCCTGCT CGACACCTAC
GACGTCGAGC GCCGCCGGCA CGCGCAGTCG ATGATCGACC TGTCGACGAC CGTCGGGCGC
ATCATCTCCC CGACCAACCG GCGGGTCGCC ACCGTGCGGG ACTGGATCGC CCGGACGGCC
TCCGCCGTGC CCGCGCTCAA GCAGTACGTC GTGGAGATGC GGTTCAAACC GATGCCGCGA
TACGTCGAGG GCGCGGTCGT GCACACGGAG CCGATCCGGG CGGACTCGCC GGTCGGGACC
CTGTTCATCC AGCCGCGGGT CGACACCCGC GAGCGCGAGA ACGTCCGGCT CGACGACGTG
CTCGGGCCCT GGTTCGCCGT GCTGTGCTGG AACAACGACC CGTACGAGCT GCTCGGCCCC
GAGACGTTCG CGAGGTGGAA GGCGCTCGGC GCGACCTTCG TCGCGCTGCG GCCGGCCACC
CAGCTGCACT GGGCGGACCA GGACCACCCG GACGTCGTGG TGGTCGGGGA CCGGACCGGC
GCGCTCAAGT CCTGGTTCGA CGCGCACGAG GAGTCCGTGC TCTTCCTGCG GCCCGACCGG
TGTGTGTCCG GCGCCTGCGT CGCTCAGCTG ACATCCGAAC TGGCCGCGTC GCTCACCGGC
GCGCTGTCCC TCACCCCGGG AGCCGAGGAT GGCGCTCGCC CTCTGCTGCA TGTCACACAG
CCCGCTGCTC GGCCTGCCCG GCCCGCCGCC GTCTCTCCTG GCTGA
 
Protein sequence
MATDLPGAGT APGAVAAAPA VVLIGAGPVG LTLANLLGGY GVRTLVVEER ETLIDYPRGV 
GLDDESLRTF QSTGLVDRIL PHTNPNQVMR FVDAKQRVLA EIAPTSEPFG WPKRNGFVQP
LVDAELLAGL RRYEHVEVRW GHRMESLEQA AGGVTVGLSG PAGPSTVTAG HVVGCDGGRS
ATRQLMGMSF EGTTSPTRWL VVDIRPDPLG RPNVDVGADP ARPYVSVSIA HGIRRFEFML
HADEADEAAE DPEFIATMLA RFVPRPDRVN IIRRRVYTHH SRIAGSFRDG RVLIAGDAAH
LMPVWQGQGY NSGIRDAANL GWKLAAVANG LAGDALLDTY DVERRRHAQS MIDLSTTVGR
IISPTNRRVA TVRDWIARTA SAVPALKQYV VEMRFKPMPR YVEGAVVHTE PIRADSPVGT
LFIQPRVDTR ERENVRLDDV LGPWFAVLCW NNDPYELLGP ETFARWKALG ATFVALRPAT
QLHWADQDHP DVVVVGDRTG ALKSWFDAHE ESVLFLRPDR CVSGACVAQL TSELAASLTG
ALSLTPGAED GARPLLHVTQ PAARPARPAA VSPG
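
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up, with a GC-content calculation and a stop-codon check; the helper names are hypothetical, and the stop codons used (TAA, TAG, TGA) are those of translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return clean_sequence(seq)[-3:] in STOP_CODONS


# Last line of the gene sequence, copied from the record above.
last_line = "CCCGCTGCTC GGCCTGCCCG GCCCGCCGCC GTCTCTCCTG GCTGA"
print(ends_with_stop(last_line))  # True
```

Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field in the record.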