Gene Franean1_4834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4834 
SymbolmhpA 
ID5673175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5775501 
End bp5777294 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content74% 
IMG OID641243690 
Product3-(3-hydroxyphenyl)propionate hydroxylase 
Protein accessionYP_001509106 
Protein GI158316598 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.447643 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00803206 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGACCGCGC CCGAGCCGGC CGAGGTCGAG GCCGAGGTCG AGGCCGAGGC TGAGGCCATG 
GCCGACGTCG TCGTCGTCGG CTACGGGCCG GTCGGCCAGG TAACGGCGAT TCTGCTGGCG
CGGCGCGGCT GGCGGGTCAC AGTCCTGGAG CGATGGCCCC GGCCGTACCC GATGCCGCGC
GCGGTCTCCT TCGACGGCGA GTCGGCCCGC ATCCTCGCCG CGGCCGGCGT CGGCCCGGCG
ATGACCGAGT TCGGTGAGCC GTCCCGCGAC TACACCTGGC GCAACGCCGC GGGTGACGTG
CTGCTGCACG TCGACGTCCC CGAGCGCGGG CGTTCCGGCT GGCCGGACTC GACGTCGATG
TACCAGCCGG CACTGGAGGA GGCGCTCGCG GAGCGGGGCT CCCGACTGCC GGGCCTGCGG
GTGCTGCGCG GGCACCGGGT GGTCGCGCTG ACCGAACGGG ACGGCCACGT CGAGCTGACG
GCCGTCCTCG ACGGCTCGCC GCCGGACCGC ACCCTACGCC TCCGCGCCCG TTGGGTGGTG
GGCTGCGACG GCGCGAACAG CTTCGTGCGG ACGAGCCTGG GCGTGGAGAC CACCGACTTC
GCCTACTTCA ACGACTGGCT GACCTGTGAC GTGGTCCTGC GCGAACCGGC GGTGCACAGG
CCCAACAACC TCCAGATCTG CGACCCGACC CGGCCCCGCA CGGCCGTGTC CGCCGGGCCG
GGGCACCGGC GGTGGGAGTT CATGCGGCTA CCCGGGGAAC CGGCCGAGGA GTTCGGCCGG
ACGGAGTCGG CGTGGCGGCT GCTCGGTCTG TTCGGCCTGC ACCCGGGCAA CGCCACACTG
GAACGGCACG CCGTCTACAC GTTCCAGGCC AGGTACGTCG ACCGCTGGCG GGTGGGGCGG
GTGCTGCTCG CCGGGGACGC CGCGCACCTG ATGCCGCCGT TCGCCGGGCA GGGCATGTGC
TCGGGCTTCC GCGACGCGGC GAATCTCTCC TGGAAGCTCG ACCTGGTGCT CGCCGGAACC
GCCCCGGCTG AGCTGCTGGA CACCTACACC CTGGAGCGGC GGGCGCACGT CCAGCACGCG
ATCGGCATGT CGATGGACCT CGGGCGGGTG ATCTGCGAGA CCGACCCGGC CGCGGCCCGG
GACCGGGACG AGGTCATGAT CGCGGTGCGT GCCCGGGGGC TGCGGGAGGA CCGCGCGCAG
TCCGCCGTCG AGCCGCTGAC CGCCGGGTTC CTGCGCGGCG GGGCCGCCGC ACACGGCGGT
GGCCCCCAGC GTCCCCCGCA GCCTTCCCGC CCGGGGCGTC CGGGGCCGCC GTCGGTTCCG
GTGGGTGACC TCGTGCCGCA GGGACGAGTT CGGGTCGGTG ACCGCACCGG GCTGTTCGAC
GAGTTCGTCG AGCCCGGGTT CGTCCTGCTG GCCACCTGGC CGGCCGGCGG GCATCTCCAT
CTCCGCCCCG AGACGGACGC CGCTTTTGCG CTTCTGGGCG GACGGGTGGT CAATGTTGTC
CCCGCCACCG ATGAGGAGCC TCCTGCCGTC GTGGCCGGGA GCGCGGACGC GAGCGCAAGA
ATCACCGTCG TCGACGTCGA CGACGTCTAT CTCCGCTATC TCGCCCTCGC CGGCGCCGAT
GCCGTTCTGG TGCGGCCGGA CTTCTACCTC TTCGGCCGTG TCCCGGCGGC GGCACCGCCG
GCCGCGGACA CCCGGGCCGG CGGTACAGCG GTACAGCGGT CGGATTCCGT CGAGGAACTC
GTCGGTGACC TGCTGGCCGC GCTCGACGCC CCGATTCCCG TCGGGAAGCA TTAG
 
Protein sequence
MTAPEPAEVE AEVEAEAEAM ADVVVVGYGP VGQVTAILLA RRGWRVTVLE RWPRPYPMPR 
AVSFDGESAR ILAAAGVGPA MTEFGEPSRD YTWRNAAGDV LLHVDVPERG RSGWPDSTSM
YQPALEEALA ERGSRLPGLR VLRGHRVVAL TERDGHVELT AVLDGSPPDR TLRLRARWVV
GCDGANSFVR TSLGVETTDF AYFNDWLTCD VVLREPAVHR PNNLQICDPT RPRTAVSAGP
GHRRWEFMRL PGEPAEEFGR TESAWRLLGL FGLHPGNATL ERHAVYTFQA RYVDRWRVGR
VLLAGDAAHL MPPFAGQGMC SGFRDAANLS WKLDLVLAGT APAELLDTYT LERRAHVQHA
IGMSMDLGRV ICETDPAAAR DRDEVMIAVR ARGLREDRAQ SAVEPLTAGF LRGGAAAHGG
GPQRPPQPSR PGRPGPPSVP VGDLVPQGRV RVGDRTGLFD EFVEPGFVLL ATWPAGGHLH
LRPETDAAFA LLGGRVVNVV PATDEEPPAV VAGSADASAR ITVVDVDDVY LRYLALAGAD
AVLVRPDFYL FGRVPAAAPP AADTRAGGTA VQRSDSVEEL VGDLLAALDA PIPVGKH