Gene Franean1_4331 details


Gene Information

Locus tag: Franean1_4331
Symbol:
ID: 5672686
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5173712
End bp: 5175067
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 75%
IMG OID: 641243204
Product: monooxygenase FAD-binding
Protein accession: YP_001508621
Protein GI: 158316113
COG category: [C] Energy production and conversion; [H] Coenzyme transport and metabolism
COG ID: [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases
TIGRFAM ID:
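
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The sketch below is only a sanity check, not part of the record. It assumes the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon; the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 5173712
end_bp = 5175067

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1356
print(protein_length)  # 451
```

Both results agree with the Gene Length and Protein Length fields.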


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.143503
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGGATCG TCTGTGTAGG CGGGGGCCCG GCCGGGCTCT ACTTCGCCAT CTCCGCCAAG 
CGCCGGGACG CCGGCCACGA GATCACCATC ATCGACCGCG ACCCGCCCGG CGCCACCTAC
GGCTGGGGTG TCGTCTACTG GGACAACCTG CTCGACGTCC TGTTCCGCAA CGATCCCGAC
AGCGCGCGGG AGATCCGCGG CGCGTCCACG CTCTGGCAGG AGCAGGACAT CAGCCTGGGC
AGCGAACGGG CGGCGCACTT CGCCGGCTAC GGCTTCAGCG TGCAGCGCGC GGCCCTGCTC
GACATCCTCA CCCGGCGCGC CGAGGAGCTC GGCGTGCGGG TCGAGCACGA CCGCGAGGTC
GCCGACCTCA CCGACCTGGC GGACCTGGGG TGCCTCGGCG GCCTCGGCGC GCACGCCGAC
GCCGACCTTG TGATCGCCGC CGACGGCGCG AACAGCCAGG TGCGCAGCAT GTTCGCGGAC
CGGTTCGGCA CCCGGGTCGA CACCGGTGGC AACCGCTACA TCTGGCTGGG CACGCCCCGG
CGCTTCGAGC GCTTCACGTT CGCGTTCGAG CCGACCCCGG CCGGCTGGGT GTGGTTCCAC
GCCTACCCGT CCGGGGCGGA GGTGAGCACC TGCATCGTCG AGTGCGCGCC GCGGACCTGG
GACGCGCTCG GCCTCGGCAC GGACGACGGC GAGGGACTAC GCCTGCTCGG GAAGATCTTC
GCTGGCCCGC TGGCCGGCGA GGGCCTGATC GACCAGCTGC GCCGGCCCGC CCGCTGGCAG
CGGTTCGGCC AGGTCAGCAA CCGTAGCTGG TACTGGGACA ACATGGTGCT GCTGGGCGAC
GCCGCCCACA CCACCCACTT CACGCTCGGC TCCGGCACGG CCCTGGCCAT GATGGATGGC
GTCATGCTCG CCCAGATGCT CTACGAGCAC GGCGAGGTCC CGGTGGCGCT GGCCGAGTTC
GACCGGGCCG GCCGGGCCGC GCTGGCCCCG CTGCAGGCCC GGGCCCGGAC GAGCATGGCC
TGGTTCGAGC GGATCGACGG CCAGCTCGAC CGGGCCAGGC CGGCCACCGG CGGCGACCCC
GACCCGGTGG CCTTCGCCTA CGCGATGGCC ACCCGGCAGG GCGACCAGCC GCCATGGCGG
TACCAGGCAC ACCGGGCGAT GCAGGTCGGG GCCGTCCGGC GGCTGCGCCG TGAGGTCGAC
TCGTCGGTGC GCTGGTACCT GGCCCGCCGG CGCGGCGAAC CGGCCCGGCC CGCCGGCCGG
CCGGCGCCGG CCGGCACCCC CACGCCCGCA GGCCCGGCCG GCCGGGCCCT GGCGGGCGCG
GGTTCCGGGT CCGCCGCGCA CCGCTCCCGC GGTTAG
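
The GC content field reported for this gene is the share of G and C bases in a sequence like the one above. A minimal sketch of that calculation follows. The function name `gc_content` is hypothetical, and the example call uses only the first row of the gene sequence, so its result is not the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above (60 bases).
first_row = "GTGAGGATCG TCTGTGTAGG CGGGGGCCCG GCCGGGCTCT ACTTCGCCAT CTCCGCCAAG"
print(f"{gc_content(first_row):.1f}%")
```

Passing the full 1356 bp sequence instead of `first_row` would give the value to compare with the GC content field.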
 
Protein sequence
MRIVCVGGGP AGLYFAISAK RRDAGHEITI IDRDPPGATY GWGVVYWDNL LDVLFRNDPD 
SAREIRGAST LWQEQDISLG SERAAHFAGY GFSVQRAALL DILTRRAEEL GVRVEHDREV
ADLTDLADLG CLGGLGAHAD ADLVIAADGA NSQVRSMFAD RFGTRVDTGG NRYIWLGTPR
RFERFTFAFE PTPAGWVWFH AYPSGAEVST CIVECAPRTW DALGLGTDDG EGLRLLGKIF
AGPLAGEGLI DQLRRPARWQ RFGQVSNRSW YWDNMVLLGD AAHTTHFTLG SGTALAMMDG
VMLAQMLYEH GEVPVALAEF DRAGRAALAP LQARARTSMA WFERIDGQLD RARPATGGDP
DPVAFAYAMA TRQGDQPPWR YQAHRAMQVG AVRRLRREVD SSVRWYLARR RGEPARPAGR
PAPAGTPTPA GPAGRALAGA GSGSAAHRSR G
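
The protein sequence above can be reproduced from the gene sequence with translation table 11. The sketch below is an illustration under stated assumptions, not the database's own pipeline. It uses the NCBI ordering of the codon table (bases in the order T, C, A, G), whose amino-acid assignments for table 11 match the standard code, and it assumes the first codon is one of the table 11 start codons and is read as methionine. The names `CODON_TABLE`, `START_CODONS` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(seq: str) -> str:
    """Translate a coding sequence, reading the start codon as M."""
    bases = "".join(seq.split()).upper()
    if bases[:3] not in START_CODONS:
        raise ValueError("sequence does not begin with a table 11 start codon")
    protein = ["M"]
    for pos in range(3, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence (60 bases, 20 codons).
first_row = "GTGAGGATCG TCTGTGTAGG CGGGGGCCCG GCCGGGCTCT ACTTCGCCAT CTCCGCCAAG"
print(translate(first_row))  # MRIVCVGGGPAGLYFAISAK
```

The output matches the first two blocks of the protein sequence. The leading GTG would normally encode valine, which is why the start codon is handled separately.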