Gene Franean1_2017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2017 
Symbol 
ID5670418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2423015 
End bp2424382 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content72% 
IMG OID641240938 
Productputative FMNH2-utilizing oxygenase 
Protein accessionYP_001506360 
Protein GI158313852 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0632006 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.1646 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCC CGACCAGGCA GATCCACCTC GCCGCGCACT TCCCGGGCGT GAACAACACG 
ACCGTGTGGA GCGACCCGCG CTCGGGCAGC CACATCGCGT TCGAGTCGTT CGTCCATTTC
GCCCGGACGG CGGAACGGGC GAAGTTCGAC TTCCTGTTCC TGGCGGAGGG GCTGCGGCTG
AGGGAGCAAC GCGGGCGCAT CCATGACCTG GACGTCGTCG GGCGTCCAGA CACGTTCACC
GTGCTCGCGG CGCTGGCGGC GGTGACCGAC CGGTTGGGCC TGGCCGGGAC CATCAACTCG
ACGTTCAACG AGCCGTACGA GGTGGCCCGC CAGTTCGCCA GTCTCGACCA TCTCTCCGAC
GGGCGCGCCG CGTGGAACGT CGTCACCTCC TGGGACGCGT TCACCGGGGA GAACTTCCGC
CGCGGTGGCT TCCTGGCCGA GGAGCAGCGC TACGAGCGCG CCGAGCTGTT CCTGCGGACG
GCCAGCGAGC TGTTCGACTC CTGGCGGGGG GACGAGATCG TCGCGGACAA GGAGTCCGGC
GTCTTCCTGG CCGATGCCAA GGCGGGGGCG TTCGAGCACC ACGACGCCCA CTTCGACATC
AGCGGGCAGT TCACGGTGCC GCGCAGCCCG CAGGGCCGGC CGGTGATCTT CCAGGCCGGT
GACTCCGACG CGGGCCGGGA GTTCGCCGCC CGGTCCGCCG ACGCGATCTT CAGCCGGCAC
AGCACGTTCG ACGCCGGGCA GGCGTTCCAC GCGGACGTCA AGCGCCGCCT CGCCCGCTAC
GGCCGTGCGC CCGAGGATCT CCTCGTCCTG CCGGCGGCGA CGTTCGTCCT CGGCGACACC
GACGCGCAGG CGCGGGAGCG GGCCGAGGAG GTCCGCCGCC AGCAGGTCAG CGGCGCCACC
GCGATCCAGT TCCTCGAGCA GGTGTGGAAC CGCGACCTCG GTGACCACGA TCCGGACGGG
CCGCTGCCCG AGGTCGACCC CGTACCCGGG GAGAACACCG TCGCCCAGGG CAGGGCGAGC
GTGCGGATGT ACGAGGACCG GCTGGCCACC GCCCGCCGCT GGCGCGAGAT CGCCGAGGCG
GGGAAGCTCA CCACCCGCGA GCTGGTCATT GAGGTCAGCG GGCGGCAGGC GTTCGTCGGC
AGCCCGGCGA CCGTCGCGGA CACGATCAAC CGGTTCGTAC AGGCCCGCGC GGCGGACGGG
TTCATCCTCG TCCCGCACAT AACCCCCGCC GGCCTGGACG AGTTCGCGGA CACGGTCGTC
CCGCTGCTCC AGGAGCGCGG CGTGTTCCGG GCCGACTACA CGGGGACGAC GCTGCGCGAC
CATCTCGGCC TCGCCCCGGT GCCAGGATGG CCGGTTCACG CGGTCTGA
 
Protein sequence
MSSPTRQIHL AAHFPGVNNT TVWSDPRSGS HIAFESFVHF ARTAERAKFD FLFLAEGLRL 
REQRGRIHDL DVVGRPDTFT VLAALAAVTD RLGLAGTINS TFNEPYEVAR QFASLDHLSD
GRAAWNVVTS WDAFTGENFR RGGFLAEEQR YERAELFLRT ASELFDSWRG DEIVADKESG
VFLADAKAGA FEHHDAHFDI SGQFTVPRSP QGRPVIFQAG DSDAGREFAA RSADAIFSRH
STFDAGQAFH ADVKRRLARY GRAPEDLLVL PAATFVLGDT DAQARERAEE VRRQQVSGAT
AIQFLEQVWN RDLGDHDPDG PLPEVDPVPG ENTVAQGRAS VRMYEDRLAT ARRWREIAEA
GKLTTRELVI EVSGRQAFVG SPATVADTIN RFVQARAADG FILVPHITPA GLDEFADTVV
PLLQERGVFR ADYTGTTLRD HLGLAPVPGW PVHAV