Gene Franean1_2015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2015 
Symbol 
ID5670416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2420609 
End bp2421784 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content75% 
IMG OID641240936 
Productluciferase family protein 
Protein accessionYP_001506358 
Protein GI158313850 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03558] luciferase family oxidoreductase, group 1 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.878231 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.162196 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCCAG CAGGCTCAGT CCCGCTGTCG GTGCTCGACC TGGCCCCCCT CCCCTCGGGA 
TCCTCGGCGG CCGACGCGCT GCGCAACACG CTCGATCTGG CCCGGCAGGC CGAGCGGTTC
GGCTACGCCC GCTACTGGCT CGCCGAGCAT CACCTGACCC CGGGGGTCGC CTCCGCCGCA
CCCGCCGTTC TGATCGCCCT GGTCGCAGCG GCGACGGAGA GGATCAGGGT CGGCTCCGGG
GCGGTCCAGA CCGGGCATGA CACGGCGGTG GTCGTCGCCG AGCAGTTCGG CACGATCGCG
CACCTTCACC CCGGGCGGGT CGACCTGGGG CTTGGTCGAT CCAACCTGGG CCGCTTGGTC
GACCGGGCCG CCAAGCCCAC GCCGAACGGC ACAGCGCAGC CCCTGAACGG CACAGCGCAG
CCGCCGCCTC GACCCTCGGC GCGGGTGGTC GACGGCCTGC TCCTCCCTGA GCCGCCGGCG
GTCACCTTCG ACGTGGAGCG CCTCGGCCGG CAGTTGCGGC TGGTCGGTTT CCGGGACGGA
CCCGAGGAGG ACTACACCGC GCTGGTCCGC GACATCCAGG CGTTCGTCGG TGGTACCTAC
CGGGCGCCTG ACGGTACGGC GCTGTCCGCG CCGGCCGCGG AGGGCGCAGA TCTGGAGATC
TGGATCCTGG CGGCGACCGC GGGCGGGAGC GCGCTCACGG CCGCGGCCCT CGGCCTGCCG
CTCGGCGCGA ATTACCACAT CGTCCCGTCC ACGGTGCTCG AGACGATCGC GGCGTACCGG
GCCGCGTTCC GGCCCGGCGT GCTGAGCGAG CCGCGGGTGA TGGTCTCCGC CGACGTCGTC
GTCGCGCCGG ACGACGAGAC CGCCCGCCGG CTCGCCAGCG GCTACGGCCC GTGGGTGGCC
AGCATCCGCG CGGGCGGTGG CGCCATCCCG TACCCGAGCC CCGGGGAGGT GGCGGCGCGC
GGGCTGAGCG ACGCCGAGCG GGCGCTGGTC GCCGACCGGG TGGACACCCA GTTCGTCGGC
TCGCCCGAGA CGGTCGTGCG GGGGCTGCGG GTCCTGCGGG ACGCCACCGG CGCCGACGAG
CTGCTGATCA CCACGATCAC CCACGATCAC GCCGATCGCG TGCGTTCCTA CGAGCTGCTC
GCCGAGGCCT GGTCCCGGTT GAGCGCGGTG GGCTGA
 
Protein sequence
MPPAGSVPLS VLDLAPLPSG SSAADALRNT LDLARQAERF GYARYWLAEH HLTPGVASAA 
PAVLIALVAA ATERIRVGSG AVQTGHDTAV VVAEQFGTIA HLHPGRVDLG LGRSNLGRLV
DRAAKPTPNG TAQPLNGTAQ PPPRPSARVV DGLLLPEPPA VTFDVERLGR QLRLVGFRDG
PEEDYTALVR DIQAFVGGTY RAPDGTALSA PAAEGADLEI WILAATAGGS ALTAAALGLP
LGANYHIVPS TVLETIAAYR AAFRPGVLSE PRVMVSADVV VAPDDETARR LASGYGPWVA
SIRAGGGAIP YPSPGEVAAR GLSDAERALV ADRVDTQFVG SPETVVRGLR VLRDATGADE
LLITTITHDH ADRVRSYELL AEAWSRLSAV G