Gene Franean1_3309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3309 
Symbol 
ID5671681 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3920887 
End bp3921843 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content71% 
IMG OID641242198 
Productluciferase family protein 
Protein accessionYP_001507618 
Protein GI158315110 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03564] F420-dependent oxidoreductase, MSMEG_4879 family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.243042 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAATCG GCCTGATGAT CGGCTCCGAC AGGGACCGGC AGTACCGGGA GCGGGTGCGC 
GGTTTCATCG CCGACGCCCA GAGCGCGGAG GAGGCGGGCT TCGCCTCGAT GTGGGTACCG
CAGATTCCCG GTTACTTCGA CGCGCTGACC GTGGTCACGT TGATGGGCCA GGCGACGCGT
CGCATCGAGC TGGGCACCGC GGTCGTGCCG GTCCAGACCC GCCATCCGGT CGTCATGGCC
CAGCAGGTGC TGTCCACCCA GGCGGTCTGC GAGGGGCGTT TCACGCTGGG GATCGGGCCG
TCACACCACT GGATCATCGA GGACCAGCTC GGCCTCTCCT ACGAACGCCC AGCCCACCTG
GTGCGGAACT ACCTCCAGGT GCTGAACGCG GCGTTCGCCG GGCCGGGGCG GATCGAGGTC
GACAACGACA CCTACCGGGT ACACAGCCCG CTCGACGTCA CCGACCTCGT CCCGACGCCG
ATCCTGATCG CGGCGCTGGC ACCGGTCATG CTCCGCATCT CCGGCGAGCA GACCTCCGGA
ACCATTCTCT GGATGGCGGA CGAGCGGGCC ATCGGCGACT ATGTGGTCCC GCGCATCACG
AAAGCCGCGG CCGACGCCGG GCGACCGGCA CCGCGCATCG TCGCCGGGGT TCCGGTCGCC
CTGTGCCCCG CCGGCGAGGT CGACGCCGCG CGCTCCGCGG CGAACGACGT CCTCGGGCAC
GCCGACTACT CGCCCAACTA CAAGCGGCTC CTCGGCCACG GGGATGCCAG GGACGTCGGC
GACGTGATGG CCGTCGGCGA CGAATCCGCC ATCGTGGAAC GCCTGCGCGG TTTCCGCGAC
GCGGGAGTCA CCGACCTCGC GGCCCGCGTG GTCGCGCTCG GAGCCGACCG CGCCGAGCGC
GTCGAGTCCC AGCGGCGGAC ACAGGAGTTC CTCGCGACGC TCTGCCCCCG GATGTGA
 
Protein sequence
MRIGLMIGSD RDRQYRERVR GFIADAQSAE EAGFASMWVP QIPGYFDALT VVTLMGQATR 
RIELGTAVVP VQTRHPVVMA QQVLSTQAVC EGRFTLGIGP SHHWIIEDQL GLSYERPAHL
VRNYLQVLNA AFAGPGRIEV DNDTYRVHSP LDVTDLVPTP ILIAALAPVM LRISGEQTSG
TILWMADERA IGDYVVPRIT KAAADAGRPA PRIVAGVPVA LCPAGEVDAA RSAANDVLGH
ADYSPNYKRL LGHGDARDVG DVMAVGDESA IVERLRGFRD AGVTDLAARV VALGADRAER
VESQRRTQEF LATLCPRM