Gene Franean1_4308 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4308 
Symbol 
ID5672663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5146673 
End bp5147629 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content74% 
IMG OID641243181 
Productluciferase family protein 
Protein accessionYP_001508598 
Protein GI158316090 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03564] F420-dependent oxidoreductase, MSMEG_4879 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.502996 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGGTCG GCGTGATGAT CGGACCGGAG CGGGGTGACT CCGCGCGCAA GGTCGGGCGG 
ATGATCGACG ACGTGTTGTG GGCCGAGAGC GCGGGGATGG ACACCGCCTG GATCCCCCAG
GTGCCCTCGG ACTTCGACGC CCTGATCGCC GTCTCGCTGA TGGGCGCGCG CACGGAGCGG
ATCGAGCTGG GCACGGCAGT CGTCCCGCTG CAGGCCCAGC ACCCGGTGGC GCTGGCGCGC
CAGACGCTGT CAGCGCAGGC GGCGACCAAC GGGCGGCTGG CGCTGGGCGT CGGGCCGTCG
CACCACTGGA TCGTGCGGGA CATGCTCGGC CTGCCCTACG ACAAGCCGGC GGCGTTCACC
CGCGACTACC TCGAGGTCCT CAACGTCGCA CTGCACGGGC CCGGCCCGGT GGACGTCGAG
AACGACACCT TCCGGGTGCA CAACCCGCTC GAGATCGGCC CGATCGCCCC GCTGCCCGTG
TTCATCGCCG CGCTCGGCCC GGTGATGCTG CGCATCGCCG GCGAGCACGC CGACGGGACG
GTGCTGTGGC TGGCCGACGA GCGCGCGGTC GCCGACCACG TGGCGCCGCG GATCACCAAG
GCCGCCCAGG AGGCGGGCCG CCCGGCGCCG CGGATCGTGG CGGGCATCCC GGTCTGCCTG
TGCGCGCCGG CCGATGTCGA CAAGGCCCGG GAGCGGGCGA ACCGCATCCT CGGCGAGGCC
GAGGTCTCCC CGAACTACCA GCGGCTGCTC GACCAGGGCG ACGCCACGAG CGTCGGCGAC
CTGTGCGCGG CCGGCGACGA GGCGGCGATC CTGGCCCGGT TCCGGCAGTT CGCCGACGCG
GGCGTCACCG ACCTGTCGGT GCGGCTGCTG CCCATCGGCG ACAACCGGGA CGAGCTGGTC
GCCTCCAAGC GCCGCACCCG GGAAGTGATC GCCGCCCTCG CGGCGGAAGT GAGATGA
 
Protein sequence
MRVGVMIGPE RGDSARKVGR MIDDVLWAES AGMDTAWIPQ VPSDFDALIA VSLMGARTER 
IELGTAVVPL QAQHPVALAR QTLSAQAATN GRLALGVGPS HHWIVRDMLG LPYDKPAAFT
RDYLEVLNVA LHGPGPVDVE NDTFRVHNPL EIGPIAPLPV FIAALGPVML RIAGEHADGT
VLWLADERAV ADHVAPRITK AAQEAGRPAP RIVAGIPVCL CAPADVDKAR ERANRILGEA
EVSPNYQRLL DQGDATSVGD LCAAGDEAAI LARFRQFADA GVTDLSVRLL PIGDNRDELV
ASKRRTREVI AALAAEVR