Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6468 |
Symbol | |
ID | 5674783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7862520 |
End bp | 7863590 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641245316 |
Product | luciferase family protein |
Protein accession | YP_001510711 |
Protein GI | 158318203 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03558] luciferase family oxidoreductase, group 1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0486136 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCTGA CGAGACCGGT GCCGCTGTCC GTCCTCGACC TGGTGCCGGT GGGCACGGGT GTCTCCGCCG GCGAGGCCGT GCGGGGCAGC CTTGAGCTGG CCCGGCGGGC CGAGGATCTC GGCTTCACCC GCTACTGGCT GGCCGAGCAC CACGGCATGC CGGGCATCGC CAGTTCCGCG ACGACCCTCC TGATGGGCCA GGTGGCCGCG GCGACGTCCA CCATCAGGGT TGGCTCGGGC GGGGTGATGC TGCCCAACCA TGCCCCGCTC ACGATCGCCG AGCAGTTCGG GACCCTGGGC GCGTTCTTCC CCGGCCGCAT CGACCTCGGC ATCGGCCGTG CGCCCGGCAC CGACCAGAAC ACCGCGCGTG CCCTGCGCCG CGGCCCGGGG CCGATGTCCG CGGACGACTT CCCCGACCAG CTCGCCGAGC TGACGACCTT CCTCTCCGGT GGGTCGTTCC CCGCCGGGCA CCCGCTGCGC GGGGTCGAGG CGTACCCGCG GGCGGACACA CCGCCGATCT GGCTGCTCGG CTCGAGCGGC TACAGCGCCC AGGTCGCCGG GATCCTCGGC CTGCCGTTCG CGTTCGCGCA CCATTTCAGC TCCGTCAGCA CGCTGCCCGC GCTGCGGCTG TACCGCGACT CGTTCCGGCC GTCGCCCGGA CGGGAACAGC CGTACGCGAT GGTCGCCGCC GGTGTCCTGT GCGCCGAGGA GGACGAGCGC GCCCAGTGGC TCGCCGGGCC CACCCGCCTG ACGATGCTGC GCCTGCGCCT GGGACGGCCC GGCGCCTACC CCACGCCCGA GGAGGCCGCG GCCTACCCCT GGACGCCGCA GGAGCGCGAT CTGGCGGAGT CCGCGACGTC CTCGCACATC GTGGGCGGAC CGGAAACCGT GCGGGACGGC CTGAACGCGC TACTCGAGGC GACCGGGGCC GACGAGCTGA TGATCGTCAC GAACGTGCAC TCCAACGCCG ACCGGATCCG CTCCTACGAG CTCGTCGCCC GGCTGGCCCT GGACGAACCG GGGGTCGGGC AGGTCAGGGA CGCCCGCGCG GCCGGAGTGT CCGGGCCGTG A
|
Protein sequence | MSLTRPVPLS VLDLVPVGTG VSAGEAVRGS LELARRAEDL GFTRYWLAEH HGMPGIASSA TTLLMGQVAA ATSTIRVGSG GVMLPNHAPL TIAEQFGTLG AFFPGRIDLG IGRAPGTDQN TARALRRGPG PMSADDFPDQ LAELTTFLSG GSFPAGHPLR GVEAYPRADT PPIWLLGSSG YSAQVAGILG LPFAFAHHFS SVSTLPALRL YRDSFRPSPG REQPYAMVAA GVLCAEEDER AQWLAGPTRL TMLRLRLGRP GAYPTPEEAA AYPWTPQERD LAESATSSHI VGGPETVRDG LNALLEATGA DELMIVTNVH SNADRIRSYE LVARLALDEP GVGQVRDARA AGVSGP
|
| |