Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5777 |
Symbol | |
ID | 5674102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7015549 |
End bp | 7016529 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641244628 |
Product | luciferase family protein |
Protein accession | YP_001510031 |
Protein GI | 158317523 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACGCCA CCGCTGAACC CGTCCGCGTT GGCGTACTGT TGCCGACCCG GGAGCAGGCG GTCACCGGCA GGTTCGCGGC GGAGCCGTTG CTGGCGTTCG CCGGGAAGGC GGAGGCGCTC GGTTTCGACT CAGTGTGGGT CGGCGACTCG CTGACCGCCA GGCCGCGGTT CGACCCGTTC GTGGTGCTCG CCGCGGTCGC GGCGGTGACC AGCCGAGTAA CGCTGGGCAC CGCGGCACTG ACACCGGTGC TGCGCCACCC GCTGATCGTG GCGAACCTGA TCGCCAGCCT CGACCACGTG TCCGCGTCCC GCCTGGTGCT GGGACTCGGC GCCGGCTTCC CGGTCCCCGA GACGGAGGCC GAGTTCGCCG CCGTCGGCGC GTCCTTCGGG CAGCGGGTGG GCAGGCTGGA CGAGTCGGTT GCACTGTGGC GGCAGGCCTG GCGGACCGAC GGCGCCGTCG CGGCGGAGTT CACCGGACGG TACTGGCAGG TGGCCGGTCT CGACCGGCTG CCGCCGCCGC ACCGGCCCGG GGGACCGCCA CTGTGGCTGG CCGGCAGCGA CACCCCGGCG GTGACGGCAC GGGTGGCGCG GTACTACGAC GGCTGGCTTC CCTTCCTGCC CACCGCGCAG GCCTACGACC AGGCATGGCG CGCTGTCGTC GCCGCTTGCC AGGAGGCGGG CCGGGCGCCG GGGGCCGTCA CGGCGGGCCT ATACGCCACG GTCACCGTCG ACGAGGACCG GGACCGGGCC AAGGCGGAGC TCGCTGACTA CGTCGGTCGC TACTACGGCC GACCGCTCGA GCAGATGGCC GCGTTGCAGG CGTACGGCTG GGGGAGCGCT GAGGAGTGCG CGCGGTGGCT CGTGGGCTAC GTCCGGGCCG GTGCCCGGCA CGTGGTCGTC CGGATCGGTT CGCTCGACCC GGCCAGCCAG CTCGAACTGA TCGCGCGCGA CGTCCTGCCG GCGGTACGCG CCGCGGGATG A
|
Protein sequence | MDATAEPVRV GVLLPTREQA VTGRFAAEPL LAFAGKAEAL GFDSVWVGDS LTARPRFDPF VVLAAVAAVT SRVTLGTAAL TPVLRHPLIV ANLIASLDHV SASRLVLGLG AGFPVPETEA EFAAVGASFG QRVGRLDESV ALWRQAWRTD GAVAAEFTGR YWQVAGLDRL PPPHRPGGPP LWLAGSDTPA VTARVARYYD GWLPFLPTAQ AYDQAWRAVV AACQEAGRAP GAVTAGLYAT VTVDEDRDRA KAELADYVGR YYGRPLEQMA ALQAYGWGSA EECARWLVGY VRAGARHVVV RIGSLDPASQ LELIARDVLP AVRAAG
|
| |