Gene Information |
Locus tag | Franean1_4624 |
Symbol | |
ID | 5672969 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5514695 |
End bp | 5515615 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641243485 |
Product | luciferase family protein |
Protein accession | YP_001508901 |
Protein GI | 158316393 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
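The coordinate and length fields above are arithmetically linked, and a short check makes that explicit. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed 921 bp), and that the CDS is unspliced and ends in a stop codon, as the record indicates. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(5514695, 5515615)
print(length)                  # 921
print(protein_length(length))  # 306
```

The strand field does not enter this calculation: the length is the same whether the gene lies on the forward or the reverse strand.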
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAACTGG GATTTGCTAT GCCGCATCTG CTGAGACTGA AGGCCACATG TCAACCGTGG GAAGCTAAAG TGACGGGTGC GGACCAGACG CGTCTCGCCA AGTGGGCCGA GAAGCTTGGC TACGCCATGA TAAGCGTGCC CGAACACCAC ATCATTCCGA AGACCCATGT CGATCTTTCG GGGCCGCACT ACTTAAGTGC GTACCCGACC ATGGCCTATC TGGCCGGGGC CACGGAAAAG ATACGAGTTA ACTCGTGCAT CGCGCTCCTG CCGTTACAGC ATCCCGCCAT CACCGCCAAG GCTCTCTCGA GCATCGATTG GCTATCGAGC GGCCGCGTCT CCGTCACGTT CGGGGTGGGC TGGCTGGAAG AGGAGTTTGA AACTCTAGGC GTTCCCTTCC GTGAACGCGG AGCGATGAGC GAGGAGTACA TTCAGGCGAT CATCGAGCTC TGGACCAAGG AAGAACCGGC GTTCGAAGGA AAGTATGTCT CTTTCCGGGA CGTCGCGTTC GAGCCCAAGC CCGTCCAGAA ACCACACCCA CCGGTGTGGT TCGGTGGTGA CGCCGATGCC GTGCTGAGGC GCACCGCCCG CTACGCTTCG GGCTGGTGGC CATTCCTCAC CAAACCCGAG GACATCCCCG CGAAGATCGA CTTTGTCAAG TCGCAGCCCG ACTACAACGG CAAGCTTACT GATGTGTTCT ACGGCTTCGC CACCACGCGA GTCGGTGACG GTCATGTTAT ACAGAAAGAC CCACGCGCTC GGGCAGGCAT GACCAAACAG GAGATCATAG ACCGGCTCTG CTGGTTCAAG GAGCTGGGCG TGACGATGAG TTCAGTGCCG ATCCCCAGCG TCAATCACCT CGAGGACTAC CTCGACTACA CCCAATGGGT GGCGGAAGAG ATCATGCCCG TGGTTGCGTA G
|
Protein sequence | MKLGFAMPHL LRLKATCQPW EAKVTGADQT RLAKWAEKLG YAMISVPEHH IIPKTHVDLS GPHYLSAYPT MAYLAGATEK IRVNSCIALL PLQHPAITAK ALSSIDWLSS GRVSVTFGVG WLEEEFETLG VPFRERGAMS EEYIQAIIEL WTKEEPAFEG KYVSFRDVAF EPKPVQKPHP PVWFGGDADA VLRRTARYAS GWWPFLTKPE DIPAKIDFVK SQPDYNGKLT DVFYGFATTR VGDGHVIQKD PRARAGMTKQ EIIDRLCWFK ELGVTMSSVP IPSVNHLEDY LDYTQWVAEE IMPVVA
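The gene sequence begins with GTG while the protein begins with M. Under translation table 11, GTG is an accepted initiation codon and is read as methionine when it starts a CDS. The following is a minimal sketch of that rule, not the pipeline used to produce this record. The helper names (`clean`, `gc_fraction`, `translate_cds`) are hypothetical, `START_CODONS` holds only a subset of the table 11 initiators, and the test fragment is the first seven codons of the gene sequence above with the gene's own TAG stop codon appended for illustration.

```python
# Codon table built from the standard amino-acid string (base order T, C, A, G).
# Table 11 uses the same codon-to-residue assignments; it differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption for this sketch: a subset of the table 11 initiation codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and normalise case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a coding-strand CDS, reading an initial start codon as M."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        residues.append(amino_acid)
    return "".join(residues)


fragment = "GTGAAACTGG GATTTGCTAT GTAG"
print(translate_cds(fragment))  # MKLGFAM
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow a comparison against the listed protein sequence and GC content. The sequence is treated as already given on the coding strand, which its GTG start and TAG stop suggest even though the strand field is "-".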