Gene Information |
Locus tag | Franean1_4154 |
Symbol | |
ID | 5672509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4934662 |
End bp | 4935567 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641243027 |
Product | hypothetical protein |
Protein accession | YP_001508444 |
Protein GI | 158315936 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03620] probable F420-dependent oxidoreductase, MSMEG_4141 family |
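The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4934662, 4935567)
print(length, protein_length(length))  # prints: 906 301
```

Under these assumptions the result agrees with the Gene Length (906 bp) and Protein Length (301 aa) fields of this record.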
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.238248 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.605436 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGGTAG CGGAGACCCG GCGGCGGCTG GGCAGGTTCG GGGTGTGGGT GGCCCCGTTC TCCTTGCTCG AGACGTCGGT GGCGGTGCAG CGCAGACAGT TCGCCCGGAT CGAACATCTC GGGTACGGCT CCCTGTGGAG TGGGGAGACG CCGCCGGGTG CGCCGGTCGG GGGCCGGGAG GTGTTCACCC AGCACGGGTT GATGCTCGCC GCGACCGAGC GGATCGTCGT CGGTACGGGC ATCGCGAACA TCAGCACCCG CACGGCGGGC GCGATGCACA CCGGTGCCGC GACGCTGGCC GAGGCGTATC CCGGCCGGTT CGTGCTCGGT CTGGGCGGCC AGTCCGGTGA CCGGCCCCTC ACCCGTCTAC GGGAGTATCT CGACGCGATG GACCACGCCG CGCGGGCGCT GGCGCAGCTG CCGGCTCCGG CCTATCCGCG TGTTCTCGCC GCGCTCGGGC CACGCGCCCA CGGGCTGGCT TCCGATCGCG CCGACGGCGT GCACCCGTTC CTGCAACCGG TGGCGCACAC GGCGGCGGCC AGGGCGGCCG TGGGCCCGGA CCGTCTGGTG ATCCCCCACC AGGCCGTCGT ACTCGAAACG GACGCGGACG CGGCCCGGGC GCGGCTGCGG GCGATTTTCG CTCTGGGGGT GGGCGCCTCG GCCTCGCCTT ACACCGCGCA CTACCGGCGG CTCGGCTACA GCGAGGCGGA CCTGGCCGGG CAGCGCAGTG ACCGGCTGGT CGACGACGTC CTGGCCTGGG GCGACGAGGC TGCTGTCGCG GCCCGGCTGA TCGCGCATCT CGACGCCGGT GCCGATCATG TGCTTGTGCA CCCGTTCGCG GCGGACCTTC CCGCGGCGGT CGACCAGCTC GAACGGCTCG CTCCCCTGTT GCGCGACGCA GCCTGA
|
Protein sequence | MAVAETRRRL GRFGVWVAPF SLLETSVAVQ RRQFARIEHL GYGSLWSGET PPGAPVGGRE VFTQHGLMLA ATERIVVGTG IANISTRTAG AMHTGAATLA EAYPGRFVLG LGGQSGDRPL TRLREYLDAM DHAARALAQL PAPAYPRVLA ALGPRAHGLA SDRADGVHPF LQPVAHTAAA RAAVGPDRLV IPHQAVVLET DADAARARLR AIFALGVGAS ASPYTAHYRR LGYSEADLAG QRSDRLVDDV LAWGDEAAVA ARLIAHLDAG ADHVLVHPFA ADLPAAVDQL ERLAPLLRDA A
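The sequence fields are printed in space-separated blocks of 10 characters. A minimal sketch of how they might be parsed and checked against the GC content and protein fields is given below. It assumes that translation table 11 uses the same codon-to-residue assignments as the standard code and that the first codon (here GTG) is read as methionine when it acts as a start codon. The helper names `clean_sequence`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 shares these assignments and differs only in start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon.

    The first codon is emitted as 'M' regardless of its usual meaning,
    reflecting its role as the start codon.
    """
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino = CODON_TABLE[seq[i:i + 3]]
        if amino == "*":
            break
        residues.append("M" if i == 0 else amino)
    return "".join(residues)


# First 30 bases of the gene sequence from this record.
fragment = clean_sequence("GTGGCGGTAG CGGAGACCCG GCGGCGGCTG")
print(translate(fragment))  # prints: MAVAETRRRL
```

The fragment reproduces the first block of the Protein sequence field. Passing the full Gene sequence value through `gc_content` and `translate` gives values that can be compared with the GC content and Protein sequence fields.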
|
| |