Gene Information |
Locus tag | Franean1_3644 |
Symbol | |
ID | 5672011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4317731 |
End bp | 4318654 |
Gene Length | 924 bp |
Protein Length | 307 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641242528 |
Product | hypothetical protein |
Protein accession | YP_001507948 |
Protein GI | 158315440 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03620] probable F420-dependent oxidoreductase, MSMEG_4141 family |
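The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a reading frame that ends in a single stop codon. The helper names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not contribute an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(4317731, 4318654)
print(length)                  # 924
print(protein_length(length))  # 307
```

With the values from this record, the 924 bp gene corresponds to 308 codons, of which 307 encode amino acids and one is the stop codon.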
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.150808 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACAGAGG ACCTGGGACA CCCGCTCGGC CACGTCGGCG TCTGGACGTT CGCCTTCGAC GGCCAGCCCT CCAGCCGGGT GCGGGAGGCC GCCGCGGAGA TCGAGGAGCT GGGCTACGGC GCGATCTGGT ACGGCGAGGG ATTCGGCCGG GACACGGTGA GCCAGGCGTG GCTGCTGCTG TCGGCCACCC GGCGGGTCAC CGTCGCGTCC GGCATCGCGA ACATCGCGTT CCGTGACCCG ATCGCCCTGG CGGGCGCGGC CCGCGCGCTC GGCGAGACGT TCCCGGGGCG CTACCTTCTC GGCCTCGGAG GACACCGGGT TGACGACACC ACCCACCTGC TGGACGGATA CCCCGTGCCG GGCCTGGGCC GAGCCGTGTC GACCATGGGT GCCTACCTGG CGGCCATGGA CGCCGTCCCC GCCCACAGCC CGTCGCCGCG GCCGGCCGTG CGCCGGGTAC TGGCCGCGCT CGGTCCGAAG ATGCTGTCGC TGGCGGCCGA ACGGACATGG GGGGCGCACC CGTACTTCGT GCCCGTCGAA CACACCCGGC GGGCCCGCGA GATCATGGGT CCCGAGGCGT TCCTCGGGGT CGAGCAGGCC GTCGTCCTGG ACACCGATGT CGACCGGGCC CGCCAGGTCG CCGCGGCGCA CGTCGCCGGC TACGTCTCCG CGGCACCCCA CCAGGAGGCG AACATGCGGC GGTTGGGCTT CGGTGATGAC GACCTCGTCG GCGGCCCGAG CCGCTGGCTG GTCGACGCGA TCGTCGCCTA CGGCGACGCC GGGACAATCC GGGACCGGGT ACGCCAGCAC CTCCACGCCG GCGCCAACCA CGTCTGTGTC CAGGTCCTCA CCGTCGACCC GACCGTGCTG CCAGACCGTG AATGGCGTGA GCTCGCCCCG GCCCTGCGCG ACGCCGCCCC ATGA
Protein sequence | MTEDLGHPLG HVGVWTFAFD GQPSSRVREA AAEIEELGYG AIWYGEGFGR DTVSQAWLLL SATRRVTVAS GIANIAFRDP IALAGAARAL GETFPGRYLL GLGGHRVDDT THLLDGYPVP GLGRAVSTMG AYLAAMDAVP AHSPSPRPAV RRVLAALGPK MLSLAAERTW GAHPYFVPVE HTRRAREIMG PEAFLGVEQA VVLDTDVDRA RQVAAAHVAG YVSAAPHQEA NMRRLGFGDD DLVGGPSRWL VDAIVAYGDA GTIRDRVRQH LHAGANHVCV QVLTVDPTVL PDREWRELAP ALRDAAP
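The relationship between the gene sequence, the GC content field, and the protein sequence can be reproduced with a short script. This is a minimal sketch under stated assumptions, not the pipeline that produced this record. It assumes the sequence is given in space-separated blocks as shown above. It also assumes that translation table 11 assigns the same amino acids as the standard code, and that an alternative start codon such as GTG is read as methionine at the first position. The function names and the `START_CODONS` set (limited here to three common bacterial starts) are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same amino acid
# assignments and differs only in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the start codons permitted in bacteria.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiator codon is read as methionine regardless of its usual meaning.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence from this record.
prefix = "GTGACAGAGG ACCTGGGACA"
print(translate(prefix))  # MTEDLG
```

Passing the full gene sequence string to `gc_content` and `translate` allows the reported GC content and protein sequence to be compared against the computed values.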