Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4963 |
Symbol | |
ID | 5673302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5955548 |
End bp | 5956534 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641243817 |
Product | aldo/keto reductase |
Protein accession | YP_001509233 |
Protein GI | 158316725 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0173735 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.596284 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACG TCCCCACCCG GCACCTAGGC GAACTGGCGG TTTCCGCGCA GGGCCTGGGG TGCATGGGCA TGAGCCACGG CTACGGCGCC GCGGACGACG CGCAGTCGAT CGCGACGCTG CACCACGCCC TCGCCCTCGG GGTGACCTTC CTGGACACCT CCGACTTCTA CGGCGACGGG CACAACGAGG AGCTGATCGG ACGGGCCATC GCCGGGCGCC GCGACGAGGT GGTGCTGGCC ACGAAGTTCG GCTTCGCCAA CCGCTTGGGC GAGCCCACCC GGATCCGCGG CGACGCCGCC TACGTGCGGC AGGCGTGCGA GGCGTCGCTG CGCCGGCTCG GGGTCGACCA TATCGACCTC TACTACCAGC ACCGGGTCGA TCCGCAGGTG CCGATCGAGG AGACCGTCGG CGCCATGGCC GAGCTGGTGC GGGCCGGGAA GGTCCGCCAC CTCGGGCTGT CCGAGGCCGG CGTGCGAACC ATCCGGCGGG CGCACGCGGT GCACCCGATC GCCGCGCTGC AGAGCGAGTG GTCGCTGTGG ACCCGCGACC TGGAGGCGGA GATCGTGCCG GTCTGCCGCG ATCTTGGTAT CGGCCTGGTC CCGTTCTCCC CGCTGGGGCG CGGCTTCCTG ACCGGTCGGT ACAGCTCGGT CGAGGGGCTG GCGGAGACCG ACGTGCGGCG CGGCCAGCCG CGTTTCGCCG ACGGCAACCT CGAACGGAAC CTGGCGATCG TCGCGAAGCT GAACGAGCTG GCTGCGGCGA AGGGAGTCAC CGCCGGCCAG CTCGCCCTGG CCTGGGTGCA GCACCGGGGC GACGACGTGG TGCCGATCCC CGGCACTCGG CGGCAGCGGT ACCTGGAGGA GAACCTCGCG GCCCTGGCCG TCGAGCTGTC CACCGAGGAC CTCGCCGCCA TCGAGGCCGC CGCTCCGCCC GAGCAGGTCG CGGGCACCCG CTACGACGCG ACCAGCCTCA CCTTCGTCAA CGGCTGA
|
Protein sequence | MTDVPTRHLG ELAVSAQGLG CMGMSHGYGA ADDAQSIATL HHALALGVTF LDTSDFYGDG HNEELIGRAI AGRRDEVVLA TKFGFANRLG EPTRIRGDAA YVRQACEASL RRLGVDHIDL YYQHRVDPQV PIEETVGAMA ELVRAGKVRH LGLSEAGVRT IRRAHAVHPI AALQSEWSLW TRDLEAEIVP VCRDLGIGLV PFSPLGRGFL TGRYSSVEGL AETDVRRGQP RFADGNLERN LAIVAKLNEL AAAKGVTAGQ LALAWVQHRG DDVVPIPGTR RQRYLEENLA ALAVELSTED LAAIEAAAPP EQVAGTRYDA TSLTFVNG
|
| |