Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1556 |
Symbol | |
ID | 5669959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1863560 |
End bp | 1864609 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641240475 |
Product | aldo/keto reductase |
Protein accession | YP_001505901 |
Protein GI | 158313393 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.206898 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.568157 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTACC GCAAGTTCGG GCGGACGGGC ATCGAGGTCA GCCGGCAGTG CCTGGGATCC ATGATGTTCG GTGTGATCGG CAACCCCGAC CACGCCGCGT GCGAGCGCAT CATCGCCCGG GCGCTGGACG CCGGGATCAA CTTCATCGAC ACCGCCGACA TCTACTCGCG GGGTGAGAGC GAGCAGATCG TCGGGAAGGC CATCAAGGGC CGCCGCGACG ACATCGTCCT GGCCACCAAG TGCTTCAACC CGATGGGCGG GGACCGCAAC CGCCGCGGCG CCTCCCGCCG GTGGATCACG CGCGCCGTCG AGGACAGCCT CCGCCGCCTC GACACCGACT ACATCGACCT CTTCCAGATC CACCGGCATG ACTGGAACAC CGACCTGGAA GAGACCCTCG GCGCGCTGAA CGACCTCGTG CACCAGGGCA AGATCCGTTA CCTGGGCTCG TCGACGTTTC CCGCGGACTG GATCGTCGAG GCGCAGTGGG CCGCCCGCCG CCGCAACACC GAGCGTTTCG TCTGCGAGCA GCCCCAGTAC TCGATCTTCG CCCGCTCGGT CGAGCAGGCC GTCCTGCCCG CCTGCCGGCG GCACGACATC GCGGTGATTC CCTGGAGCCC GCTCTCCGGC GGCTGGCTGA CCGGGAAGTA CCGGCGCGGG CAGGCGGTGC CGGCCGACGC CCGGTACGCG GCCGGCAATG TCATGGCACA GGGGCGGGCC GTCGGGGAGA GCCCCGAGTC GCAGGCCCGG TTCGACGCGG TCGAGCAGCT GTCCGCGGTG GCGGCGGAGG CCGGCCTCTC CCTCACCCAT CTTGCGCTCG GGTTCGTGGA GAGCCATCCG GCGATCACTT CGACGATCAT CGGCCCGCGC ACGATGGAGC AGCTCGAGGA CGTCCTCAGC GGGGCCGACG TCGTGCTCGA CGCGGCGACG CTCGACGCGA TCGACAAGAT CGTGGAGCCC GGGACCGATT TCGTCGGCGT CCGGCACATG ACCGGTGACC CGTCCCTGCT GCCCGAGACG CGCCGGCGGC TGGCAACGCA GTTCGGCTGA
|
Protein sequence | MRYRKFGRTG IEVSRQCLGS MMFGVIGNPD HAACERIIAR ALDAGINFID TADIYSRGES EQIVGKAIKG RRDDIVLATK CFNPMGGDRN RRGASRRWIT RAVEDSLRRL DTDYIDLFQI HRHDWNTDLE ETLGALNDLV HQGKIRYLGS STFPADWIVE AQWAARRRNT ERFVCEQPQY SIFARSVEQA VLPACRRHDI AVIPWSPLSG GWLTGKYRRG QAVPADARYA AGNVMAQGRA VGESPESQAR FDAVEQLSAV AAEAGLSLTH LALGFVESHP AITSTIIGPR TMEQLEDVLS GADVVLDAAT LDAIDKIVEP GTDFVGVRHM TGDPSLLPET RRRLATQFG
|
| |