Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6159 |
Symbol | |
ID | 5674480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7492759 |
End bp | 7493712 |
Gene Length | 954 bp |
Protein Length | 317 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641245011 |
Product | aldo/keto reductase |
Protein accession | YP_001510409 |
Protein GI | 158317901 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.145044 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.449147 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTTATC GCCCCCTGGG ATCTTCGGGC CTGATGGTCT CCGTCGTCGG GCTGGGATGC AACAACTTCG GGTCCAGAGT CGACCTGACC GGTACGCGTG CCGTCGTCGA GGCCGCCCTC GACTCCGGGA TCAACTTCTT CGACACCGCC GACACCTACG GGAACAAGGG CGGCTCGGAG ACCCTGCTGG GGCACGTGCT CCGGGGCCGG CGCGACGACG TCGTCCTCGC CACCAAGGTC GGCAACGACA TGGGCGGCAT GTACGGCCAG GACTTCAACG CGCGCGCCTC CCGCCGCTAC ATCCGCAAGG CGGTCGAGGG GTCGCTGAGC CGGCTGCAGA CCGACTACCT CGACCTTTAC CAGCTCCACA ACCTGGACGC CTTCACCCCC GTCGAGGAGA CGCTGGAGGC GCTCGGCGAG CTCGTCCAGG AGGGCAAGGT CCGCTACGTC GGCTCGTGCA ACCTGGACGC CTGGCAGGTG TCCGACGCCG AGTGGACGGC ACGGGCGTCC GGCACCACCC GGTTCATCTC TGCCCAGAAC CACTACAACC TGCTCGTCCG GGACGTCGAG GCCGAGCTGG TGCCGGCCGC CCTCAAGTAC GGGATCGGGG TCATCCCCTA CTTCCCGCTG GAGAACGGCA TCCTGAGCGG CAAGTACCTG CGCGACGAGG CGCCGCCGGC CGGCACCCGG ATGGCGGGCC GCCAGGACGA GCTCACCGAC GACCTGTTCG ACCGGGTGGA GACCCTGGAG ACCTTCGCCA GGGAGCGGGA CCGCTCGCTG CTGGACGTGG CGATCGGCGG CCTCGCCGCC CAGCCCGCGG TCGCCTCCGT GATCGCCGGT GCGACGAGCG CGGCGCAGGT CCGGGCGAAC GCCGCGGCCG GCCAGTGGCA GCCCCGAGCG GACGACCTCG CCGTCCTCGA CAAGATCGCT CCCACCCGCC GCCCGGGGGC CTGA
|
Protein sequence | MRYRPLGSSG LMVSVVGLGC NNFGSRVDLT GTRAVVEAAL DSGINFFDTA DTYGNKGGSE TLLGHVLRGR RDDVVLATKV GNDMGGMYGQ DFNARASRRY IRKAVEGSLS RLQTDYLDLY QLHNLDAFTP VEETLEALGE LVQEGKVRYV GSCNLDAWQV SDAEWTARAS GTTRFISAQN HYNLLVRDVE AELVPAALKY GIGVIPYFPL ENGILSGKYL RDEAPPAGTR MAGRQDELTD DLFDRVETLE TFARERDRSL LDVAIGGLAA QPAVASVIAG ATSAAQVRAN AAAGQWQPRA DDLAVLDKIA PTRRPGA
|
| |