Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3297 |
Symbol | |
ID | 5671669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3908445 |
End bp | 3909851 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641242186 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001507606 |
Protein GI | 158315098 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGACG TGTCAATGAT CATCGGTGGG GAGCGCCGGG CCGCCCCGTC GACCTTCGGT GTTGTGAATC CGGCGACGGG CGAGGTCCAC GCGGATGCGC CGGACTGTGA TGAGGGCCAG CTGAACGAGG CGTTCGACGC CGCGGCCAAG GCATTCACCG ACTGGCGCAC CGACGAGGAC GCGCGCCGCT CCATCCTGCG CGAGGCATCG AAGCGACTCT CGGCGGCGGC GGACCGGATC GCCCCGGTCC TGACCGCCGA ACAGGGCAAG CCGCTCAGAT CTGCGCACAT GGAGGTGCTC GGCGCCGGCT ACTGGCTGAG GTACTTCTCC CGCCTGGAGA TCCCGCGCGA GGTCATCCAG GACGACGAGC ATGTCCGCGC CGAGGTGGTG CACCGCCCGA TGGGCGTCGT CGCGGCCATC ACGCCGTGGA ATTTCCCGAT CATGCTGTCG GCCTGGAAGC TCGGACCGGC GCTACTCGCC GGGAACACGG TCGTGCTGAA GCCGTCGCCG TTCACTCCGC TCAGCACCCT GCTGATGGGC GAGATCCTCA GCGAGGTGCT CCCACCGGGT GTGCTCAACG TCGTCTCAGG CGGAGATCAG CTGGGCCGGT GGATGACCTC CCATCCGGTC CCGCGGAAGA TCAGCTTCAC CGGGTCGACC GGGACCGGCA AGCACATCGC CGCCTCGGCG GCCCCTGACC TCAAGCGCCT CACTCTGGAG CTCGGCGGGA ACGACGCCGC GATCCTTCTC GACGATGTCG ATCCCGCCGC CATCGTGCGG AAGCTTTTCG AGGGGGCCTT CGACAACAGT GGGCAGGTCT GCTCGGGCAT CAAGCGCCTT TACGTTCCCG AATCGTTGCA CGATGTCGTC GTCGACGCTC TCGCGGCCCA GGCCGCGGCG GCCCGGGTCG GTGACGGCAT GGACCCGGAC ACCCAGGTGG GCCCGATCCA GAACCGCCCG CAGTACGAGC GGGTCTGCGA CCTGGTCGCC GAGGCGCTGG CGGGTGGCGC GACGGCGGCG GCCGGCGGCG CCCCGATCGA CGGCCCCGGC TACTTCTTCC CGCCGACCGT GCTCGTCGGG GCGGCCGAGG GCACCCGGAT CGTCGACGAG GAGCAGTTTG GACCGGTGCT GCCGGTGCTG CCGTACCGGG ACATCGACGA AGCGGTCGCC CGGGCCAACG CCACGAACTA CGGGCTTTCC GGTTCGGTCT GGTCCGCGGA CCCGGATCGG GCCGGCGCCG TCGCCGAGCG CCTGGACTGC GGCACCGCCT GGGTCAACGC CCATGTGGCA CTCGGCCCCC ATCAGCCCTT CGGCGGTCTC AAATGGAGCG GCGTCGGGGT GGAGAACGGG CCCTGGGGGC TCGCGGGCTA TACCGACCTC CAGGTCCAGT ACCGCGCCAA GGACTGA
|
Protein sequence | MSDVSMIIGG ERRAAPSTFG VVNPATGEVH ADAPDCDEGQ LNEAFDAAAK AFTDWRTDED ARRSILREAS KRLSAAADRI APVLTAEQGK PLRSAHMEVL GAGYWLRYFS RLEIPREVIQ DDEHVRAEVV HRPMGVVAAI TPWNFPIMLS AWKLGPALLA GNTVVLKPSP FTPLSTLLMG EILSEVLPPG VLNVVSGGDQ LGRWMTSHPV PRKISFTGST GTGKHIAASA APDLKRLTLE LGGNDAAILL DDVDPAAIVR KLFEGAFDNS GQVCSGIKRL YVPESLHDVV VDALAAQAAA ARVGDGMDPD TQVGPIQNRP QYERVCDLVA EALAGGATAA AGGAPIDGPG YFFPPTVLVG AAEGTRIVDE EQFGPVLPVL PYRDIDEAVA RANATNYGLS GSVWSADPDR AGAVAERLDC GTAWVNAHVA LGPHQPFGGL KWSGVGVENG PWGLAGYTDL QVQYRAKD
|
| |