Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2501 |
Symbol | |
ID | 5670897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2977387 |
End bp | 2978943 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641241418 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001506839 |
Protein GI | 158314331 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.169486 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAAGT TCGAGACACG CAATCCGGCG ACGGACGAGG TCATCGGGAC CTATTTGGCG ATGTCCGCCG ACGACGTGGC GGCGAGCGTG CGGTCGGCCA GGGCCGCCGC GGGGCAGTGG CGGGCCGCGG GCTTCGCGGG CCGCCGTGCC GCCCTGCTGC GCTGGGACGC CTGGCTCGCC GCCCACGACC GCGACCTGAT CGAGCTCATA CACCTGGAGA ACGGGAAACC GGAGATCGAC GGCCGGCTCG AGCTCCTCCT GGCCTGCGAG CAGCTGCGCT GGGCGGCGCG CAACGCGCAC CGGGTCCTGC GCGCCCGCCG GGTCCGCACC GGGCTCGTGC TGGCCAACCA CAAGGCCCGC ATCGACCACC TGCCCTACGG CGTGGTCGGC GTGATCGGCC CGTGGAACTA CCCGGTCCTG ACACCGATGG GGTCCATCGC CTACGCGCTG GCCGCGGGCA ACACCGTCGT CTTCAAGCCC AGCGAGCTCA CGCCCACCGT CGGCGTGTAC CTCGCCGAGG CGTTCGCCGC CGCGAACCCG GACCTGCCCC CCGGCGTGTT CACCGCGGTG ACCGGGCTCG CCGAGACCGG GGCGGCGCTG TGCACGGCCG GCGTCGACAA GATCGCCTTC ACCGGCTCGG CGTCGACGGC GCGGCGGGTC ATGGCCACCT GCGCCGAGAC GCTGACCCCC GTCGTCGTCG AGTGCGGCGG CAAGGACGCT GCGATCGTCG CGGAGGACGC CGACCTGGTC GCCGCGGCAC GCGCGGTGGC CTGGGGGGCG ACGTCGAACG CCGGCCAGAC CTGCGCCGGG GTGGAGCGCG TCTACGTCGT CGCCGGTGTC CGGGACGCCT TCCTCGCCCA GCTGCGCCGC GTCCTCGCCG ACATCCGGCC CGGGTCGGAC GCCGGCGCCG ACTACGGCCC GATGACACTG CCCCGCCAGA GCGAGGTCGT CCGCCGCCAC CTCGACGACG CGCTCGCCCG CGGCGGGACG GCGCTGCTCG GCGGCCCGGA GTCGGTGCGC GCGCCGTACA TCGACCCGAT CGTGCTCGTG GACGTCCCCG AGGACAGCGC GGCCGTGCGC GAGGAGACCT TCGGGCCGAT GATGACCGTG CGCACCGTCG CCGACGTCGA CGAGGCCGTC GCCCTGGCGA ACGGCACCGC CTACGGCCTC GGCGCGACCG TGTTCTCCCG GGCCCGCGGC GAGGAGATCG CCGCGCGTCT CGATGCCGGG ATGGTCTCGG TCAACGCGGT GCTGTCGTTC GCGGCGATCC CGGCGCTGCC GTTCGGCGGG AGCGGCGACA GCGGGTTCGG CCGGGTCCAC GGCGCCGCCG GCCTGCGGGA GTTCGCCCGG CCGCGCTCGG TGGCCACCCG CCGGGTGGGC CTGCCCTGGG TCAACGCGGC CAGGTTCAAC CCTGTCCCGG GGACGTCCGC GGTGCTGCGG TGGCTCATCC GGCTCCGTCA CGCTGTGGGG GGTCACGGGG CGGGACGCCC AGACGCGGGC CGGCGCGATC CGTCGGGCCA GGGCCAGGGC CGTCGGTCGG CGCGAGCAGC CAGGTGA
|
Protein sequence | MAKFETRNPA TDEVIGTYLA MSADDVAASV RSARAAAGQW RAAGFAGRRA ALLRWDAWLA AHDRDLIELI HLENGKPEID GRLELLLACE QLRWAARNAH RVLRARRVRT GLVLANHKAR IDHLPYGVVG VIGPWNYPVL TPMGSIAYAL AAGNTVVFKP SELTPTVGVY LAEAFAAANP DLPPGVFTAV TGLAETGAAL CTAGVDKIAF TGSASTARRV MATCAETLTP VVVECGGKDA AIVAEDADLV AAARAVAWGA TSNAGQTCAG VERVYVVAGV RDAFLAQLRR VLADIRPGSD AGADYGPMTL PRQSEVVRRH LDDALARGGT ALLGGPESVR APYIDPIVLV DVPEDSAAVR EETFGPMMTV RTVADVDEAV ALANGTAYGL GATVFSRARG EEIAARLDAG MVSVNAVLSF AAIPALPFGG SGDSGFGRVH GAAGLREFAR PRSVATRRVG LPWVNAARFN PVPGTSAVLR WLIRLRHAVG GHGAGRPDAG RRDPSGQGQG RRSARAAR
|
| |