Gene Information |
Locus tag | Franean1_3914 |
Symbol | |
ID | 5672275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4679615 |
End bp | 4681096 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242793 |
Product | methylmalonate-semialdehyde dehydrogenase |
Protein accession | YP_001508210 |
Protein GI | 158315702 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01722] methylmalonic acid semialdehyde dehydrogenase |
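The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes 1-based, inclusive coordinates (the usual convention for such records, but not stated in the source) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp
length_bp = gene_length(4679615, 4681096)
print(length_bp, protein_length(length_bp))
```

With the record's values this reproduces the listed Gene Length (1482 bp) and Protein Length (493 aa).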
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.860306 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.105772 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGACGA TCGGGCACTG GATCAACGGC AAGTCCGCCC CCTCGGCCTC CGGCCGCGTC GGCCGCGTCT TCGACCCGGC CCGCGCAGTG CAGACCGGGC AGGTCACCCT CGCCTCCACC GCGGAGGTCG ACGACGTGGT GCGCGTCGCC CGGGACGCCG CCGTGAGCTG GGGCGCGTCC TCGCTCAGCA ACCGCTCGAC GCTGCTGTTC CGGCTACGCG AGCTGCTCGA CGCCAGCCGT GACGAGCTCG CCGCGGCCGT CGCCGCCGAG CACGGCAAGG TGCACTCCGA CGCGCTCGGC GAGGTCGCCC GCGGCATCGA GTGCGTCGAG TTCGCCTGCG GCATCCCCCA CCTGCTGAAG GGCTCGCACA GCTCGGAGGT CTCCCGGGGC GTCGACGTCC ACACCGAGCT ACATCCGGTC GGCGTGGTAG CGGGCATCAC CCCGTTCAAC TTCCCCGTCA TGGTGCCGCT GTGGATGCTG GCCAACGCGG TGGCCACCGG CAACACCTTC ATCCTGAAGC CCTCGGAGAA GGACCCGTCG GCCTCGCTGA TCCTGGCCGA CCTCGTCACC CGGGCCGGTT TCCCGGACGG CGTGTTCAAC GTCCTGCAGG GCGACGCCGA GGCGGTGCGC GCCCTGCTCA CCCACCCCGG CGTGGACGCC GTGTCGTTCG TCGGCAGCAC CCCGGTGGCC CGCTCCATCT ACGAGACGGG CACCGCCGCC GGCAAGCGGG TGCAGGCACT CGGCGGCGCG AAGAACCACA TGGTCGTGCT GCCGGACGCC GACATCGAGT CGGCCGCCAA CGCCGCCATC TCGGCCGGCT ACGGGTCCGC CGGTGAGCGC TGCATGGCGA TCTCGGTCGT GGTCGCGGTC GGCGCGGTCG CCGACCCGCT GGTCGACGCG ATCGCCGCGC GCATCCCCGA CGTGGTGGTC GGCCCGGCCT CGGACGAGTC GTCCCAGATG GGCCCGCTGA TCACCGCGGA GCACCGCGAC CGGGTCCGGT CCTACGTCCA GGGCGCGACC GACGAGGGTG CCCGCGTCGT CGTCGACGGC TCCGCCGGCC GCGACGAGGG GTACTTCGTC GGCTGCTCGC TGCTGGACGG CGTCAAGCCG GGCATGCGCG TCTACGACGA CGAGATCTTC GGCCCGGTCC TGAGCGTCGT GCGGGTGGAC AGCTACGACG AGGCCATCGA GCTGATCAAC AGCAACCAGT ACGGCAACGG CGTGGCCCTG TTCACCCAGG ACGGCGGCGC CGCCCGCCGC TTCACCCGGC AGGTCGACGT CGGCATGATC GGGATCAACG TGCCGATCCC GGTGCCGGTC GCCTGGCACT CGTTCGGCGG CTGGAAGGCG TCCATCTTCG GCGACGCCCC GATCTACGGC CCAGAGGGGA TCCGCTTCTA CACCCGGCCG AAGGTCGTCA CCTCACGGTG GCCCGAGTCG ACCCCCACCG CCGTCGACCT GGTCTTCCCC GCCAACCGCT GA
|
Protein sequence | MKTIGHWING KSAPSASGRV GRVFDPARAV QTGQVTLAST AEVDDVVRVA RDAAVSWGAS SLSNRSTLLF RLRELLDASR DELAAAVAAE HGKVHSDALG EVARGIECVE FACGIPHLLK GSHSSEVSRG VDVHTELHPV GVVAGITPFN FPVMVPLWML ANAVATGNTF ILKPSEKDPS ASLILADLVT RAGFPDGVFN VLQGDAEAVR ALLTHPGVDA VSFVGSTPVA RSIYETGTAA GKRVQALGGA KNHMVVLPDA DIESAANAAI SAGYGSAGER CMAISVVVAV GAVADPLVDA IAARIPDVVV GPASDESSQM GPLITAEHRD RVRSYVQGAT DEGARVVVDG SAGRDEGYFV GCSLLDGVKP GMRVYDDEIF GPVLSVVRVD SYDEAIELIN SNQYGNGVAL FTQDGGAARR FTRQVDVGMI GINVPIPVPV AWHSFGGWKA SIFGDAPIYG PEGIRFYTRP KVVTSRWPES TPTAVDLVFP ANR
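The protein sequence can be derived from the gene sequence using translation table 11, as listed in the record. The following is a minimal sketch rather than a reference implementation: the function name `translate_cds` is hypothetical, the gene sequence is assumed to be given on the coding strand as displayed, and the start-codon set is the one commonly listed for NCBI table 11. An alternative initiator such as the GTG that opens this gene is translated as M.

```python
from itertools import product

# Amino acids for table 11 in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons for table 11 (assumption: standard NCBI listing).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an unspliced CDS; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # any valid initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, in the record's 10-base blocks
print(translate_cds("GTGAAGACGA TCGGGCACTG GATCAACGGC"))
```

The printed residues match the first block of the protein sequence, MKTIGHWING.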