Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5389 |
Symbol | |
ID | 5673721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6497646 |
End bp | 6499127 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641244245 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001509651 |
Protein GI | 158317143 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTGACA GACCTCAGTT CGAGTCGAGG ATGCTCATCG ACGGCAAGCT CGTCGAGGCC GCGACCGGCA AGACGTTCGA CAACGTCAAC CCGGCGACCG AGGAGGTGCT CGGCCAGGTC ACCGATGCCT CCGCCACCGA CATGCACCGC GCCATCGACG CCGCCCGCCG GGCCTTCGAC GAGACCGACT GGCCGACGAA CCGCGCGTTG CGCAAGCTCT GCCTCTCCCA GCTACAGAAG GCGCTCGAGT CGGAACGCGA GCAGTTCCGC GAGGAGCTGA TCGCAGAGGT CGGCTGCCCG CGGACGATCA CCAACGGCGA ACAGCTCGAC GCGCCGCTGG CGAACGCGCT GCGCCACCCG ACCAAGCTCA TCGACACCTA CCCCTGGGAG ACCGACCTCG GTAACACGGT CGACGAGCGA AGCGGAAGGC TGACATCGCG GAGGATCTGG CGCGAGCCGA CCGGCGTCGT CGGGGCCATC GTGCCCTGGA ACTTCCCGCT CCAGATCGCA CTGCACGCAC TCGGCCAGGC GCTGGCCACC GGCAACACGG TCGTGCTCAA GCCTGCGCCG GACACCCCGT TCCACGCCAC CCGCCTCGGC CGCCTCATCG CCGAACACAC CGACTTCCCG CCTGGAGTCG TCAACGTCGT CACGGCGTCG GACCACCTCG TCGGCGAGGA ACTGACCCTC TCCCCCAAAG TCGACCTGAT CTCCTTCACC GGGTCGACCG CCGTCGGAAA ACGGATCATG GAGAAGGGTG CGGCCACCCT CAAACGACTG TTCCTGGAGC TGGGCGGCAA ATCGGCGACG ATCGTGCTCG ACGACGCCGA CCTGCAGACC GCCACGAGGA TGGGCATCGC CGCATGCGTC CACGCCAGCC AAGGCTGTGC CATCCCGACC CGGATGCTGC TCCCCCGCTC CCGCTACGAC GAGGGCGTAG CCCTGCTCGA GGCCATGTAC GCGGGCGTCA AGGTCGGTGA TCCCCAGCAG GCGGATACCG TCACCGGACC GGTCATCTCG ACAAAACAGC GTGAGCGGGT GCTCGGCTAC ATCCGCAAGG GCATCGATGA GGGAGCCAAA CTTCTCGTCG GCGGCACCGA GCGGCCCGAA GGGCTCGACA AGGGGTACTT CGTCAAACCG ACGCTGTTCG TCGACGTCGA CAACTCCATG ACCATCGCCC AGGAAGAAAT CTTCGGGCCC GTCCTGGCGG TGATCCCGTT CGAGGACGAC GAGGACGCGA TCAGGATCGC CAACGACAGC TCCTACGGCC TGTCCGGCGC GGTGATGTCG GGATCGCTGG AACGGGCGCT GGCGGTGACC CGACGTATCC GGACGGGCAC CGTCCACGTC AATGGTGGCC TGGCGTCCGG CCCGGACATG CCCTTCGGCG GCTACAAGGC CAGCGGCATC GGCAGGCACA AAGGACATGC GGGCTTCGAC CAGTACCTCG AGACCAAGTC CACAGCTTGG CCGGCGAGCT GA
|
Protein sequence | MADRPQFESR MLIDGKLVEA ATGKTFDNVN PATEEVLGQV TDASATDMHR AIDAARRAFD ETDWPTNRAL RKLCLSQLQK ALESEREQFR EELIAEVGCP RTITNGEQLD APLANALRHP TKLIDTYPWE TDLGNTVDER SGRLTSRRIW REPTGVVGAI VPWNFPLQIA LHALGQALAT GNTVVLKPAP DTPFHATRLG RLIAEHTDFP PGVVNVVTAS DHLVGEELTL SPKVDLISFT GSTAVGKRIM EKGAATLKRL FLELGGKSAT IVLDDADLQT ATRMGIAACV HASQGCAIPT RMLLPRSRYD EGVALLEAMY AGVKVGDPQQ ADTVTGPVIS TKQRERVLGY IRKGIDEGAK LLVGGTERPE GLDKGYFVKP TLFVDVDNSM TIAQEEIFGP VLAVIPFEDD EDAIRIANDS SYGLSGAVMS GSLERALAVT RRIRTGTVHV NGGLASGPDM PFGGYKASGI GRHKGHAGFD QYLETKSTAW PAS
|
| |