Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4733 |
Symbol | |
ID | 5673075 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5650134 |
End bp | 5651612 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641243590 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001509006 |
Protein GI | 158316498 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGCGG TGACCGATGT CAGGTCCGAG TCCAGGATGC TTGTGGACGG CGAGCTCGTC GAAGCCGACT CGGGTAAGCG GTTCGACAAC ATCAACCCCG CGACCGAACA GTTATTAGGC CAGGTGGCGG ACGCCTCCAC GGTGGAGATG CGGCGGGCGA TCACGGCCGC CCGCCGCGCG TTCGACACAA CCGACTGGTC GACGAACCAT GCCTTCCGCA AGCTTTGCCT CGAACAGCTG CAGGACGCGC TGGAAAGCGA GCGGGAGGAG CTGCGCGAGG AGCTCATCCT CGAAGTCGGC TGCCCGCGGA TGGTGACCAG CGGCCCGCAG CTCGATGATC CCCTGGAGAG CGCGCTTCGT TACCCGGCGA AGCTGATCGA CGAGTTCCCC TGGGAGTCGG ATCTCCCGGA CGGCATGGCG ATGGGCGTGC TCAACCGGCG GCGGGTCCGC AAGGAGCCGG CCGGTGTGGT CGGCGCCATC GTGCCGTGGA ACTTCCCCTT CGAGGTGGCC CTCAACAAGC TCGGCCAGAT CCTCGCCACC GGCAACACGG TGGTGCTCAA GCCCGCTCCG GACACGCCGT GGAACGCCAC CCGCATCGGC CGCCTGGTCG CCGAGCGCAC GGACATCCCG CCCGGCGTGG TCAATGTCGT CACCTCCTCG GACCACCTGG TGGGCGAGGA ACTGACGCTC TCCCCGCAGG TCGACCTGAT CTCCTTCACC GGATCGACCA CCGTCGGGCA GCGCATCATG GCGAAGGGCG CCGCGACGCT GAAGCGGGTG TTCCTCGAGC TCGGCGGCAA GTCCGCGACG ATCGTCCTCG ACGACGCGGA TTTCCCGAGC GCGCTCATGG CCGGCCTGGG CGTGTGCTAC CACGCCGGTC AGGGCTGCGC CGTCCAGACC CGGATGCTGT TGCCGCGGTC GCGCTATGCC GAGGGCGTCG AGGTGCTGAA GGCCGCCTTC GCGGCGGTGC CCTACGGCGA CCCGCAGCGC CCGGACGTGA TGATGGGCCC GCTGATCTCG GCCAAGCAGC GGGCTCGCGT GCTCGGCTAC ATCGAGAAGG GGGTGGCGGA GGGCGCCACG CTCGCGCTCG GTGGCGGTCG GCCCGAGCAC CTGCCCGCCG GCTGGTATGT CGAGCCCACC TTGTTCATCG ATGTGGACAA CTCGATGACG ATCGCCCAGG AGGAGATCTT CGGGCCGGTT CTCGTGGTGA TTCCGTTCGA CGATGACGAC GATGCCGTCC GGATCGCCAA CGAGAGTGTG TACGGACTCA GCGGCGGTGT CTTCTCCGGA TCCCTGGAAC GGTCGTTCGC CGTGGCCCGC CGGGTGCGCA CCGGGGGAAT GTGCGTCAAC GGCGGCACGC CCTACGGTGC GGACCTGCCC TTCGGGGGCT ACAAGCACAG CGGGATCGGC CGGCAGAACG GCACGCCGGG ATTCGAGCAG TACCTGGAGA CCAAGTCCCT CGCCTGGCCG AAGGCCTAG
|
Protein sequence | MTAVTDVRSE SRMLVDGELV EADSGKRFDN INPATEQLLG QVADASTVEM RRAITAARRA FDTTDWSTNH AFRKLCLEQL QDALESEREE LREELILEVG CPRMVTSGPQ LDDPLESALR YPAKLIDEFP WESDLPDGMA MGVLNRRRVR KEPAGVVGAI VPWNFPFEVA LNKLGQILAT GNTVVLKPAP DTPWNATRIG RLVAERTDIP PGVVNVVTSS DHLVGEELTL SPQVDLISFT GSTTVGQRIM AKGAATLKRV FLELGGKSAT IVLDDADFPS ALMAGLGVCY HAGQGCAVQT RMLLPRSRYA EGVEVLKAAF AAVPYGDPQR PDVMMGPLIS AKQRARVLGY IEKGVAEGAT LALGGGRPEH LPAGWYVEPT LFIDVDNSMT IAQEEIFGPV LVVIPFDDDD DAVRIANESV YGLSGGVFSG SLERSFAVAR RVRTGGMCVN GGTPYGADLP FGGYKHSGIG RQNGTPGFEQ YLETKSLAWP KA
|
| |