Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2374 |
Symbol | |
ID | 5670770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2820615 |
End bp | 2822216 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641241291 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001506712 |
Protein GI | 158314204 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.238793 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACTCG TTCATCCACG CCGGGTGGCG GCGGTGATCG GCGGCCGCCA GCACGACCTG TACGCCGCGG CCGGCCCGGC GGGCATCGCC CCGGAGACCG CGCGCGTCGT CAGCTCGACG AACCCGGCCC GCCTGAGCGA CGTCGTCGCC GAGGTCGTGC TGGACGGGGC GGAGGCCATC GTCGCGGCCG CGGCCGCCGC CCGCGCCGCC CAGCGGGAGT GGGCCGACGT CCCGGCGCCG GTGCGTGGCG AGGTCATCGG CGCCTTCGGC CGGCTCGTCG AGGACAACGC GGAGACGCTG GCCCGGCTGG TCACCCGGGA GATCGGCAAG CCGCTGGCGG AGGCGCGCGG GGAGGTCCGC GAGATCATCG ACACCTGCGC GCTGTTCCGC GGCGAGGGCC GGCGCCCGCA CGGCCAGACG GTGCCGTCCG AGATGCCCGA CCGGGAGCTG TTCACCTACC GTGAGCCGCT CGGCGTGGTC ATGGTGATCA CTGCTGGCAA CTTCCCGGTG GCGGTGCCGT CCTGGTATCT GGTGCCGGCG CTGCTGACCG GCAACGCGGT GGTGTGGAAG CCCGCCGAGT ACGCCGCCGC CTGCGCCGCG GCCCTGATGG ACCTGCTGAC CGCCGCCGGC GTTCCGCCCG GGGTGGCGAA CCTAGTCCTC GCCGACGGCC CGGCCACCTC GCTCGGCCTC GAACGCGCCC TGGACGCCGG CCTCGTCGAC AAGGTCGGCT TCACCGGCTC GACGTCCGTC GGCCGCTTCG TCGGCGCGCT GTGCGGGCGG CACCTGCAGT CGCCGTGCCT GGAACTGGGC GGCAAGAACC CGATGGTGCT GGCCCCCGAC GCCGACCTGG ACGCCGCCGT CGCCGCCGCG CTGTTCGCCG GGTTCGGGAC GGCCGGCCAG CGGTGCACCT CGCTGGGCAC GGTGATCGCG CACGAGTCGG TGCACGGCGC GTTCCGGCGC CGGCTGGACG CCGCCGTCAG CGGCGCCGTG CTCGGCGACC CCACCCGGGA CGTCCTCTAC GGCCCGCTGC TCGACGCCCG GTTCGCCGCC GGCTTCGAGG ACCACCTGGC GTGCGTGCGC GACCATCACG AGCCGTTCGG GTCCACCGCG CTCGGCCGGA TCGGCCCGGC CAGCCCGCGG CGCGGGTTCG TCGGCGACCC CGAGACCGGC CTGTACTACC ACCCGGTCGT CGTCGACCGG GTGCGCCCCG AGGACGAGCT GTTCACCGCG GAGACCTTCG GCCCGATCGT CGGGCTGACC ACCTACCGCC ACCTGGAGGA GGCCGTCGAG CTCGCGAACC TGCCCGGCTA CGGGCTGTCC TCGTCGATCT TCACCGGTGA CCCGGTGAGC GTCCGGCGCT TCCGGCGCGG GGTGCGTGCC GGAATGGTCA GCGTGAACAC CTCCACCTCC GGCGCCGAGG CGCACCTGCC CTTCGGCGGC AACGGGCGCT CCGGCAACGG CGCCCGCCAG TCCGGCCAGT GGGTGCTGGA CCAGATGACC CGCTGGCAGT CCCTGACCTG GGAACTGTCC GGCCGCCTGC AGAAGGCCCA GCTCGACGTC TCCGTCCCCC CGGCCGACCT CGGCTTCCGC CTGCCGCGGT GA
|
Protein sequence | MTLVHPRRVA AVIGGRQHDL YAAAGPAGIA PETARVVSST NPARLSDVVA EVVLDGAEAI VAAAAAARAA QREWADVPAP VRGEVIGAFG RLVEDNAETL ARLVTREIGK PLAEARGEVR EIIDTCALFR GEGRRPHGQT VPSEMPDREL FTYREPLGVV MVITAGNFPV AVPSWYLVPA LLTGNAVVWK PAEYAAACAA ALMDLLTAAG VPPGVANLVL ADGPATSLGL ERALDAGLVD KVGFTGSTSV GRFVGALCGR HLQSPCLELG GKNPMVLAPD ADLDAAVAAA LFAGFGTAGQ RCTSLGTVIA HESVHGAFRR RLDAAVSGAV LGDPTRDVLY GPLLDARFAA GFEDHLACVR DHHEPFGSTA LGRIGPASPR RGFVGDPETG LYYHPVVVDR VRPEDELFTA ETFGPIVGLT TYRHLEEAVE LANLPGYGLS SSIFTGDPVS VRRFRRGVRA GMVSVNTSTS GAEAHLPFGG NGRSGNGARQ SGQWVLDQMT RWQSLTWELS GRLQKAQLDV SVPPADLGFR LPR
|
| |