Gene Information |
Locus tag | Franean1_0346 |
Symbol | |
ID | 5668770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 414755 |
End bp | 416194 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641239278 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001504718 |
Protein GI | 158312210 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
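
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the helper names `gene_length` and `protein_length` are hypothetical and not part of any database API, and it assumes inclusive 1-based coordinates and an unspliced CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record for Franean1_0346
length_bp = gene_length(414755, 416194)
print(length_bp)                  # 1440
print(protein_length(length_bp))  # 479
```

Strand does not affect this calculation: for a gene on the minus strand the start and end coordinates are still reported on the forward strand of the replicon.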
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0211334 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0136031 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAAGCGGT TACTGATCGA CGGGAAGCTT GTCGAGACCG AGCGGACGGT CGACTCGATC AACCCCTCGA CCGGCGAAGT CATCGGCCAG GCCGCGGACG CGACGGTCGA AGAGACCACC GCCGCGGTCA AGGCCGCCCG TAAGGCCTTC GACACCACCG ACTGGTCGAC CAACGTCGCG TTCCGCGTCC AGTGCCTCAA CCAGCTCCAC GACGTCCTCG TCAAGCACAA AGAAGAACTC CGCGAACTCA CCATCGCCGA GGTCGGCCAC CCCCGCATGA TCACCGACGG GCCCGCCCTC GGCGACCCGA TCAACCTCGT CAAGTACTAC GCCGACCTCA CCGCCGGCTA CCAGTTCACC CAGGACCTCG GCACCGTCGA ATCCCGCGGC GCCCAGCACC ACCGCTGGAT CGAACGCGAA CCCGCCGGCG TCGTCTCCGC GATCGTCGCC TACAACTACC CCACCCAGCT CGCCCTCGCG AAACTCGCCC CCGCCCTGGC CGCCGGCTGC ACCGTCATCC TCAAAGGCGC CCCCGACACC CCCCTGCTCG CCCTCGCCCT CGGCGAACTC ATCGCCAACG AGACCGACAT CCCCGCCGGC GTCGTCAACG TCATCACCTC CATCGACATC GACGCCGCCG AAGTCCTCAC CGGCCACCCC GACGTCGACC TGATCACCTT CACCGGGTCC ACCGCCGTCG GCCGACGCAT CATGGAAGTC GCCAGCAAGA CCGTCAAAAA AGTCTTCCTC GAACTCGGCG GGAAATCCGC CCTCGTCATC CTCGACGACG CCAACCACGA CCTCGCCGCC ATGATGGCCG CGTTCACCAT CTGCTCCCAC TCCGGGCAGG GCTGCGCCAT CACCAGCCGC CTCGTCGTCC CCCGCGCCCA ACACGACGCC ATCGTCGAGA AGGTCGCCGC CATGCTCGGC CAGATCAAAG TCGGGAACCC CACCGAACCC GACACCTACA TGGGCCCGCT CATCAGCGAG AAGCAACGCG ACAAGGTCGA CGGCATCGTC CAACGCGCCA TCGCCGCCGG CGCCACCCTC GTCACCGGCG GCGAAAAGAT CAACCCCGGG TTCTTCTACG CCCCCACCCT GCTCGCAGGC GTCGACCCCG ACAGCGAGAT CGCCCAGGAA GAAATCTTCG GCCCCGTCCT CGCCGTCATC CCCCACGACG GCGACGACGA CGCCGTGAAC ATCGCCAACA ACTCCATCTT CGGCCTCTCC GGATCCGTCC TCAGCGCCGA CACCGACCGC GCCCTCGCCG TCGCCCGCCG CATCCGCAGC GGCACCATCA GCGTCAACGG CGGCAGCTGG TACGCCCCCG ACGCCCCCTT CGGCGGCTAC AAGCAGTCCG GCATCGGCCG CGAAAGCGGC ACCCCCGGCC TCGAGGAATT CCTCGAGATC AAAACCATCG CCACCCCGGC CGCGTCCTGA
Protein sequence | MKRLLIDGKL VETERTVDSI NPSTGEVIGQ AADATVEETT AAVKAARKAF DTTDWSTNVA FRVQCLNQLH DVLVKHKEEL RELTIAEVGH PRMITDGPAL GDPINLVKYY ADLTAGYQFT QDLGTVESRG AQHHRWIERE PAGVVSAIVA YNYPTQLALA KLAPALAAGC TVILKGAPDT PLLALALGEL IANETDIPAG VVNVITSIDI DAAEVLTGHP DVDLITFTGS TAVGRRIMEV ASKTVKKVFL ELGGKSALVI LDDANHDLAA MMAAFTICSH SGQGCAITSR LVVPRAQHDA IVEKVAAMLG QIKVGNPTEP DTYMGPLISE KQRDKVDGIV QRAIAAGATL VTGGEKINPG FFYAPTLLAG VDPDSEIAQE EIFGPVLAVI PHDGDDDAVN IANNSIFGLS GSVLSADTDR ALAVARRIRS GTISVNGGSW YAPDAPFGGY KQSGIGRESG TPGLEEFLEI KTIATPAAS
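
The gene sequence starts with GTG, yet the protein starts with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is one of the accepted start codons, and an initiating codon is read as methionine. The sketch below illustrates this on the first 30 bases of the gene sequence only. It is a minimal hand-rolled translator, not the tool used to produce this record; `translate_cds` and the constant names are hypothetical, and ambiguous bases are not handled.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Table 11 uses the same mapping; it differs in its set of start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for NCBI translation table 11.
START_CODONS_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS read 5'->3', treating the first codon as an initiator."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS_11:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break  # stop codon ends translation
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record
head = "GTGAAGCGGT TACTGATCGA CGGGAAGCTT"
print(translate_cds(head))  # MKRLLIDGKL
```

The output matches the first ten residues of the protein sequence above. The gene sequence is listed in coding orientation, so no reverse complement is needed even though the gene lies on the minus strand.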