Gene Information |
Locus tag | Franean1_5513 |
Symbol | |
ID | 5673843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6679634 |
End bp | 6681277 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641244368 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001509773 |
Protein GI | 158317265 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
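The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names are illustrative only and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information table.
length = gene_length_bp(6679634, 6681277)
print(length)                     # 1644
print(protein_length_aa(length))  # 547
```

Both results match the Gene Length (1644 bp) and Protein Length (547 aa) rows.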
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTACCGCG TGCGCGACGG CATTGAAGGC TGTTTGGTGG CGCCGCGATC GGATCAGAGA GACCCTCGTC AGATGGACGC CGCAGCCAGC ACTCCCCCGT CCCGCCTGCA TCTCAAGCCG GGCACCGCCT GGGCCGACGC CTACTCGCGC GCCCGGCAGG AGGCCCCGGA AGCGTTCCAC GAGGACCGCC TGCTCAACCT GTGGGGCGGT CAGTGGCGTC GCACCGGCAA CCCGCTGCAC AGTCTCACGC CGGTGGACGG CACCCCGATC GCCGGCCCGC CGATGATCGA GCCCGACGAG GCGCGCGAGG CCATCCGCGC GACCCTCGAC GACCACAAGG AATGGCGCGA CGTCCCGCTG GCCGACCGCA AGGCCCGGGT GACCGCCGCG ATCGAGGCGA TGGAGGAGCA CCGCGACCTG CTCGCACTGC TGCTGGTCTG GGAGATCGGC AAGCCGTGGC GGCTCGCCCG CACCGACGTC GACCGCGCCC TGGACGGCGT GCGCTGGTAC GTCGACGAGA TCGACTCCAT GATCGGCGGC CGGGCGGCCC TGCCGGGCCC GGTCAGCAAC ATCGCGAGCT GGAACTACCC GATGAGCGTG CTCATGCACG CCATGCTCGT CCAGGTGCTC GCCGGCAACG CGGCGATCGC CAAGACGCCG ACCGACGGCG GGGCGGCCTG CCTGACGCTG GCCTGCGCGC TCGCCCGCCG GGCCGGGCTG CCGGTGTCGC TGGTCTCCGG GTCGGGGTCG CGGCTGTCGT CCGCGCTGGT GCGGGCGCCG GAGATCGGCT GCCTGGCGTT CGTGGGCGGG CGCTCCGCCG GCGGCCAGGT GGCGGCCGCG CTCGTCGACA CCGGCAAGCG GCACTTCCTC GAGCAGGAGG GCCTCAACGC CTGGGGCATC TGGGACTTCT CCCAGTGGGA CCTGCTGGCC TCGCACCTGC GCAAGGGCTT CGAGTACGGC AAGCAGCGCT GCACCGCCTA CCCGCGCTAT GTCGTCCAGC GCCAGCTCTT CGACAAGTTC CTGGAGATGT ACCTGCCGGT GGTCTCCTCG GTGCGGTTCG GGCATCCGCT CGCCGTCGAG AACGACTCCG ACCCGCTGCC CGACCTCGAC TACGGGCCGG TGATCACCGC GGAGAAGGCG GCGGAGCTCG CCGCCAAGAT CGACGAAGCG GTGACCAAGG GCGGCGTGCC GCTCTACCGC GGCGACCTCG CCGACGGCCG GTTCCTGCCC GGCCAGGACC GGGCCGCCTA CGTCCCGCCG GTGGCGGTCC TCAACCCGCC GCCGTCGGCC GCGCTGCACC ACGCGGAGCC GTTCGGGCCG GTCGACAGCA TCGTGGTCGT CGACTCCGAG GCCGAGCTGC TGTCCGCGAT GAACGCCTCG AACGGCGCGC TCGTCGCCTC ACTGGCCTGC GACGACGACG CCACGGCGCG CCGGCTGGCC GGGGAGCTCG CCGCGTTCAA GGTCGGCGTC AACAAGCCCC GCTCGCGAGG CGACCGGTCC GAGCCGTTCG GCGGGCGCGG CGCGTCGTGG AAGGGCGCGT TCGTCGGCGG GGAGCACCTC GTCCGCGCGG TCACCGTCGG CGCGGACCCG AACGAACGCC TCTACGGCAA CTTCCCCTCC TACTCCCTCT ACCCGGAGAC GTGA
Protein sequence | MYRVRDGIEG CLVAPRSDQR DPRQMDAAAS TPPSRLHLKP GTAWADAYSR ARQEAPEAFH EDRLLNLWGG QWRRTGNPLH SLTPVDGTPI AGPPMIEPDE AREAIRATLD DHKEWRDVPL ADRKARVTAA IEAMEEHRDL LALLLVWEIG KPWRLARTDV DRALDGVRWY VDEIDSMIGG RAALPGPVSN IASWNYPMSV LMHAMLVQVL AGNAAIAKTP TDGGAACLTL ACALARRAGL PVSLVSGSGS RLSSALVRAP EIGCLAFVGG RSAGGQVAAA LVDTGKRHFL EQEGLNAWGI WDFSQWDLLA SHLRKGFEYG KQRCTAYPRY VVQRQLFDKF LEMYLPVVSS VRFGHPLAVE NDSDPLPDLD YGPVITAEKA AELAAKIDEA VTKGGVPLYR GDLADGRFLP GQDRAAYVPP VAVLNPPPSA ALHHAEPFGP VDSIVVVDSE AELLSAMNAS NGALVASLAC DDDATARRLA GELAAFKVGV NKPRSRGDRS EPFGGRGASW KGAFVGGEHL VRAVTVGADP NERLYGNFPS YSLYPET
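The sequences above are printed in space-separated blocks of 10, so the spaces need to be removed before any analysis. The following is a minimal sketch of that cleanup, plus a GC fraction and a codon-by-codon translation. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two tables differ in their permitted start codons, which this sketch does not model). All function names are illustrative and are not part of any database tool.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and any line breaks, and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
prefix = clean_sequence("ATGTACCGCG TGCGCGACGG CATTGAAGGC")
print(len(prefix))        # 30
print(translate(prefix))  # MYRVRDGIEG
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence through the same functions should allow the Protein sequence and GC content rows to be checked in the same way.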