Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3699 |
Symbol | |
ID | 5672065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4379954 |
End bp | 4381369 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641242582 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001508002 |
Protein GI | 158315494 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGAAT ACTTAAAGTT CTACATCGGC GGCCAGTGGG CGGAACCGGC CGAGCAGCGG ACCTTTGATG TCGTGAATCC GGCGACCGAG CAGGTCGCCG GCCGGGTGGC ACTCGGCTCC GCCACCGACG TGGACCGGGC GGTCGCGGCC GCCCGGGCCG CCTTCCCCAG CTGGTCGGCG ACCAGCCGCG AGGAGCGGAT CGCGGTCCTC GAGAGCATCC TCGACGTCTA CCAGAAGCGT GCCGGCGACC TCGCCACCGC GCTGACCGAG GAGATGGGCG CGCCGGCCGC GCTGGCCAAC GGCTTCCAGG TCGGCCTCGG CGCCGGGCAC CTGACCACTG CGATCGAGAT CCTCAAGAAC TTCTCCTTCG AGGAGCAGCG CGGGGTCACC CGCGTGGTCC TCGAGCCGAT CGGGGTCTGC GGCCTGATCA CGCCGTGGAA CTGGCCGATC AACCAGATCG CGGTCAAGGT CCTTCCCGCG CTCGCCACGG GCTGCACCGT GGTGCTCAAG CCGTCCGAGG AGTCGCCCTT CACCGGGCAG ATCCTCGCCG AGATCTTCGA GGCGGCCGGG GTCCCCGCCG GCGTGTTCAA CCTGGTCCAG GGCGACGGCC CCAGCGTGGG CGTGCCGCTG TCGGCGCATC CCGACGTGGA CCTGATCTCG TTCACCGGCT CCACCCGCGC GGGCATCGAG ATCGCCAAGA ACGCCGCGCC CACGGTGAAG CGGGTGACCC AGGAGCTGGG CGGCAAGAGC CCGAACATCG TCCTCGACGA CCAGGACTTC GCCGAGAACG TCGCCAAGGG CGTCATCAAC ATGATGGGCA ACTCCGGGCA GACCTGCACG GCGCCCGCCC GCCTGCTGGT GCCCAGCGCC CGGATGGAAG AGGCGATCAG CGCCGCCCGC GAGGCCGCGG CGCAGGTGAC CGTGGGCGAT CCCAACGGCG AGTTCACGAT CGGGCCGGTG GCCTCCGGGC GCCAGTTCGA GAAGATCCAG GGCCTGATCC AGCAGGGCAT CGACGAGGGC GCCACGCTGG TCGCCGGCGG GACCGGTCGA CCGGACGGGC TGGAGACGGG CTTCTACGTC AAGCCCACCG TCTTCGCCGA CGTCACGAAC GACATGATCA TTGGCCGGGA GGAGATCTTC GGGCCGGTGC TCACGATTCA CGGCTACGAC AGCGTGGATC ACGCCGTCGA GCTCGCGAAC GATACCGAGT ACGGCCTCGC CGGCTATGTG GCCGGCGCGG ACCTCGATGC GGCGCGCGCC GTCGCCCGGC GGATCCGGGC CGGGTGGGTC GCGATCAACG ACGGGTTCGA CTTCGGTGGT CCGGTCGGCG GCTACAAGAA GAGCGGGAAC GGGCGCGAGT GGGGCGAGTT CGGTTTCCAC GAGTACCTGG AGACCAAGGG CATCCACGGC TACTAG
|
Protein sequence | MREYLKFYIG GQWAEPAEQR TFDVVNPATE QVAGRVALGS ATDVDRAVAA ARAAFPSWSA TSREERIAVL ESILDVYQKR AGDLATALTE EMGAPAALAN GFQVGLGAGH LTTAIEILKN FSFEEQRGVT RVVLEPIGVC GLITPWNWPI NQIAVKVLPA LATGCTVVLK PSEESPFTGQ ILAEIFEAAG VPAGVFNLVQ GDGPSVGVPL SAHPDVDLIS FTGSTRAGIE IAKNAAPTVK RVTQELGGKS PNIVLDDQDF AENVAKGVIN MMGNSGQTCT APARLLVPSA RMEEAISAAR EAAAQVTVGD PNGEFTIGPV ASGRQFEKIQ GLIQQGIDEG ATLVAGGTGR PDGLETGFYV KPTVFADVTN DMIIGREEIF GPVLTIHGYD SVDHAVELAN DTEYGLAGYV AGADLDAARA VARRIRAGWV AINDGFDFGG PVGGYKKSGN GREWGEFGFH EYLETKGIHG Y
|
| |