Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0618 |
Symbol | |
ID | 5669035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 718750 |
End bp | 720219 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641239545 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001504983 |
Protein GI | 158312475 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.527378 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAGCGA TCATCGCGGA GCGCCGCTCG TACGTGGCCG GCACCTGGGT CGAGGGCGAC GAGGTCTTCG CCGTCGAGAA CCCGGCCGAC GAGACCTCGG TGGCCGACGT GGCGGCCACC CCCCTGCCCG AGATCGAGCG CGCGATCACC GAGGCCCGCC GGTCCTTCGA CGAGGGGGTG TGGGCGGACC GCTCACCGGC GGAGCGGGCC AGGGTGCTGG GCGCGTTCCT CGACTACTTC CAGTCCGCGC GGGCCGAGCT TGTCGCCACC ATGGTCGCCG AGGCGGGCCA GCCCACCGGG TTCGCCGAGC GGGCGCAGTT CGGTGCCGGC CTCGGCCTGG CCCGCGGCAC CATCGACCTC TACCTGTCGA TGTCGCACGA GGAGGCCAAC CCGGTACCGC TGGACGACCT CGTCCGGGCC GGGGCGAGCC TGAGCTTCCG CCGGCACGAG CCCGTCGGCG TCGTCACCGC GATCACCCCC TACAACGGGG CGATCATCAT GGCGATGCAG AAGATCATCC CGGCGCTGAT CGCCGGGAAC TCGGTGATCC TGCGGCCCAG CCCGCTCACC CCGCTGTCCT CGCTGGTGTT CGGCGCGGCG GCCGAGGCGG CCGGGCTGCC GCCCGGCGTG CTCAGCGTGG TGGTGGAGGG CGGCGCCGCC GGTGCCGAGC TGCTGACCAC CCACCGGGCC GTCGACATGG TCTCGTTCAC CGGCTCGACC GTGGTCGGCC GGCAGATCCT CGCCCAGGCG GCCCCGACGG TGAAGCGGGT CGCCCTCGAG CTGGGCGGCA AGTCGGCCCA GATCTACCTG CCCGACGCCG TCGGGAGGGC CACCGCCGGG GCCGTCGCGG TCGTCGCCGC CACCGCCGGC CAGGCGTGCG TCGCCGCCAC CCGGATGCTG GTGCCGCGCG AGCGCAAGGA CGAGGTCCTC GACGCGGTGT CGCGCGCCTA CGGCGCCCTC ACCGTCGGCC CGCCCACCGA CCCGTCGGCG AAGCTCGGGC CGGTCATCAG CGCCGGCCAG CGCGACCGGT GCGAGCGCTT CGTCCGGTTG GCCGAGGAGA ACGGCGGGAA GGTGGTCACC GGCGGCGGGC GGCCCGCCGG GCTGGAGCGC GGCTACTACT TCGAGCCGAC CGTGCTCGAC CTCCCCGACA ACGCCAACCC GGCGGCCCAG GAGGAGATCT TCGGGCCGGT GATCAGTGTC CTGGGCTACC GGGACCTCGA CGACGCCGTG CGGATCGCCA ACGACAGCGA CTACGGGCTG TCCGGCCAGG TCTACGGCGC CGACGTCGCC GCGGCGGTGG GCGTCGCCCG CCGGCTGCGA ACGGGAGCGG TCAACGTCAA CACCGCCGTG TTCAGCGCCT ACGCGCCGGG CGGCGGCTAC AAGCACAGCG GCCTCGGCCG CGAGCGCGGG CCGGACGGCA TCCGCGCCTT CCAGGAAGTC AAGCACCTCG CCATCGGCGA GCTCCGCTGA
|
Protein sequence | MAAIIAERRS YVAGTWVEGD EVFAVENPAD ETSVADVAAT PLPEIERAIT EARRSFDEGV WADRSPAERA RVLGAFLDYF QSARAELVAT MVAEAGQPTG FAERAQFGAG LGLARGTIDL YLSMSHEEAN PVPLDDLVRA GASLSFRRHE PVGVVTAITP YNGAIIMAMQ KIIPALIAGN SVILRPSPLT PLSSLVFGAA AEAAGLPPGV LSVVVEGGAA GAELLTTHRA VDMVSFTGST VVGRQILAQA APTVKRVALE LGGKSAQIYL PDAVGRATAG AVAVVAATAG QACVAATRML VPRERKDEVL DAVSRAYGAL TVGPPTDPSA KLGPVISAGQ RDRCERFVRL AEENGGKVVT GGGRPAGLER GYYFEPTVLD LPDNANPAAQ EEIFGPVISV LGYRDLDDAV RIANDSDYGL SGQVYGADVA AAVGVARRLR TGAVNVNTAV FSAYAPGGGY KHSGLGRERG PDGIRAFQEV KHLAIGELR
|
| |