Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2715 |
Symbol | |
ID | 5671106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3212950 |
End bp | 3214407 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641241627 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001507047 |
Protein GI | 158314539 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.57091 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGGGC ACAACGAGCT CTACATCGAC GGGCGCTGGA CCACCCCTGC CAGCACCGAC CTGTTCGAGG TGGTGTCCGC GGCCACCGAA GAGATCATCG GGACGGTTCC CGCAGGGCAG CTCCAGGACA TCGACCGGGC GGTCGCCGCG GCCCGGGCGG CGTTCGACTC CGGTCCGTGG CCGCGGCTGG ACCCGGCTGA GCGCGGTGCC GCCCTGGGAC GGCTGTCGAA GGTGCTGCAG GCGAGATCCG AGGATCTCGC CGTGACGATC AGTCAGGAGA ACGGCACCCC CGTCGCGGCC GCCCGCATGG CGCAGGTGTT GATCGCGACG ATGACGCTGG ACTACTACGC GGGCCTGGGA GCCGGGCTCG CCCTCACCGA CACCAGGCCC GGCATGCTCG GGCCGGCTGT CGTGCGGCGG CGGCCGGTCG GGGTGGTCGG CGCGATCGTC GCCTGGAACG TGCCGCTGTA CCTGAGCGTG CTCAAGCTCG GCCCCGCGCT CCTCGCTGGC TGCACCGTCG TGCTCAAGCC GTCGCCGGAC GCACCGCTGT CCCTGCAACT GCTCGCCGAG GCCGCCGCCG AGGCCGACCT GCCGCCCGGT GTGCTCAACG TCGTCCCCGC CGACCGTGAC GTCAGCGAGT ACCTGGTCAC CCACCCGGGC GTCGACAAGA TCTCGTTCAC CGGCAGCACC GCGGCCGGCC GGCGCATCGC GGCACTGTGC GGCGAGCGGC TGCGCCGGGT GAGTCTGGAA CTCGGCGGGA AGGGCGCGGC GATAGTGCTC CCCGACGCCG ACCTGAACGC GGCGCTGCCC GGGATCCTCC AGTACGGATT CATGCTCAAC GGGCAGGCCT GCGTCGCGCA GACCCGGATC CTGGCCCCCC GTGCGCGGTA CGCCGAGCTG GTGGAGGGAC TACTCGCGCA GGTCGGCGCG CTGAAGGTCG GTGATCCCCT GGACGAGGCG ACCGAGCTCG GCCCGCTGGC CAACGCCCGG CAGCGCGACC GGGTCGAGGA GTACATCCGG ATCGGCCGGA GCGAGGGTGC CAGGCTGGTC CTCGGTGGCG GCCGGCCGGC GGACCACCCG CGCGGCTTCT TCGTCGAGCC CACGGTCTTC ACCGACGTCG ACCCGGCGAT GCGGATCGCC CAGGAGGAGA TCTTCGGCCC GGTGCTGACC GTCATCCCCT ACGACACGGT GGACGATGCG GTAGACATCG CCAACGGAAC CCGGTTCGGG CTGGGCGGGT CGATCTGGAC CGCCGACGTG GCGGCGGGCT ATGCCCTGGC CGACCGGATC GACGTCGGTG TCCTCGGCGT CAACATGTTC ATGCTGGACA ACGTCGCGCC GTTCGGTGGA TTCAAGGACT CCGGGCTCGG TCGGGAGCTC GGGCCGGAGG CCCTCGCGGC CTACCTCGAC TACCAGACGG TCAACCTTCC CGCCGGCGCC ATCGTGCCCG GGGTGTGA
|
Protein sequence | MSGHNELYID GRWTTPASTD LFEVVSAATE EIIGTVPAGQ LQDIDRAVAA ARAAFDSGPW PRLDPAERGA ALGRLSKVLQ ARSEDLAVTI SQENGTPVAA ARMAQVLIAT MTLDYYAGLG AGLALTDTRP GMLGPAVVRR RPVGVVGAIV AWNVPLYLSV LKLGPALLAG CTVVLKPSPD APLSLQLLAE AAAEADLPPG VLNVVPADRD VSEYLVTHPG VDKISFTGST AAGRRIAALC GERLRRVSLE LGGKGAAIVL PDADLNAALP GILQYGFMLN GQACVAQTRI LAPRARYAEL VEGLLAQVGA LKVGDPLDEA TELGPLANAR QRDRVEEYIR IGRSEGARLV LGGGRPADHP RGFFVEPTVF TDVDPAMRIA QEEIFGPVLT VIPYDTVDDA VDIANGTRFG LGGSIWTADV AAGYALADRI DVGVLGVNMF MLDNVAPFGG FKDSGLGREL GPEALAAYLD YQTVNLPAGA IVPGV
|
| |