Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3305 |
Symbol | |
ID | 5671677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3917023 |
End bp | 3917829 |
Gene Length | 807 bp |
Protein Length | 268 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641242194 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001507614 |
Protein GI | 158315106 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.552464 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCCGGGA GGCGCGGCGC GCTTCGTCGC CGTCGCGCCG TCGCGGGCAG CGGCCGGTTC CAGATGAAGC GGGTCGCCGT GGTGACGGGC GGTGCGTCGG GGATGGGGCT TTCCATCTGC CGGCATCTCG CCGACCGGGG TCACAGAGTC GCCGTGGTCG ATCGCAACGG TTCGGCCGCG CGGCAGGCCG CGAAGGAGCT GGCCGCGGAC GGGGCGGAGG CGTTGGCGTG CGAAGTCGAC GTCACCAGCC GTCCGGACGT GGAGAACGCG CTGAGCCAGG TCCGCGAGGA GTTCGGCCCG GTGGAGATAC TGGTCACGAG TGCGGGGCTC GCGGCGTTCG AGCCTTTTGT TGAAATCACA CTCGAGTCCT GGAACAGGGT GATCGAGGTG AATCTTACCG GGACTTTCCA CTGCGTTCAG GCTGTGATCC CTGATATGGT CGCGGCGCGG TGGGGGCGTA TCGTGACCCT CTCGTCGTCA AGCGCTCAGC GGGGTTCGCC CCGCATGGTC CACTACGCCG CGTCCAAGGG TGCCGTCATC GCCATGACGA AGGCCCTCGC GCGGGAGTAC GCGTCGTTCG GGATCACGGT GAACAGCATC CCGCCGTCGA GTATCGACAC ACCGATGTCC CGCAGCGCCC AGGCGGCCGG CGACCTGCCG GACAACGAGG TCCTGGTGAA CGCGATTCCG GTGGGTCGGC TGGGGACGGG TGATGACATA GCCGCCGCCT GCGCCTTTCT CTGCTCCGAT GACGCCGGTT TCATCACCGG CCAGGTCCTC GGGGTGAACG GCGGGAGTGT CATGTGA
|
Protein sequence | MPGRRGALRR RRAVAGSGRF QMKRVAVVTG GASGMGLSIC RHLADRGHRV AVVDRNGSAA RQAAKELAAD GAEALACEVD VTSRPDVENA LSQVREEFGP VEILVTSAGL AAFEPFVEIT LESWNRVIEV NLTGTFHCVQ AVIPDMVAAR WGRIVTLSSS SAQRGSPRMV HYAASKGAVI AMTKALAREY ASFGITVNSI PPSSIDTPMS RSAQAAGDLP DNEVLVNAIP VGRLGTGDDI AAACAFLCSD DAGFITGQVL GVNGGSVM
|
| |