Gene Information |
Locus tag | Franean1_4489 |
Symbol | |
ID | 5672839 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5356385 |
End bp | 5357224 |
Gene Length | 840 bp |
Protein Length | 279 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641243356 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001508772 |
Protein GI | 158316264 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
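The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene span includes the terminal stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a span that ends with a
    stop codon, which is not counted as an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # subtract the stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(5356385, 5357224)
print(gene_len, protein_len)  # 840 279
```

Under those assumptions the result matches the listed Gene Length (840 bp) and Protein Length (279 aa).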
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGGGC GAGTCGCCGG GAAGGTCGCG CTGATCACCG GGGCGGCGCG CGGGCAGGGC CGCAGCCACG CGGTTCGGCT CGCGCAGGAG GGGGCCGACA TCATCGCCGT CGACCTCTGC GCCGACGTGC CGGGCGTGCC GTACCCGGGG GGCACCCGCG AGGATCTGGC CGAGACGGTA CGGCAGGTGG AGGCCCTCGA TCGGCGGGCC GTGGCGACGG TCGCCGACGT GCGTGACCAC GAGCAGCTCG CGGCGGCTGT GGTGGGGGGT GTCGCCGAGT TCGGGCGGCT CGACGTGGTC AGCGCGAACG CGGGCATCGC CATGCCGCCC TTCCCCACCC ACGAGATGCC CGAGGAGGTG TGGCAGGGCA TGCTTGCGGT CAACCTGACC GGCGTCTGGC ACACCTGCAA GGCCGCCATA CCGCACCTGA TCGCGGGTGG CCGCGGCGGG TCGATCATCC TTACGAGTTC CGCGGCTGGT CTCAGGGGTT ACGAGAACAT CGCCAACTAC GTCGCGGCCA AGCACGGCGT GGTCGGTCTG ATGCGGACGC TGGCCAACGA GCTCGCCCGG CACTCGATCC GGGTGAATTC GGTGCATCCC ACCACTGTCT CGACCGAGAT GATCCAGAAC GAGTCGACCT ACCGCCAGTT CCGGCCGGAC CTGACCGACA CGCCGACCGA GGACGACGTG CGCGACGCGT TCACGTCGCT CAATCTGATA CCGGTGCCCT GGATTGAGTC GATTGACGTG TCGAACGCGC TGCTGTTTCT CGCGTCCGAC GAGTCTCGGT ACATCACCGG CATCACGCTG CCGATCGACG CCGGCCAGAT GGTCAAGTAG
|
Protein sequence | MAGRVAGKVA LITGAARGQG RSHAVRLAQE GADIIAVDLC ADVPGVPYPG GTREDLAETV RQVEALDRRA VATVADVRDH EQLAAAVVGG VAEFGRLDVV SANAGIAMPP FPTHEMPEEV WQGMLAVNLT GVWHTCKAAI PHLIAGGRGG SIILTSSAAG LRGYENIANY VAAKHGVVGL MRTLANELAR HSIRVNSVHP TTVSTEMIQN ESTYRQFRPD LTDTPTEDDV RDAFTSLNLI PVPWIESIDV SNALLFLASD ESRYITGITL PIDAGQMVK
|
| |
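The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAG even though the gene lies on the minus strand), and it uses the standard amino acid assignments, which translation table 11 shares for internal codons. Table 11 also permits alternative start codons, which this sketch does not handle; that is not needed here because the gene starts with ATG. The helper names `clean` and `translate` are hypothetical.

```python
# Codon table built from the canonical T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGGCCGGGC GAGTCGCCGG GAAGGTCGCG"))  # MAGRVAGKVA
```

The output matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would extend the same check to the whole record.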