Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4846 |
Symbol | |
ID | 5673187 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5811722 |
End bp | 5812600 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641243702 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001509118 |
Protein GI | 158316610 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.145119 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTCTCCA ACGCACCGCC GAACGCCCGG AAACCTGTGG GCCGGGTCCT CGTCGTCAGT GGCGGGACAG ACGGGATGGG CCGCGCCCTC GCACTCGCTC GCGCCGATCG CGGCGACCAG GTCGTCGCGA TCGGCAGCAA CCCGACAAAG GGCGAGGCGT TGCTGGCCGA GGCCGCCCGG CGTGGCGTGG CCGGGCGAGT CCGCTTCGTC CGGGCTGACC TCGCGACTGT CGCGGGCAAC CGCCGTGTCC TCGAAGACAC TCTCGGCCAG CACGACAGGA TCGACGTGCT GGCACTGTTC GCCAACCGGC AAGCACCCAA ACGAACCCTG ACAGCGGACG GGCTCGAAAG CACCTTCGCG CTCTACTACC TCAGCCGCTA CGTCCTCAGC CATGGCTTCC GCGACGCGCT GGAGGCCAGT GACGCTCCGG TCATCGTGAA TGTCGCCGGC GTCGGCATCA CCAAGGGATC GATCCACTGG GACGACCTCC AACTGGAACG TGGCTACAGC ATGATCGCCG CGCAGCTGCA AGCAGCCCGA GCCAACGACC TACTCGGCGT CGCCTACACC GAGCACGCCA ACAGCAAGGC GCGCTATGTG CTCTACCACC CCGGATTCAC CAGGAGCGGA GACCTCAGCC CCCTGCCCGC GGCGCTGCGC GCCAGCATCC GGGCCGCCGC GAGGATCTCG GCGCGCCCCA TCGCCGAATC GATCGGCGCC ATCCACCACT TCATTGATGC GCCCCCTGCC GCAGGGTTGA CCGCGATCGA TCGGAACAAG CACCTACCGC TGACGCTCGA AACCCTGAAT CCGCAGAACG CGGAACGCCT CGCGCGAGCA ACCGAAGCGC TGGTCGCCGC GCTACCCAGC ACCCCGTAG
|
Protein sequence | MFSNAPPNAR KPVGRVLVVS GGTDGMGRAL ALARADRGDQ VVAIGSNPTK GEALLAEAAR RGVAGRVRFV RADLATVAGN RRVLEDTLGQ HDRIDVLALF ANRQAPKRTL TADGLESTFA LYYLSRYVLS HGFRDALEAS DAPVIVNVAG VGITKGSIHW DDLQLERGYS MIAAQLQAAR ANDLLGVAYT EHANSKARYV LYHPGFTRSG DLSPLPAALR ASIRAAARIS ARPIAESIGA IHHFIDAPPA AGLTAIDRNK HLPLTLETLN PQNAERLARA TEALVAALPS TP
|
| |