Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6443 |
Symbol | |
ID | 5674758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7834192 |
End bp | 7834953 |
Gene Length | 762 bp |
Protein Length | 253 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641245291 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001510686 |
Protein GI | 158318178 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.246968 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.169503 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGTAT TGGACAGATT CCGCCTCGAC GGCAGGGTCG CCATCGTCAC CGGGGCCTCG TCGGGGATCG GCGTCGGCTT CGCCCGCGGG CTCGCCCAGG CCGGCGCGGA CGTCACTCTC GGTGCCCGGC GCACCGACCG GCTCGCCGCG ACCGCCGCGC TGGTCGAGGC GGAAGGCCGC CGGGCCGCCG CCGTCGGCAC CGACGTGGCC GACCCGGAGT CGTGCCGGAA CCTCGTCACG GCCGCCATGG ACACCTTCGG GCGGGTCGAC ATCCTGGTCA ACAACGCCGG GGTCGGTTCG GCCCACCCGG CGCTGCGGGA GACCCCGGAG CAGTTCCGCT CGGTCATCGA CGTCAACCTC AACGGCTGCT ACTGGATGGC CCAGGCCGCC GCGGCCGTCA TGCAGCCGGG CAGCAGCATC ATCAACATCT CCAGCGTGCT CGGCCTGACG ACGGCCGGCC TCCCGCAGGC CGCCTATTCG GCCAGTAAGG CGGCCCTGCT CGGGCTGACC CGCGACCTCG CCCAGCAGTG GACGGGCCGG CGCGGGATCC GCGTGAACGC CCTGGCCCCC GGGTTCTTCG AGTCCGAGAT GACCGACCAG TACCTGCCCG GGTACCTGGA GAGCCAGTCG GCACGGATCC TCGCCGGCCG CTTCGGCGAC CTTGAGGAGC TGACCGCCGC GCTGGTCTTC CTCGCCTCCG ACGCCGGCGG CTACGTCACC GGACAGACCC TCGTCGTCGA CGGCGGTGTC TCCATCACCT GA
|
Protein sequence | MSVLDRFRLD GRVAIVTGAS SGIGVGFARG LAQAGADVTL GARRTDRLAA TAALVEAEGR RAAAVGTDVA DPESCRNLVT AAMDTFGRVD ILVNNAGVGS AHPALRETPE QFRSVIDVNL NGCYWMAQAA AAVMQPGSSI INISSVLGLT TAGLPQAAYS ASKAALLGLT RDLAQQWTGR RGIRVNALAP GFFESEMTDQ YLPGYLESQS ARILAGRFGD LEELTAALVF LASDAGGYVT GQTLVVDGGV SIT
|
| |