Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4459 |
Symbol | |
ID | 5672810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5326188 |
End bp | 5327006 |
Gene Length | 819 bp |
Protein Length | 272 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641243327 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001508743 |
Protein GI | 158316235 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACATCA GACTGGACGG CCGGACGGCT CTGATCACCG GGGCGTCGAG CGGGATCGGA CTGAGCATCG CCGAGGCGTT CCTCGAGGCG GGCGCCTCAG TCATGCTCAC CTCGCGGAAG GAGGAGTCAC TGCGTGCCGC GTCCGAGCGC CTCGGCGAAC GCTGCGACTT CCGGACAGCC AACGCGGGCG ATGGCGACGC CGCCGTGCGC TGCGTCGGGG ACACCCTCGA CCGGTTCGGC GGCCTCGATA TCCTGGTCAA CAACGCCGCG ACGAACCCCT TCTACGGCTC CCTGGTCGAC CTGGACCTCC AACGGGCCCA GAAGACAGTG CAGGTCAACC AGTTCGGCCC GGTGACGTGG ACCGCGGCTG CCTGGCACGC GGCGATGCGG GACCACGGCG GCGCCGTCCT CAACCTCTCG GCCATCGGCG CCTACCGGGT GGCACCCGGA CTCGGCTGGT ACGAGTCCAC CAAGGCAGCT CTGAACCACC TCACCGAGCA GCTGTCCTAC GAGCTGGCGC CGCAGGTCCG GGTGAACGCC ATCGCGCCGG GCCTGGTGAA AACCGAGCTG TCGCGTGCGC TGTGGTCGGA GCAGGAGACA CAGCTCGCCC GCTCGACGCT CCTGGGCAGG CTCGGCAGGC CCGAGGACAT CGCCCGCGCC GCGACCTTCC TCGTCTCCAA CGCGGCGTCC TGGATCACCG GGACCACACT GGTGGTCGAC GGGGGGATGC TCTGCGTACC CCCGGCGCAG GAGGTGATGG GAGCGGCACT CGAACGCGGG GAGAAAGCCG CCCCCGGCGA TCCGGTGGCC GCCCGATAG
|
Protein sequence | MDIRLDGRTA LITGASSGIG LSIAEAFLEA GASVMLTSRK EESLRAASER LGERCDFRTA NAGDGDAAVR CVGDTLDRFG GLDILVNNAA TNPFYGSLVD LDLQRAQKTV QVNQFGPVTW TAAAWHAAMR DHGGAVLNLS AIGAYRVAPG LGWYESTKAA LNHLTEQLSY ELAPQVRVNA IAPGLVKTEL SRALWSEQET QLARSTLLGR LGRPEDIARA ATFLVSNAAS WITGTTLVVD GGMLCVPPAQ EVMGAALERG EKAAPGDPVA AR
|
| |