Gene Information |
Locus tag | Franean1_3365 |
Symbol | |
ID | 5671736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3990170 |
End bp | 3991060 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641242253 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001507673 |
Protein GI | 158315165 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
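The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the reported protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
start_bp, end_bp = 3990170, 3991060
length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # prints: 891 296
```

Under these assumptions the computed values agree with the Gene Length (891 bp) and Protein Length (296 aa) fields.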
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGATGT TCTTGCCGCC CTACGCAGAC CGGCCCGTCG CCGTCGTCAC GGGCGCCAGC TCGGGGATAG GCCGGGCGAC CGCCCGCACT CTCGCCGAGC AGGGCTGGCA GGTGATCGGC GTCGGCCGTG ATCCGGTACG CAGCGCCGCC ACCGAGGCCG AGATCGCCGC GGCGGCGCGC AAGGACGGCG GCTTCACGAT GCTGCGCGGC GACTTCACCC TCATGGCGGA TGTCCGGCGC GTCGCCGGCG AGATCCAGCA CCTCACCCCG CGCCTCGACA TCCTCATCAA CAACGCCGGC GGCGTGCGGG ACCGGCAGAT CATCAGCGCG GAGGGTACCG AGGCGACCTT CGCCGTCAAC CACCTCGCCC CGTTCCTGCT GACCCGGGAG CTCACGCCCC TGCTGCGGGC GAGCGCGGCC AGCCTGCCCG CCGGGTCGAC GCGGGTGATC GCCGTGTCGT CGAGCGGCCA CCGCGCGGTC GACGGGCTCG ACTGGGACGA CCTGCAGAGC CTCGGCGAGT TCCGTCCCGC CGTCGCCTAC TGCCGGGCCA AGCTCGCCAA CATCCTGTTC ACCCGCGAGC TGGCCCGGCG GGCCGGGCCG GACGGGATCG TCGCCCAGGC CATGCATCCC GGCGTCGTGG CCAGCAACTT CAGCGCGCAC GGCGACGCGG CGATGCGGGC GCACATGGCC GCGGCCGACA CGGTGCCACC CGACGAACCG GCGGAGACGC TGGCCTGGCT GGCCACCGAG CCCGAAGGCG GCCGCACCGG CGGGCGCTAC TTCCACCGCA GGGCGGAGGA GCTACCCGCC GAAGCCGCGC GGGACGATGC CGCCGCCGCC CGCCTCTGGA CCGAGAGCGA GCGGCTGCTG GACGGGCTCG GCTTCCGCTG A
Protein sequence | MEMFLPPYAD RPVAVVTGAS SGIGRATART LAEQGWQVIG VGRDPVRSAA TEAEIAAAAR KDGGFTMLRG DFTLMADVRR VAGEIQHLTP RLDILINNAG GVRDRQIISA EGTEATFAVN HLAPFLLTRE LTPLLRASAA SLPAGSTRVI AVSSSGHRAV DGLDWDDLQS LGEFRPAVAY CRAKLANILF TRELARRAGP DGIVAQAMHP GVVASNFSAH GDAAMRAHMA AADTVPPDEP AETLAWLATE PEGGRTGGRY FHRRAEELPA EAARDDAAAA RLWTESERLL DGLGFR