Gene Information |
Locus tag | Franean1_4592 |
Symbol | |
ID | 5672937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5473599 |
End bp | 5474459 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243453 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001508869 |
Protein GI | 158316361 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
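The length fields in this table follow from the coordinates. Here is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption, though it matches the listed 861 bp) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5473599, 5474459)
print(length)                  # 861
print(protein_length(length))  # 286
```

Both values agree with the Gene Length and Protein Length fields of the record.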
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.338469 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGACGCA TGGACGGCAA GGTCGTCTTC ATCACCGGTG CGGCGCGTGG CCAGGGGCGG GCGCACGCCG TCCGGGTAGC GGCGGAGGGA GGCGACGTCG TGGCCGTCGA CCTGTGCGCC GACATCGCCT CCACGCCCTA CCCGATGGCG ACCCGGGACG ACCTGGACGA GACGGCCCGC CTGGTCAAGG AGCGCGGCGG CCGCGTCGTC GCGCAGGTCG CCGACGTCCG CGACCGGGCC GCGCTGGCCG CCGCCGTCGC CGAGGGCATC GCCCAGTTCG GCCGGTTGGA CGGCGTGGTG GCCCAGGCCG GTATCTGCCC GCTCGGTACG ACGGCGCCGC AGGCCTTCGT CGACGCGGTC AGCGTCGACT TCGGTGGCGT CTTCAACGCC GTCGACGTCG CCCTGCCCCA CCTGCAGCCC GGGGCCTCGA TCGTCGCGAC GGGAAGCCTG GCCGCGTTGA TCCCCGGCAC ATTGGACAAC GCGGCCAAGG GGTCCGGCGG CCTGGGCTAC GCCTGGGCCA AGCGGGCGGT GGCGTCGCTG GTCCACGACC TCGCCGTCGT CCTGGCTGGT CAGAGCATCC GGGTGAACGC CGTCCACCCG ACCAACGTCA ACACCGACAT GCTGAACAAC GACGTCATGT ACCGGGCGTT CCGCCCGGAC CTGGCCGAGC CCACCCTCGA GGACGTGCTG CCGTCGTTCC CGGCCATGAC CGCGACAGGC GACCCGTACG TCGAGCCCGA GGACATCGCC GACGCGGTCC TCTTCCTGCT CTCCGACGAG TCCCGCTTCA TCACCGGCAC CCAGCTCCGC GTCGACGCGG GAGGTTACGT CAGGCTGCGG CCGCAGGTGC CCGCCTTCTG A
|
Protein sequence | MGRMDGKVVF ITGAARGQGR AHAVRVAAEG GDVVAVDLCA DIASTPYPMA TRDDLDETAR LVKERGGRVV AQVADVRDRA ALAAAVAEGI AQFGRLDGVV AQAGICPLGT TAPQAFVDAV SVDFGGVFNA VDVALPHLQP GASIVATGSL AALIPGTLDN AAKGSGGLGY AWAKRAVASL VHDLAVVLAG QSIRVNAVHP TNVNTDMLNN DVMYRAFRPD LAEPTLEDVL PSFPAMTATG DPYVEPEDIA DAVLFLLSDE SRFITGTQLR VDAGGYVRLR PQVPAF
|
| |
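The sequences above are printed in space-separated blocks of ten, so the grouping has to be removed before any analysis. Here is a minimal sketch, using hypothetical helper names, that strips the grouping, computes GC percentage, and checks the first and last codons of the gene sequence. It only inspects the first and last blocks of the record, so it does not reproduce the listed whole-gene GC content.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces/newlines used to group residues in blocks of ten."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in dna)
    return 100.0 * gc / len(dna)


# First and last blocks of the gene sequence listed in the record.
head = clean_sequence("GTGGGACGCA TGGACGGCAA")
tail = clean_sequence("CCGCCTTCTG A")
print(head[:3])   # GTG, an alternative start codon in translation table 11
print(tail[-3:])  # TGA, a stop codon
print(gc_percent(head))
```

Passing the full gene sequence to `clean_sequence` and then `gc_percent` would give a value to compare against the GC content field.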