Gene Information |
Locus tag | Franean1_3592 |
Symbol | |
ID | 5671961 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4254950 |
End bp | 4255810 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242478 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001507898 |
Protein GI | 158315390 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
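
The length fields above can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so, but the numbers are consistent with it) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(4254950, 4255810)
print(length_bp)                  # 861
print(protein_length(length_bp))  # 286
```

Both results match the Gene Length and Protein Length fields in the record.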
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.894525 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCGACC TGAGCATGAC CCACCTGAGC ATGACCCACC TGAGCATGAC CGAGCAGAGC ACCACCGACC AGACCGGGAA CGTGACCGGG AGACAGGTGC TCAAGGGGCG TGTCGCGCTG GTCACCGGCG GGTCCCGGGG CATCGGCCTC GGCATCGCGA GGGCGTTCCG CGCCGAGGGG GCGCACGTGG TCATCTGCGC GCGCAAGGCC GAGGGCGTGG CGGCCGCGGC GAAGGAGCTG CTGGCGGGCG AGGGCGACGG CGAGGTGCTC GGGCTCGTGG CGAACGCCGG TGAGCCCGAG CACGCCGAGC GGACGGTCAC GGCGGCGCTC GAGCGCCTCG GCCGGCTCGA CATACTGGTC AACAATGCCG CCACCAACCC CTACATGGGG CCGCTCGTCG ACATCGACCT GCCGCGCGCG GAGAAGACCA CCAAGGTCAA CCAGATCGGG ATGCTCGCGT GGATCCGGTA CGCGGTCCGC GGCTGGATGG TCGAGCACGG CGGCTCGGTT ATCAACATCG CGTCAGTCGG CGGGCTGATC GTCGACCCGG GCATTGGCTT CTACAACGCC ACCAAGGCGG CCATCCTGCT CATGACACGT CAGCTCGCCT ACGAACTCGG CCCGGCCGTC CGGGTGAACG CTCTCGCACC CGGCGTCATC AAGACCGAGC TGGCCCGCGC GGTGTGGGAG GTCCGCGAGC CTGTGCTCAC CGCCCAGCTC CCGCTGCGGC GGCTCGGCAC AGTGCAGGAC GTCGCCAACG CGGCTGTGTT CTTCGCGAGC GACGCGTCGT CGTGGATCAC CGGCCAGACC CTGGTGCTCG ACGGCGGCGC GCTCACCCTG CCCATCGCGC CGGGCCCGTG A
Protein sequence | MTDLSMTHLS MTHLSMTEQS TTDQTGNVTG RQVLKGRVAL VTGGSRGIGL GIARAFRAEG AHVVICARKA EGVAAAAKEL LAGEGDGEVL GLVANAGEPE HAERTVTAAL ERLGRLDILV NNAATNPYMG PLVDIDLPRA EKTTKVNQIG MLAWIRYAVR GWMVEHGGSV INIASVGGLI VDPGIGFYNA TKAAILLMTR QLAYELGPAV RVNALAPGVI KTELARAVWE VREPVLTAQL PLRRLGTVQD VANAAVFFAS DASSWITGQT LVLDGGALTL PIAPGP
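
The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how that grouping might be stripped and a GC percentage recomputed is shown here; the helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full record.

```python
def ungroup(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = ungroup(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


first_blocks = "ATGACCGACC TGAGCATGAC"  # first 20 bases of the gene sequence
print(len(ungroup(first_blocks)))  # 20
print(gc_percent(first_blocks))    # 55.0
```

Applying `gc_percent` to the complete gene sequence gives a value to compare against the GC content field.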