Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4594 |
Symbol | |
ID | 5672939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5476099 |
End bp | 5477019 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243455 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001508871 |
Protein GI | 158316363 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.90193 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCTGA TCGGCACCAG TCGAAAATCC TTAGCCGTGA TCACCGGCGC CTACGGCGGA ACAGGCCGCG CCGTGGCCCG GCGCCTCGGC GTGCGCTACC GGCTCGTCCT CTCCGGCCGC AACGAGGTCG CGCTGGCCGC GCTGCACGAC AGCCTCACAG AAGAGGGCTA CGACATCGCC CTCTCCGTCG CCACCGACGT GGCGAGCTCG GACAGCGTCC AGCGGCTCGC CGCGGCGGTG AGCGAGGCCG GCACCCTGGG CACGCTGGTC CACACGGCCG GCCTCTCCCC CGCCCTCGCC GGCCCGCGGA CGATCGTGGA GGTCAACCTC CGCGGGACGG CGCACCTGCT CGACGCGTTC CTGCCGCTGG CCGTCCCCGG CAGCTCGGCG GTCTGCATCG GCTCGGTGGC GGCCCACACC TTCAGCTCAT CGGCCGAGGT CGACGCCGTG CTCGACGACC CGCACTCCGG CCGCCTCGCC GTCGACCTGG AGCGGCTCCT GCTCGCCGCC GACCCGAACC CGACTCCCTA TTTCTTCGCT GTACGCGCCT ACGGGGCGTC GAAGCGCGGC GTCCTGCGGC TGGTGGAACG GTCGGCCGGT GCCTGGGCCG AGCGCGGTGC CCGCATCCTG TCGGTCTCGC CGGGCACCGT GCTGACCCCG ATGGGCCGCC AGGAGATGGC GGCGAACCCG CTGGCCGCCG CGGCCGCGGA GGTGACCCCG CTACGGCGGC TCGGAATGCC CGCCGACATT GCGGCGGCGA TCGACTTCCT CGTCTCCGAC AGCGCCAGCT ACATCACCGG GTGCGACCTG CGCGTCGACG GCGGCATCGT CGCCGCCAGG CGTCATCCGG CCGACGCTCC CCGTCCCGCC AGCCCGCCGA CTCCCGTGAG CGAGCCTCAT GTCACCCTCG ATCCAGCCTG A
|
Protein sequence | MNLIGTSRKS LAVITGAYGG TGRAVARRLG VRYRLVLSGR NEVALAALHD SLTEEGYDIA LSVATDVASS DSVQRLAAAV SEAGTLGTLV HTAGLSPALA GPRTIVEVNL RGTAHLLDAF LPLAVPGSSA VCIGSVAAHT FSSSAEVDAV LDDPHSGRLA VDLERLLLAA DPNPTPYFFA VRAYGASKRG VLRLVERSAG AWAERGARIL SVSPGTVLTP MGRQEMAANP LAAAAAEVTP LRRLGMPADI AAAIDFLVSD SASYITGCDL RVDGGIVAAR RHPADAPRPA SPPTPVSEPH VTLDPA
|
| |