Gene Information |
Locus tag | Franean1_4448 |
Symbol | |
ID | 5675735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5312886 |
End bp | 5313707 |
Gene Length | 822 bp |
Protein Length | 273 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641243316 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001508732 |
Protein GI | 158316224 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
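The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between sequence blocks is ignored."""
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    length = gene_length(5312886, 5313707)
    print(length)                           # 822
    print(expected_protein_length(length))  # 273
```

Under these assumptions the values agree with the record: 822 bp and 273 aa. `gc_fraction` can be applied to the gene sequence listed in the Sequence section to compare against the reported GC content.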
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGACGTG TCGAAGGCAA GGTCGCGGTA ATCACCGGTG CCGCGCGGGG GCTAGGCCGC GCATCCGCGC TGCGGCTGGC CGAGGAGGGG GCCGACATCA TCGCGCTTGA CCTTGCGGCC GATCCGGGGT TAACCAGCTA CGCGATGAGC GGAGCCAATG ACCTCGAATG CACGGCTGAG GCGTTGCGGT CGCTCGGCCG CCGCGTGGTC ACCGGATACG CGGACGTCCG GGACCTGGAC GCAGTCGTGG CCTCGATCGA GGCCGGGGTC CGCGAGCTGG GGCGGCTCGA TCTGGTTTGC GCCAACGCGG GAATGGTCTC CTACGGCGGC GTGCTTGACC TCACCGAGAC GCAGTGGCGG GACCTGCTCG ACGTGCACAT CACCGGCTCC TGGCACACCG TGCGTGCCTC CGCACCCCAT CTCATCGCCG CCGGCGGTGG GTCCATCGTG CTCACCAGTT CAGTGGCGGG GCTGCGGGCC GAGGCCGGCG TCGCCCACTA CGTCACGGCT AAGCATGGGC TGGTGGGGCT GATGCGCAAC CTGGCTGTCG AGCTCGCTCC ACACGGCATC CGGGCCAACT CCGTGCATCC CACGATGACG AAGACAAGGA TGATCACGAA CGAGTCGACC CAGGCTCTGA TCGCCTCGGG AGCCGAGGCC GGGATCAGTG CCGAGGACGC CGCTCGAAAG CGCCACCTGC TCGACGTGCC GTGGATCGAG GCTATCGACG TCGCCAACGC CGTGCTCTAC CTGCACTCGG ACGAAGCGCG CTACATCACC GGAGTCGCAC TACCGGTGGA CGCCGGGGCT TGCCTTGTCT GA
|
Protein sequence | MGRVEGKVAV ITGAARGLGR ASALRLAEEG ADIIALDLAA DPGLTSYAMS GANDLECTAE ALRSLGRRVV TGYADVRDLD AVVASIEAGV RELGRLDLVC ANAGMVSYGG VLDLTETQWR DLLDVHITGS WHTVRASAPH LIAAGGGSIV LTSSVAGLRA EAGVAHYVTA KHGLVGLMRN LAVELAPHGI RANSVHPTMT KTRMITNEST QALIASGAEA GISAEDAARK RHLLDVPWIE AIDVANAVLY LHSDEARYIT GVALPVDAGA CLV
|
| |
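The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. The listed gene sequence begins with GTG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch assumes the codon assignments of translation table 11, which match the standard code, and treats an initiating ATG, GTG, or TTG as methionine; this start-codon set is a simplification and does not cover every initiator the table allows. All names are hypothetical.

```python
# Codon table built in the conventional TCAG order (standard assignments,
# which translation table 11 shares for elongation codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a simplified subset of bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str, is_cds_start: bool = True) -> str:
    """Translate a coding-orientation DNA string up to the first stop codon.

    Whitespace (such as the 10-base blocks in the record) is removed first.
    A codon containing characters other than A, C, G, T raises KeyError.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    print(translate("GTGGGACGTG TCGAAGGCAA GGTCGCGGTA"))  # MGRVEGKVAV
```

The first 30 bases of the gene sequence yield MGRVEGKVAV, the first block of the listed protein sequence, and the final codons TGC CTT GTC TGA yield CLV followed by a stop.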