Gene Information |
Locus tag | Franean1_3875 |
Symbol | |
ID | 5672238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4605029 |
End bp | 4606039 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641242753 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001508173 |
Protein GI | 158315665 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
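The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The helper names `gene_length` and `expected_protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(4605029, 4606039)
print(length)                           # 1011
print(expected_protein_length(length))  # 336
```

Both results agree with the Gene Length and Protein Length rows of the record.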
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.893798 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0397379 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGCGG CCACCTCCGG CGCCGGCGGC CCGGCCACCG GTGCCGGCGC GGTGGCCGGG CCCGGGTTCG ACCGGGAGTC GACGGCGCTG GAGGTGGCCC GCAGTGCCGA CCTGCGTGGG CGCGTCGCGG TGCTGACGGG GGCCTCCTCC GGCATCGGCG TCGAGACGGC GCGCGCCCTG GCAGCCACCG GCGCCGACGT CGTGCTGGGC GTTCGGGATG TCGCCGCCGG CGAGGAGCTG GTCCGCGAGG TGCGGGCCGG CGCCACAGGC GACATCCGGG CGGAGCGGCT GGACCTGAGC GATCTCGGTT CGGTCGTCGC GTTCGCCGCC CAAGTCACTG GCCCCGTCGA CCTGCTGATC GCCAACGCGG GGGTCTCCAG GACCCCGGAG TCACACCTGC CCAACGGGCT CGACGTCCGC TTCGCGACGA ACCACCTGGG CCACTTCCTG CTGGCGCTGC GCCTGAGCGA ACAGCTCGCC GAGCGTGGAG CGCGGATCGT CGTGGTCAGC TCGGGCGCGC ACAGGAGCAT CCCCGTCCGC CTCGACGACC TGCAGTGGAC CGCCCGGCGG CACAACCCGG GGATGGCCTA CGCCGAGTCG AAGACCGCGA ACATCCTCTT CGCCCAGGAG GCGACCCGCC GGTGGGGACC CGACGGGATC TTCGCGAACG CGGTGCTGCC CGGCTCGGCG CTGACCGGCC TGCAACGCTT CCACGGGGAC GAGATGAAAC GCCGGATCGG CTTCCTCAAC GAGGACGGAT CTCCCAACCC GGTGCTTAAA TCCCCTGCCC AGGCCGCGGC CACGACCCTC TGGGCCGCCA CGGCCCCGGA ACTGGCCGGG CGTGGCGGCC TCGTCCTCGA GGACTGCGCA GAGGCGCTAC CCCCCGGCCC GCCCGGCTCG GACGTCCTGG TCCGCTCGGG CTTTGACCCC TCGGTCGCCG ACCCCGACAC GGCCCGCCGC CTGTGGGACC GCTCCATCGA GCTGCTCCGG GTCCTCGGCC GACCAGAATG A
|
Protein sequence | MTAATSGAGG PATGAGAVAG PGFDRESTAL EVARSADLRG RVAVLTGASS GIGVETARAL AATGADVVLG VRDVAAGEEL VREVRAGATG DIRAERLDLS DLGSVVAFAA QVTGPVDLLI ANAGVSRTPE SHLPNGLDVR FATNHLGHFL LALRLSEQLA ERGARIVVVS SGAHRSIPVR LDDLQWTARR HNPGMAYAES KTANILFAQE ATRRWGPDGI FANAVLPGSA LTGLQRFHGD EMKRRIGFLN EDGSPNPVLK SPAQAAATTL WAATAPELAG RGGLVLEDCA EALPPGPPGS DVLVRSGFDP SVADPDTARR LWDRSIELLR VLGRPE
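The protein sequence can be related to the gene sequence by translating it with translation table 11 (the bacterial, archaeal and plant plastid code). A minimal sketch follows, under these assumptions: table 11 uses the same codon-to-amino-acid assignments as the standard code, and an alternative start codon such as GTG is read as methionine when it is the first codon. The function names `clean`, `translate` and `gc_content` are hypothetical helpers written for this example.

```python
from itertools import product

# Standard codon ordering (T, C, A, G), shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with methionine
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in this record.
print(translate("GTGACCGCGG CCACCTCCGG CGCCGGCGGC"))  # MTAATSGAGG
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the Protein sequence and GC content rows of the record.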