Gene Information |
Locus tag | Franean1_6540 |
Symbol | |
ID | 5674855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7954269 |
End bp | 7955207 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641245389 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001510783 |
Protein GI | 158318275 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
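The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the last codon is an untranslated stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 7954269, End bp 7955207.
length = gene_length(7954269, 7955207)
print(length)                  # 939, matching "Gene Length"
print(protein_length(length))  # 312, matching "Protein Length"
```

Under these assumptions the coordinates, the gene length, and the protein length in the record agree with one another.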
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.216575 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCAAGG ACCACCGAGG GGCCAGGCCC CCGGGCCGCC GGAGCCTGCT TGCCGGGGCG GCGGGACTCG TCACCACGGT GGCCGCGGCG ACGGCCGTAC CCGCCGGCGC GGCCGCTGCG GCGCCCACCC CTGCCCCGGC CCCCATACCG GCGCCGGCGC GTCGGTTCGC GGGGAAGGTC GTCCTTGTCA CGGGAGCGAC CTCGGGCATC GGACGGGCGG CGGCCATCGC GTTCGCCCGG GAGGGAGCGA GAGTCGGCTT CTGCGGCAGG CGGGAGGAGC TCGGCCGCCG TGTGGAGGAG GAGATCCGCG CCGCCGGCGG AGAGGCCACC TACACGCGGG CCGACGTCCG CGATCCTGCC CAGGTCGAGT CGTTCGTGGC CGGTGTCGCC GACCGGTACG GCCGGCTGGA CGTCGCGCTC AACAACGCCG GAACCCAGAT TGTGAAGCCG CTGCACGAGA TGACCGTCGA GGAGTGGGAC GACACCGCCC ACACCAACAC CCGCGGGGTC TTCCTCGCCA TCAAGTACGA GGTGCCACCG ATGCGGGAGT CCGGCGGCGG CGTCATCCTC GTCACCGGCT CCGCGAACGA GTTCGCCAGC AGGCCCGGAC TGGGCGCCTA CAGCGCGAGC AAGGGAGGCG TCACCGGCCT GGTACGCACC GCGGCCCTGG ACTACGGACA GGACAACATC CGGGTCGCCG CGCTCTCTCC GGGCACCACC GACACGGGGC TGGTCGACCG CCGCCGCCCC CCGAACATCA CCGACGAGCA GTGGGCCGCC GGCAAGGCCC AGTACGGTGC GGACAACGTC GACGCGCTGC GCCGGATGGC CCGCCCGGAG GAGATGGCCG CCGCCGCCCT CGCCCTGGCC TCGCCGGACA TGAGTTTCCT GACGGGCACC TCGGTTCTCG TCGACGGGGG CATGCTCGCC GGCCTGTGA
Protein sequence | MTKDHRGARP PGRRSLLAGA AGLVTTVAAA TAVPAGAAAA APTPAPAPIP APARRFAGKV VLVTGATSGI GRAAAIAFAR EGARVGFCGR REELGRRVEE EIRAAGGEAT YTRADVRDPA QVESFVAGVA DRYGRLDVAL NNAGTQIVKP LHEMTVEEWD DTAHTNTRGV FLAIKYEVPP MRESGGGVIL VTGSANEFAS RPGLGAYSAS KGGVTGLVRT AALDYGQDNI RVAALSPGTT DTGLVDRRRP PNITDEQWAA GKAQYGADNV DALRRMARPE EMAAAALALA SPDMSFLTGT SVLVDGGMLA GL
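The sequences above are displayed in space-separated blocks of 10 characters. Here is a minimal sketch of how such a grouped sequence could be joined back into one string and its GC percentage computed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first three blocks of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three 10-base blocks of the gene sequence, copied from the record.
first_blocks = "ATGACCAAGG ACCACCGAGG GGCCAGGCCC"
seq = clean_sequence(first_blocks)
print(len(seq))         # 30
print(gc_percent(seq))  # 70.0
```

Passing the full gene sequence through the same two functions would give a figure to compare against the "GC content" field.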