Gene Information |
Locus tag | Franean1_4480 |
Symbol | |
ID | 5672830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5345640 |
End bp | 5346557 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641243347 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001508763 |
Protein GI | 158316255 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAGT TTCGTTTTGG CAACCGGGTT GCCCTGGTGA CCGGTGCGGG GGGCGGTCTC GGACTGCAGC ACGCGAAGCT CCTGGCGTCG CGCGGCGCCC GCGTGGTTGT CAATGACCTC GGTGGAACCG TCGCTGGTCA GGATGCCGAC GGCGGCGCCG CCCACCGCGC CGCCGACGAG ATCAAAGAAG CCGGCGGGGA AGCGGTGGCT GACACGCGCT CGGTATCCAC CCCGGACGGC GCGGCCGCCA TGGTCCAGAC CGCACTCGAC ACGTTCGGCC GTATCGACAT CGTGGTCAAC AACGCCGGAA TTCTCCGGGA CAGATCGCTG ACCAAACTCG AGCCGGCCGA CTTCGACGCC GTGATCGACG TCCATCTGCG GGGTTCCTTC CTGGTCACCC AGGCAGCCTT TCCGCATCTG CGTGAGCAGC GCTACGGCCG CATAGTGAAC ACGACATCGC CAGCCGGCCT GTACGGCAAC TTCGGGCAGG CCAACTACTC GGCGGCGAAG GCCGGACTGA TCGGCCTCAC TCGGACGGTG GCGGTCGAGG GAGCGAAGTA CGGTGTTTCC TGCAATGCCG TCTCACCGGC CGCGCTGACC CGCATGACCG AAGAAATCAT GGGACAGCTC TTCGTCGACC CGGACGGAGC TGCCCGCCTC GACGCGGCCA AGGTCTCTCC GGTTGTGGCC TGGCTGTGTC ACGAGCAGTG CGCACTGACC GGTCAGGTCT TCGGTGTGGC CGGTGGACTG GTCACCAAGA TAATCATCGC GGAGACCCGT GGCTTCTTCG ATCCGGACCT GACCATCGAG ACGGTGGCGG AACGTGTCGA TGCTATCCAG AACATGGCGG ACCTCGCGGT TCCGGGCAGC GTCGGCGACA GTATGTCCCT GCTCTTCGAT CATTACGCCA AACCTTGA
|
Protein sequence | MAEFRFGNRV ALVTGAGGGL GLQHAKLLAS RGARVVVNDL GGTVAGQDAD GGAAHRAADE IKEAGGEAVA DTRSVSTPDG AAAMVQTALD TFGRIDIVVN NAGILRDRSL TKLEPADFDA VIDVHLRGSF LVTQAAFPHL REQRYGRIVN TTSPAGLYGN FGQANYSAAK AGLIGLTRTV AVEGAKYGVS CNAVSPAALT RMTEEIMGQL FVDPDGAARL DAAKVSPVVA WLCHEQCALT GQVFGVAGGL VTKIIIAETR GFFDPDLTIE TVAERVDAIQ NMADLAVPGS VGDSMSLLFD HYAKP
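The fields in this record can be cross-checked against one another: the gene length follows from the start and end coordinates, the GC content from the gene sequence, and the protein sequence from translating the gene sequence with translation table 11. The sketch below is illustrative only and is not part of the source record. The helper names (`clean`, `gene_length`, `gc_percent`, `translate`) are hypothetical, and it assumes the Gene sequence field is given 5'→3' in coding orientation (it begins with ATG and ends with TGA), so no reverse complement is taken even though the gene lies on the minus strand.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. For elongation codons,
# translation table 11 uses the same amino-acid assignments as the standard
# code; it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gene_length(start_bp, end_bp):
    """Length in bp of a feature with inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop.

    Assumption: the sequence starts with ATG, so no special handling of
    alternative start codons (allowed by table 11) is needed.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # Coordinates taken from the record: 5346557 - 5345640 + 1
    print(gene_length(5345640, 5346557))  # 918
    # First three 10-base blocks of the gene sequence
    print(translate("ATGGCCGAGT TTCGTTTTGG CAACCGGGTT"))  # MAEFRFGNRV
```

Passing the full Gene sequence value to `translate` and `gc_percent` would allow a comparison with the Protein sequence and GC content fields. A 918 bp coding sequence contains 306 codons, the last of which is the stop codon, consistent with the stated protein length of 305 aa.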