Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0088 |
Symbol | |
ID | 5668513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 104451 |
End bp | 105926 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641239016 |
Product | aldo/keto reductase |
Protein accession | YP_001504461 |
Protein GI | 158311953 |
COG category | [C] Energy production and conversion [K] Transcription |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) [COG0789] Predicted transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCGG TCGACGAGCT CAGCGAGGAC GGTTCTCGGG CGCCGGATGG GCCTACGTCG GGCCGCCGGA AATCCGGGGC GAGACGGCCC TCTACCGGCG TCGCCCGGTC GGCGGACAGC GGCCTCACCA TCGCTGAGGC GGCGACGGCG AGCGGCGTGA GCGCGCACAC TCTGCGGTAC TACGAGCGGG TGGGCCTCAT GCTCGAACGG GTGGCGCGGG CTCCGTCGAG CCATCGCCGC TACGACGACG AGGACCTGCG CTGGATCGCG ACGCTCACCG CGCTGCGCCA TACCGGCATG CCGATCCGCC AGATCGCGCG GTACGCGGCG CTTGTACGGG CCGGCCACGG CAACGAGACC GAACGGCTCG AGCTGCTCAC CGCCCACAAG GAGCGCGTCA CCGAGCGTCT CGATGAGGTT CGCCGCCATC TCACAGCCAT CGACACCAAG ATTGATATCT ATCGGGAGAG GACCAGACGA CCGATGCCCC ACGACACATT GCGCCAGGTA CCGCTCGGTT CCCAGGGACT CCGGGTCAGC GTGCAGGGCC TCGGCTGCAT GGGCATGTCC GACTTCTACG GCGCGACCGA CGACAACGAA TCGATCGCCA CCATTCAGCG GGCGCTCGGC CTGGGCGTGA CCTTCCTCGA CACCGCCGAC ATGTACGGGC CGTTCACGAA CGAGCGGCTG GTCGGGCGGG CCATCTCCGG CCGGCGCGGC GAGGTGACGC TCGCGACCAA GTTCGGGATC GTCCGCGACC CGGACAACCC CCAGGCCAGG AACATCAACG GCCGGCCGGA GTACGTCCGC TCAGCCTGCG ACGCGTCCCT GTCCCGCCTC GGGGTCGACC ACATCGATCT CTACTACCAG CATCGGGTCG ACCCGACTGT GCCGATCGAG GACACCGTCG GCGCCATGGC CGAGCTGGTC ACCGCAGGGA AGGTGCGCTA CCTCGGCCTC TCCGAGGCGT CGCCGGCCAC GATCCGCCGG GCGCACGCCG TGCATCCCAT CTCCGCGCTG CAGACCGAGT ACTCGATCTG GTCACGTCAC CCGGAGGAGG AGATCCTCCC GACGCTGCGC GAACTCGGCA TCGGCTTCGT TGCCTACAGC CCGCTGGGGC GGGGGTTCCT GACCGGAACC TTCCGCACCC CGAACGACTT CGAGGCCGGC GACTTCCGCG CCAGCATGCC CAGGATGAAC TCCGAGAACC TGGACGCCAA CCTCTCGGTC GTCGCCCAGA TCGAGGAGAT CGCGGCGGCG CGAAACGCGA CACCCGCGCA GGTGGCACTC GCCTGGGTGC ACCACCAGGG CGACGACATC GTCCCGATCC CGGGGACGAA GCGACGCCAC TACCTGGAGC AGAACGTCGC CGCCGTCGGC CTCGCCCTGA CGCCCGACGA GGTGGAAATC CTGACGAAGG CTGGCGAGAC CGTGCGGGGC GCGCGCTATC CGGACATGTC CAACGTCAAC CTTTGA
|
Protein sequence | MTAVDELSED GSRAPDGPTS GRRKSGARRP STGVARSADS GLTIAEAATA SGVSAHTLRY YERVGLMLER VARAPSSHRR YDDEDLRWIA TLTALRHTGM PIRQIARYAA LVRAGHGNET ERLELLTAHK ERVTERLDEV RRHLTAIDTK IDIYRERTRR PMPHDTLRQV PLGSQGLRVS VQGLGCMGMS DFYGATDDNE SIATIQRALG LGVTFLDTAD MYGPFTNERL VGRAISGRRG EVTLATKFGI VRDPDNPQAR NINGRPEYVR SACDASLSRL GVDHIDLYYQ HRVDPTVPIE DTVGAMAELV TAGKVRYLGL SEASPATIRR AHAVHPISAL QTEYSIWSRH PEEEILPTLR ELGIGFVAYS PLGRGFLTGT FRTPNDFEAG DFRASMPRMN SENLDANLSV VAQIEEIAAA RNATPAQVAL AWVHHQGDDI VPIPGTKRRH YLEQNVAAVG LALTPDEVEI LTKAGETVRG ARYPDMSNVN L
|
| |