Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0701 |
Symbol | |
ID | 5669118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 819956 |
End bp | 821002 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641239629 |
Product | aldo/keto reductase |
Protein accession | YP_001505066 |
Protein GI | 158312558 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.699479 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTACC TGACCTTCGG ACGGCACACC GGCCTGCGGG TCTCGGAGTA CGCGCTCGGG ACCGCGAACT TCGGCACGGC CGACACCAGC GCCGGTCGGG ACACCTCGCG GCAGATCTTC GACGCCTTCG TCGCCGCCGG CGGTACGACG TTCGACACGT CCAACCTCTA CCAGGACGGA CAGGCCGAGA CCGTGCTCGG TGAGCTGCTC GGCAGTCGCC GGGACGACTT CGTCGTGATC ACCAAGTACG GCGGGACGAG GCGGGACCAG CCCCGCCCCG GCACGACCGG CAACAGCCGA AAGATCATGA TTCGCTCCCT GGAGGCCAGC CTGCGGCGGC TGAACACGGA CTACGTCGAC GTCTTCATGC CGCACTTCCC CGACGGAACC ACCCCGATCG ACGAGATCCT GGCCGGGTTC GACGACCTGA GCCGCGCCGG GAAGATCCGG TACGGCGGGC TGTCCAACTT TCCGGCCTGG CGGGTCGCCG GCGCCGTGGT CCGGGCGGAC CTGCGCGGAC TCGCCCCGCT GGTCGGCATC CAGACCGAGT ACAGCCTGGC CGAACGCTCC GCCGAGCGGG AACTGCTACC GATGGTGCAG GCCCATGGCC TGGGAGTGTT CCTGTACTCG CCGCTGGCCG GAGGTCTGCT CAGCGGCAAG TACCGGCAGG GTGGGAAAGG CCGGTTGAGC GTTCGCGGCG ACGCCGTCGA GCGCACCGTT CAGCAGAGCG CCGTCGTCGA CGGGGTGTTG GCCGTCGCCG ACGAGATCGG CAGCACCGCG GTCCAGGTCT CGCTGGCCTG GCTGCGTCGC CGGGCCGTGT CGGCCCGGAC CGCGCTGATC CCGATCGTCG GGCCTCGCAC GCTGTCCCAC CTCGAGCAGT ACCTGCGCTC GCTCGAGCTG GAACTCGACG AACAGCACTG TCGACGTCTC GACGAGATCA GCGCGATCCG GTTGGGCACT CCGCACGAGG ACGTCGCGGC AGCTCTGGCC CACGGCTTCG ACGGCGACCG CACCCTGCTC CAGACCCTGT ACCTCCCCGT GATCTGA
|
Protein sequence | MRYLTFGRHT GLRVSEYALG TANFGTADTS AGRDTSRQIF DAFVAAGGTT FDTSNLYQDG QAETVLGELL GSRRDDFVVI TKYGGTRRDQ PRPGTTGNSR KIMIRSLEAS LRRLNTDYVD VFMPHFPDGT TPIDEILAGF DDLSRAGKIR YGGLSNFPAW RVAGAVVRAD LRGLAPLVGI QTEYSLAERS AERELLPMVQ AHGLGVFLYS PLAGGLLSGK YRQGGKGRLS VRGDAVERTV QQSAVVDGVL AVADEIGSTA VQVSLAWLRR RAVSARTALI PIVGPRTLSH LEQYLRSLEL ELDEQHCRRL DEISAIRLGT PHEDVAAALA HGFDGDRTLL QTLYLPVI
|
| |