Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4540 |
Symbol | |
ID | 5672889 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5416764 |
End bp | 5417762 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641243405 |
Product | aldo/keto reductase |
Protein accession | YP_001508821 |
Protein GI | 158316313 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATACG CCAACCTCGG AACCTCCGGG CTGAAGGTCA GCCGTATCGC GCTCGGCTGC ATGAGTTTCG GCAGGCCCGG CACGGGTCGC GACTGGGCGC TGGACGCCGA CGCCGCCAAG CCGATCTTTC AGCAGGCCGT CGACCTCGGC ATCACGCTTT GGGATACCGC CAACACCTAC AGTCGGGGGA CGTCCGAGGA GATCACCGGT GAAGCTGTGA AGCGGTACAC GAGCCGCGAC CAGGTCGTGA TCGCGACGAA GCTGTTCGCG CCGATGGGCT CCGGCCCCGG CGGCCGGGGC CTTTCGCGCC GCGCCGTCTT CGAGCAGCTC GACGCGTCGC TGCGACGGCT CGGCACCGAT TGGATCGACC TTTACCAGAT CCACCGATTC GACCCGCACA CCCCCGTCGA GGAGACGATG GAGGCCCTCC ACGACGCCGT GAAATCGGGC AAGGTGCGCT ACCTGGGCGC TTCCTCGATG TGGGCCTGGC AGTTCTCCAA GCTGCAGTAC ACCGCGCAGC TGCACGGCTG GACGAAGTTC ATCTCCATGC AGGACCAGTA CAACCTCGTC GCCCGCGAGG AAGAGCGCGA GATGTTCCCG CTGCTGTCGG ACCAGGGCGT CGGCAGCCTG CCATGGTCGC CGCTCGCTGC CGGCCTGGTG ACCCGGCCGT GGGGTGACGC GAGCACGACC CGCGGCGCGC TGAACCCGAC GGCCGACGCG TCCGGCACCC CGCTGTTCCT CGACAGCGAC CGCGGGACGG TCGACGCGGT CCAGCGCATC GCCGAGGCAC GGGGGACCTC GATGGCGCAG GTGGCGATGG CGTGGGTGCT GCGCAACCCT GTCGTGACCG CCCCGATCGT CGGAGCGACC AAGTCGCACC ATCTCGCCGA CGCCGTGGCC GCGCTCGACA TCAATCTCGG TAAGGACGAG GTCACCGCTC TGGAGGAGAG CTACACGCCG CGCAGGCCGA CCTACTACGG CTCCGAGTCG GGGTACTGA
|
Protein sequence | MEYANLGTSG LKVSRIALGC MSFGRPGTGR DWALDADAAK PIFQQAVDLG ITLWDTANTY SRGTSEEITG EAVKRYTSRD QVVIATKLFA PMGSGPGGRG LSRRAVFEQL DASLRRLGTD WIDLYQIHRF DPHTPVEETM EALHDAVKSG KVRYLGASSM WAWQFSKLQY TAQLHGWTKF ISMQDQYNLV AREEEREMFP LLSDQGVGSL PWSPLAAGLV TRPWGDASTT RGALNPTADA SGTPLFLDSD RGTVDAVQRI AEARGTSMAQ VAMAWVLRNP VVTAPIVGAT KSHHLADAVA ALDINLGKDE VTALEESYTP RRPTYYGSES GY
|
| |