Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4232 |
Symbol | |
ID | 5672587 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5039552 |
End bp | 5040589 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641243105 |
Product | aldo/keto reductase |
Protein accession | YP_001508522 |
Protein GI | 158316014 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCATC GATCTCTCGG CCGTACCGGT GTCTCGGTCA GCAAGCTGTG CCTCGGAGCC ATGATGTTCG GGCGTTGGGG AACCGAGGAT CACGACGAAA GCATCCGGAT CATCCATCGT GCGCTCGATG CCGGCATCAC CTTCATCGAC ACCGCGGACG TCTACTCGCG GGGCGAGTCG GAGACCATCG TCGGGAAGGC GCTGGCCGGT GGTCGGCGCG AGGACGTCAT CCTCGCCAGC AAGGTCCACA TGCCGATGGG CGACGATCCC AACCAGCGGG GCAACTCGCG CCGCTGGATC ATCCGCGAGG TCGAGGACTC GCTGCGGCGG CTCGGGACCG ACTGGATCGA CCTGTACCAG ATCCATCGCT ACGACCCCGG CACCGATCTC GACGAGACGC TCGGCGCGCT CACCGACCTG GTCCGGGCCG GAAAGGTCCG CTACGTCGGC CACTCGACGT TTCCGGTCTC CGCGATCGTC GAGGCCCAGT GGACGGCGCG GGAGCGCGGC CGGGAGCGTT TCGTCTGCGA GCAGCCGCCG TACTCGATCC TCACCCGCCG GATCGAGGTC GACGTCCTGC CGACCTGTGC CCGGTACGGG ATGGGTGTGA TCCCGTACAG CCCGCTCGCC GGGGGCTGGC TCTCGGGCCG TTACAGCGCC GGCGCGGACG CCGCGGCGCC GGCGTCCGAG GCGCGGCGGC GGCTCAGCGA CCGCTTCGAT CTGACGCTGC CGGCCAACCG GCGCAAGCTG GACGCCGCCG AGCAACTCGC CAAGGTGGCC GCCGAGGCGG GGATCAGCCT CATCGAGCTG GCGATCGCCT TCGTCCTACG CCATCCCGCG GTGACCGCGG CGATCATCGG CCCGCGGACG ATGGACCATC TGGAATCCCA GCTGACCGCC ACCGACATCG AGCTCTCCGA CGACGTCCTG GACCGCATCG ACGAGATCGT GCCCCCCGGT ACGACCATCA ACCCCGCCGA CGACGGATGG GTCAGCCCGG CCCTGGCTGC TCCGGCACGC CGCCGACCGG CGCGCTGA
|
Protein sequence | MEHRSLGRTG VSVSKLCLGA MMFGRWGTED HDESIRIIHR ALDAGITFID TADVYSRGES ETIVGKALAG GRREDVILAS KVHMPMGDDP NQRGNSRRWI IREVEDSLRR LGTDWIDLYQ IHRYDPGTDL DETLGALTDL VRAGKVRYVG HSTFPVSAIV EAQWTARERG RERFVCEQPP YSILTRRIEV DVLPTCARYG MGVIPYSPLA GGWLSGRYSA GADAAAPASE ARRRLSDRFD LTLPANRRKL DAAEQLAKVA AEAGISLIEL AIAFVLRHPA VTAAIIGPRT MDHLESQLTA TDIELSDDVL DRIDEIVPPG TTINPADDGW VSPALAAPAR RRPAR
|
| |