Gene Information |
Locus tag | Franean1_0447 |
Symbol | |
ID | 5668869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 526304 |
End bp | 527542 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641239379 |
Product | aldo/keto reductase |
Protein accession | YP_001504817 |
Protein GI | 158312309 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
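The coordinate and length fields above are related by simple arithmetic, and a minimal sketch can check that they agree. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record itself does not state either convention. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon (assumed to be included in the CDS) encodes no residue.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(526304, 527542)  # Start bp, End bp from the record
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1239 412
```

Under these assumptions the computed values match the reported gene length (1239 bp) and protein length (412 aa).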
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGTGC GCCTGCTGTA CCTGATCTTC GTTCGGGTCT GTGGCTGGCT GGTTCTACTC GGTCGCTCGT CGGCCGCCAA GGACTTGGAA TTGCTGGTGC TACGGCATGA GGTCACGGTG CTGCGCCGTA CCCAGCCCAG GCCCCGGTGG GACTGGGCGG ACCGGGCGGT CCTCAATCAA ACAAGATCAA GGAATGGCGG GGTCTGGCCA CCCGCTACGA CAAGACACCC GAAAGCTACG CCGCAGGACT CCACCTGCGC GGATCCATCC TCTGGCTACG CAGCCTGCCA ACCCCATGAT CCGAACTTGG AACAGACCCT AGGTACCGAC GGACCCGAGG TTCCCGTGGT CTGCGTCGGA ACGAGCCCGC TGGGCGGGCT CCCGACAATC TACGGCTATG ACGTCGAGGC GGGGCAGGCG GTGGCGACGA TTCGCCGCGT GTTGGAGTCA CCGATCGACT TCATCGACAC CTCGAACGAG TACGCGAACG GCGAAAGCGA GCGGCGCATC GGCGAGGCGT TGCGCAGCGC GGCAGGGGGT CCCGGCAATG TCGTGCTCGC GACCAAGGCA GATCCCGCGC TGTGGGCCAC GGAGTTCCCC GGCAGCCGAG TGCGGGAATC GTTCCGGGAG AGCACCGAGC GGCTCGGGGT TGATCGGTTC GAGGTGTTCT ACCTGCACGA CCCGGAGCGC TTCGATTTCG GGTACATGAC GGCCCCCGGA GGTGCGGTCG AGGCGATGGT GCAGCTGCGT ACCGACGGCC TGGCCACGGC AATCGGCGTG GCCGGCAGTG ACATCAGCGA GATGCGCCGC TACGTCGACC TCGGCGTCTT CGACGTCATC CTGAACCACA ACCGGTATAC ACTTCTTGAT CGTTCTGCGG ACGCCCTCAT CGACCACGCG GTCAACGCCG GCCTGTCCTT CATTAATGCC GCACCGTATG CCAGCGGCAT GCTCGCGAAG CAGGTCTCGG CCCGTCCGAG GTATCAATAT CGCGCGCCAT CGCCCGAGAT CGTCCGCACC ACCGCGTGGT TGCACCAGGA GTGCGCCCGG TTCCACGTGC CGCTCGCGGC ACTCGCCCTC CAGTTCTCGA CACGCGATCC TCGGATCAGC TCGACGGTCG TCGGCGTATC AGCTCCGGAG CGTGTGGATG AACTCGTGGA GAACGAGCAG CGTGAGATCC CGTCGGAGCT GTGGGACTCC GTGCGCGAGC GGCTGGTGTT GCCGCCCACA GTGACGTAA
|
Protein sequence | MSVRLLYLIF VRVCGWLVLL GRSSAAKDLE LLVLRHEVTV LRRTQPRPRW DWADRAVLNQ TRSRNGGVWP PATTRHPKAT PQDSTCADPS SGYAACQPHD PNLEQTLGTD GPEVPVVCVG TSPLGGLPTI YGYDVEAGQA VATIRRVLES PIDFIDTSNE YANGESERRI GEALRSAAGG PGNVVLATKA DPALWATEFP GSRVRESFRE STERLGVDRF EVFYLHDPER FDFGYMTAPG GAVEAMVQLR TDGLATAIGV AGSDISEMRR YVDLGVFDVI LNHNRYTLLD RSADALIDHA VNAGLSFINA APYASGMLAK QVSARPRYQY RAPSPEIVRT TAWLHQECAR FHVPLAALAL QFSTRDPRIS STVVGVSAPE RVDELVENEQ REIPSELWDS VRERLVLPPT VT
|
| |
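The gene sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The following sketch shows one way to strip the spacing, compute a GC fraction comparable to the reported "GC content" field, and confirm that a CDS ends in a stop codon. The helper names are hypothetical, the stop-codon set is the standard one for translation table 11, and the short sequence used here is a toy example, not the gene itself.

```python
# Stop codons for translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons ending in a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


toy = clean_sequence("ATGGCC TAA")  # toy input in the same spaced layout
print(f"{gc_fraction(toy):.0%}", ends_with_stop(toy))
```

To check the record, paste the full "Gene sequence" value into `clean_sequence` and compare the rounded result of `gc_fraction` with the reported percentage.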