Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3784 |
Symbol | |
ID | 5672148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4487024 |
End bp | 4488085 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641242663 |
Product | aldo/keto reductase |
Protein accession | YP_001508083 |
Protein GI | 158315575 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.116589 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGGTC CCTGCCGCGG CGTCGCCCGC CGCAGCAGAC TGGCGGTCAT GACGCAGACG ACGATGACGA CAAGGCGGCT CGGGGCAACC GGGCCGGAGG TGGGTGCGCT CGGGCTCGGT TGCATGGGCA TGTCCGGCAT GTACGGGCCA GCCGACGACG ATGAGAGCGT GGCGACGATC CTCGCCGCGG TCGACGCGGG GATGACCCTG CTCGACACCG GCGACTTCTA CGGCATGGGA CACAACGAGC TGCTGGTCGG CCGCGCCCTG CGCCAGCTCG ACCGGGACGC GGTGACGGTG AGCGTGAAGT TCGGGGCGCT GCGCGACGCG GTCGGCGGCT GGGGTGGTCT CGACGCCCGG CCGGCGGCGC TGCGCAACTT CCTCGCCTAC TCGCTGGTGC GCCTGGGCAC CGACCATGTG GACGTCTACC GGCCGGCCCG GCTCGACCCC GCCGTGCCGA TCGAGGAGAC GATCGGTGCC ATCGCCGAGC AGGTCAAGGC CGGTCTGGTC CGGCACATCG GCCTGTCGGA GGTCGGCCCG GAGACGATCC GGCGGGCGGC GGCCGTGCAC CCGATCTGCG ACCTGCAGAT CGAGTACTCG GTGCTTTCCC GCGGCATCGA GGACGAGATC CTGGCGACCT GCCGCGAGCT CGGGATCGCG ATCACCGCGT ACGGGGTGCT CTCCCGCGGG CTGATCGCCG GCACCGGCCC GTCGGGGCAC AGCAACGACT TCCGGGCACA CAGCCCGCGC TTCCAGGGCG CGAACCTCGA CCACAACCTG GGGCTCGTCG AGCGGCTGCG GCCGGTCGCC GAGCGCCACG GGATCTCCGT CGCGCAGCTG GCCATCGCCT GGGTCGCCGC CGCAGGTCCG GACGTCATCC CGCTGGTGGG GATGCGCCGG CGCAGCCGCA TCGATGACGC CCTGGCCGCC GCGGCCGTCA CCCTGTCCGA GCAGGATCTG GCCGACGTCG ACCGGGCGGT GCCCGCCGGG TCGGCCGCGG GCGGACGATA CGAGGACGCC CAGCTCGCCG CCCTCGACAG CGAGCGTCCC GCCGAGCGCT GA
|
Protein sequence | MNGPCRGVAR RSRLAVMTQT TMTTRRLGAT GPEVGALGLG CMGMSGMYGP ADDDESVATI LAAVDAGMTL LDTGDFYGMG HNELLVGRAL RQLDRDAVTV SVKFGALRDA VGGWGGLDAR PAALRNFLAY SLVRLGTDHV DVYRPARLDP AVPIEETIGA IAEQVKAGLV RHIGLSEVGP ETIRRAAAVH PICDLQIEYS VLSRGIEDEI LATCRELGIA ITAYGVLSRG LIAGTGPSGH SNDFRAHSPR FQGANLDHNL GLVERLRPVA ERHGISVAQL AIAWVAAAGP DVIPLVGMRR RSRIDDALAA AAVTLSEQDL ADVDRAVPAG SAAGGRYEDA QLAALDSERP AER
|
| |