Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3371 |
Symbol | |
ID | 5671742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3996230 |
End bp | 3997078 |
Gene Length | 849 bp |
Protein Length | 282 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641242259 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001507679 |
Protein GI | 158315171 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAAGG ACGAGGCGAT GCCACATACC AAGACACAGC GGTCCCGGGT GGCGGTCGTC ACCGGCGGGG CGTCGGGGAT CGGCGAGGCC ACCTGTCATC ACCTGGCCGA GCGTGGCCAC CGCATCGCCT TACTGGATCT CGACCCTGAC GCGGCGAACC GGGCGGCCAA GGAGCTGCAG GCCGCCGGCG CGGACGTGCT CGGCATCGCG GTCGACGTCA CCGACCGCCC CGCCCTCGAC GCCGCGTTCG CCGAGATCCG CTCGAAGCTC GGCCCGACGG AGATCCTGGT CACCAGCGCC GGTCTGGTCG CCTTCGACCC GTTCGAGCAG ATCACCCTCG AGGAGTGGGA CCGGGTTGTC GGGGTGAACC TCACCGGCAC CTTCCACTGC TGCCAGGCTG CCGTGCCCGA CATGGTCGCG GCCCGTTGGG GCCGCATCGT GACCATCGCC TCGTCGAGCG CCCAGCGGGG CTCGCCGAAC ATGGCGCACT ACGCGGCGGC CAAGGGCGGG GTGATCGTGC TGACCAAGTC ACTGGCCCGC GCCTACGCCT CCTACGGCAT AACCGTCAAC AGCATTCCGC CGTCGGGAAT CGAGACACCG ATGCAGCACC AGTCCCAGGC CGCGGGGCAT CTGCCCCCCA ACGAGGTGAT GGCCGGGGCC ATCCCGCTCG GCCATCTCGG CACCCCGGAC GACATCGCGG CGGCGGCCGC CTTCCTCACC TCGGAGGAGG CCCGCTTCAT CACCGGACAG GTCCTGGGAG TCAACGGCGG GAGCGGGCCA CCCAGGGAAC CTGCCGGCAG TTCACCTGCA AGTACCACGG GTGGCGCTAC GACCTCGACG GTGCTCTGA
|
Protein sequence | MRKDEAMPHT KTQRSRVAVV TGGASGIGEA TCHHLAERGH RIALLDLDPD AANRAAKELQ AAGADVLGIA VDVTDRPALD AAFAEIRSKL GPTEILVTSA GLVAFDPFEQ ITLEEWDRVV GVNLTGTFHC CQAAVPDMVA ARWGRIVTIA SSSAQRGSPN MAHYAAAKGG VIVLTKSLAR AYASYGITVN SIPPSGIETP MQHQSQAAGH LPPNEVMAGA IPLGHLGTPD DIAAAAAFLT SEEARFITGQ VLGVNGGSGP PREPAGSSPA STTGGATTST VL
|
| |