Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3687 |
Symbol | |
ID | 5672053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4364147 |
End bp | 4365007 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242570 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001507990 |
Protein GI | 158315482 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTTTAC TTTCGGGCCG GGTCGCCGTC GTCACCGGCG CCAGCCGGGG CATCGGGAAG GGCATCGCGC TCGCGCTCGC CGACGAGGGC GCGACGGTCT ACGTCACCGG GCGGACGGTG TCCCCCGGCT CGCATCCGCT TCCCGGCACG GTCGGTGAGA CCGCGGCGGC GGTGGACCGC CGGGGCGGTC AGGGCATCGC CGTCCAGGTC GACCACGGTG ACGACGAGCA GGTCGCCGCA CTGTTCGAGC AGGTCGAGCG GGAGCAGGGC CGGCTGGACA TCCTGGTCAA CAACGCCTTC TCCCTGCCCG AGGACCTGAC GGAGCCGAAC CCGTTCTGGG AGAAGCCGCT GTCGAACTGG GAGATGGTCG ACGTCGGGGT GCGGTCGAAC TTCGTCGCCG CCTGGCACGC GGCGCGGCTC ATGGTTCCGC GCCGGGCCGG CCTCATCGTG GCGATCTCCG GCTACGTGGG AGTGACCTAC ACCTACGGCG TCGTCTTCGG CACCTGCAAG TCCGCGGTCG ACCGCATGGC CAGGGACATG GCCATCGAGC TCAAGCCGCA CAACATCGCG TCGATCTCGC TGTGGCAGGG CCTCACCTTC ACCGAGCGGG CCGAGCGCAA CATCTCGCTG AACCCCGCGA TGAAGGAGCA GCTCGTCACC AGCCCGACGA TCGGCTGCTC GCCGGAGTTC CCCGGCCGCG TCATCGCGGC GCTCGTCCAG GACACGGACC TGATGCGGCA CTCCGGGGGC ACCTTCATCA CCGCCGAGCT GGCCCAGGAG TACGGCGTCA CCGACCTCGA CGGGAAGGTC ATCCCGTCCC TGCGGGCCCA GCGCGGCGCC CCCCTCTGGT CCCCCGTCTA G
|
Protein sequence | MGLLSGRVAV VTGASRGIGK GIALALADEG ATVYVTGRTV SPGSHPLPGT VGETAAAVDR RGGQGIAVQV DHGDDEQVAA LFEQVEREQG RLDILVNNAF SLPEDLTEPN PFWEKPLSNW EMVDVGVRSN FVAAWHAARL MVPRRAGLIV AISGYVGVTY TYGVVFGTCK SAVDRMARDM AIELKPHNIA SISLWQGLTF TERAERNISL NPAMKEQLVT SPTIGCSPEF PGRVIAALVQ DTDLMRHSGG TFITAELAQE YGVTDLDGKV IPSLRAQRGA PLWSPV
|
| |