Gene Information |
Locus tag | Franean1_4672 |
Symbol | |
ID | 5673014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5579596 |
End bp | 5581635 |
Gene Length | 2040 bp |
Protein Length | 679 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641243529 |
Product | short chain dehydrogenase |
Protein accession | YP_001508945 |
Protein GI | 158316437 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only; [S] Function unknown |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases); [COG3347] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02632] rhamnulose-1-phosphate aldolase/alcohol dehydrogenase |
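
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that contributes no residue; both assumptions are consistent with the values listed but are not stated by the record itself. Variable names are illustrative only.

```python
# Coordinates copied from the record (Start bp / End bp).
start_bp = 5579596
end_bp = 5581635

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases. Assumption: the last codon
# is a stop codon, so the protein has one residue fewer than the codon count.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 2040 (matches "Gene Length")
print(remainder)       # 0 (the CDS is a whole number of codons)
print(protein_length)  # 679 (matches "Protein Length")
```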
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.174921 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.873986 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGGGA ACCAGACCGT CGCCGAGCTC CTCGCCCGGT CGAACCGGCT CGGCGCCGAC CCGCGCAACA CCAACTACGC GGGCGGCAAC ACCTCCGCCA AGGGCGTCGA GATCGATCCC GTCACGGGCC GCGACGTCGA GCTGCTGTGG GTCAAGGGGT CGGGCGGCGA CCTCGGCACG CTGACCGAGC CCGGCCTCGC CGTCCTCCGG CTGGACCGGC TGCGCGCGCT CGTGGACGTC TACCCCGGCG AGGGCCGCGA GGACGAGATG GTCGCCGCCT TCGACTTCTG CCTGCACGGC CGGGGCGGTG CCGCGCCCTC GATCGACACC GCCATGCACG GGCTCGTCGA CGCCTCGCAC GTCGACCACC TGCACCCGGA CAGCGGCATC GCCCTCGCGA CCGCCGCCGA CGGCGAACAG CTCACGAAGG ACGTCTTCGG CACGAAGGTC GCGTGGGTGC CGTGGCGGCG GCCGGGCTTC CAGCTCGGCC TCGACATCGC GGCCGTGCGG CGGGACAACC CCGGCGCCGT CGGCTGCATC CTCGGCGGGC ACGGCATCAC CGCGTGGGGC GACACGAGCG AGGAGTCCGA GCGCAACTCC ACCTGGATCA TCGAGCAGGC GCGGCTCCAC CTCGAGCGGC ACGGGCGGCC GCGGCCCTTC GGCGCCGTGG CCGACGGCCG GCGCGCGCTG CCAGCCGGGC AACGGCGGGC CAGGGCCGCC GTGCTCGCGC CCCACCTGCG TGCGGTCGCG TCCCGCGACC ACCGCATGGT CGGCCACTTC ACCGACAGCG ACGCCGTCCT GGAGTTCCTG GCCAGCGAGA AGCTGTTCGC CCTGGCCCGG CTCGGGACGT CCTGCCCCGA CCACTTCCTG CGGACGAAGG TCCGCCCGCT GGTGCTCGAC CTGCCGGCCG CGGCGTCACC CGAGGAGTGC GTCCGGCGGC TCGCGGAGCT GCACGAGGCC TACCGCGCCG ACTACCGCGC GTACTACGAG CGGCATGCCA CCGCCGACTC GCCGCCGATG CGCGGCGCGG ACCCGGCGAT CATCCTCGTG CCGGGCGTGG GCATGTTCTC CTACGGCCGG GACAAACAGA CGGCACGGGT GGCCGGCGAG TTCTACGTCA ACGCGATCAA CGTGATGCGC GGTGCCGAGG CGGTCTCGTC CTACGCGCCG ATCCCGGAGC GCGAGAAGTT CCGGATCGAG TACTGGGCGT TGGAGGAGGC GAAGCTGCGC CGCCTACCGC CGCCCAAGCG GCACGCGGGC CGGGTCGCGC TGGTCACCGG CGCGGCCAGC GGCATCGGCC GGGCCACCGC GTCCCGGCTC GCCGCCGACG GCGCCTGTGT GGTCGTCGCC GACCTCGACG CGACCAGCGC CGTGTCGGCG GCGGGCGAGC TCGGCGGCGC CGACGTCGCG GTCGGCGTCG GCGCCGACGT GACGAAGGAG GCCGAGGTCG CGGCCGCCGT CGCGGCGGCA CTGCTCGCCT TCGGCGGGAT CGACCTCGTC GTCAACAACG CGGGCCTGTC CATCTCCAAA CCGCTGCTGG AGACCACCGA GCGGGACTGG GACCTGCAGC ACGACGTCAT GGCGAAGGGA AGCTTCCTCG TCGCGCGGGC CGCGGCGCGA GCCATGATCG ACCAACGGCT CGGCGGTGAC ATCGTCTACA TCGTCTCCAA GAACGCGCTG TTCGCCGGGC CGAACAACGT CGCCTACGGC GCCGCGAAGG CCGACCAGGC GCACCAGGTC AGGCTGCTGG CGGCCGAGCT CGGCGAGCAC GGGATCCGGG TCAACGGCGT CAACCCGGAT 
GGCGTCGTGC GTGGTAGCGG CATCTTCGCC GGCGGGTGGG GAGCCCAGCG TGCGGCGGTC TACGGCATCC CCGAGGAGGA GCTCGGCGCC TTCTACGCCC GGCGGACGCT GCTCGGCCGG GAGGTGCTGC CCGAGCACGT GGCCAACGCG GTCGCGGCCG TGTGCTCGAC CGAGCTCAGC CACACGACCG GCCTGCTCGT CCCCGTCGAC GCCGGCGTCG CGGCCGCCTT CCTGCGCTGA
|
Protein sequence | MSGNQTVAEL LARSNRLGAD PRNTNYAGGN TSAKGVEIDP VTGRDVELLW VKGSGGDLGT LTEPGLAVLR LDRLRALVDV YPGEGREDEM VAAFDFCLHG RGGAAPSIDT AMHGLVDASH VDHLHPDSGI ALATAADGEQ LTKDVFGTKV AWVPWRRPGF QLGLDIAAVR RDNPGAVGCI LGGHGITAWG DTSEESERNS TWIIEQARLH LERHGRPRPF GAVADGRRAL PAGQRRARAA VLAPHLRAVA SRDHRMVGHF TDSDAVLEFL ASEKLFALAR LGTSCPDHFL RTKVRPLVLD LPAAASPEEC VRRLAELHEA YRADYRAYYE RHATADSPPM RGADPAIILV PGVGMFSYGR DKQTARVAGE FYVNAINVMR GAEAVSSYAP IPEREKFRIE YWALEEAKLR RLPPPKRHAG RVALVTGAAS GIGRATASRL AADGACVVVA DLDATSAVSA AGELGGADVA VGVGADVTKE AEVAAAVAAA LLAFGGIDLV VNNAGLSISK PLLETTERDW DLQHDVMAKG SFLVARAAAR AMIDQRLGGD IVYIVSKNAL FAGPNNVAYG AAKADQAHQV RLLAAELGEH GIRVNGVNPD GVVRGSGIFA GGWGAQRAAV YGIPEEELGA FYARRTLLGR EVLPEHVANA VAAVCSTELS HTTGLLVPVD AGVAAAFLR
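
The gene and protein sequences above can be checked against each other with a small translation routine. The following is a minimal sketch, not the database's own method: it assumes the gene sequence is already given 5' to 3' on the coding strand (it begins with GTG and ends with TGA even though the gene lies on the minus strand), it uses the amino acid assignments of the standard code (which translation table 11 shares), and it simply reports the first codon as M, since table 11 allows alternative start codons such as GTG that are read as methionine at initiation. The function names `clean`, `gc_percent` and `translate_cds` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI layout).
# Table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq):
    """Translate a coding sequence up to (not including) the first stop codon.

    Simplification: the first codon is always reported as M, on the
    assumption that it is a valid table 11 start codon.
    """
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append("M" if pos == 0 else amino_acid)
    return "".join(residues)


# First two 10-base blocks of the gene sequence in this record.
prefix = "GTGAGCGGGA ACCAGACCGT"
print(translate_cds(prefix))  # MSGNQT, the first six residues of the protein
```

Passing the full gene sequence to `translate_cds` and `gc_percent` should allow the protein sequence and the GC content field to be compared in the same way; a trailing incomplete codon, as in the 20-base prefix above, is ignored.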