Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_1994 |
Symbol | |
ID | 8447603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 2201743 |
End bp | 2202792 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645041122 |
Product | aldo/keto reductase |
Protein accession | YP_003201368 |
Protein GI | 258652212 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0000673515 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0830186 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTACG TCAAGCTCGG AACCACCGGA TTGGACGTCT CGCCGATTGC GATCGGTGCG ATGACCTACG GCGACCCCGG CCGTGGGCAC CCGGTCTGGT CGCTGGACGA GGAACGCAGC CGCCCACTGA TCAAGCACGC GCTGGAAGCC GGCATCACGT TCTTCGACAC GGCCAACATG TACTCCCAGG GTTCGAGCGA GGAGATCCTC GGGCGCGCGC TGCGCGATTT CGCCGACCGC GACGAGGTGG TCATCGCGAC CAAACTGCGG CACCCGATGC GCTCGGGGCC GAACGGAAAG GGACTGTCCC GCAAGGCGAT CATGATCGAG GTCGAGCACT CGCTGCGCCG GCTGGGCACC GACTACATCG ACCTGTACCA GATCCACCGC AACGATCACC GCACGCCGCT GACCGAAACC CTGGAAGCCC TGCACGATCT GGTGAAGGCG GGCAAGGTCC GGTACCTGGG CGCCTCCTCG ATGCCGGCCT GGGAGTTCGC CAAGGCGTTG CACACGCAAC AGGCGCACGG CTGGGCGCGG TTCGTCACCA TGCAGGACCA CTACAACCTG CTGCATCGCG AGGAGGAACG GGAGATGATC CCGCTCTGCC TGGACGAGGG GGTCGGCACC ATCGTGTGGA GCCCGCTGGC CCGGGGGCGG CTGGCCCGGC CGTGGGCGCA GGCCAAGTCC ACCGACCGCG GTCGCAGCGA CCCGTTCGCC GACCTGCTCT ACCTTCCCCA GAACGAGGCC TCCGACCGGG CCACCATCGA CGCCGTCGGC CGGGTCGCGC AGGCCCGTGG GATCAGCCGG GCCCAGGTCG CGCTCGCCTG GCTGCACGCC CAACCGGTGG TCACCGCACC GCTGGTCGGG GCCAGCAGCA TCGGCCAGAT CGACGAGGCC GTCGCCTCCC TGCAGATCGA GCTGGAGGCC GGCGAGCTGC GGGAACTGCA GGCGCACTAC ACGCCTCGGC ACGACTTCCA GGGCATCTCC GAGGACGCCG AGCTGCAGGC CATCATGGAG CGGATGCCTC AGTTCACCAC CGCGCGCTGA
|
Protein sequence | MDYVKLGTTG LDVSPIAIGA MTYGDPGRGH PVWSLDEERS RPLIKHALEA GITFFDTANM YSQGSSEEIL GRALRDFADR DEVVIATKLR HPMRSGPNGK GLSRKAIMIE VEHSLRRLGT DYIDLYQIHR NDHRTPLTET LEALHDLVKA GKVRYLGASS MPAWEFAKAL HTQQAHGWAR FVTMQDHYNL LHREEEREMI PLCLDEGVGT IVWSPLARGR LARPWAQAKS TDRGRSDPFA DLLYLPQNEA SDRATIDAVG RVAQARGISR AQVALAWLHA QPVVTAPLVG ASSIGQIDEA VASLQIELEA GELRELQAHY TPRHDFQGIS EDAELQAIME RMPQFTTAR
|
| |