Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_4041 |
Symbol | |
ID | 8449660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 4454923 |
End bp | 4455864 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 645043086 |
Product | short chain dehydrogenase |
Protein accession | YP_003203322 |
Protein GI | 258654166 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.138801 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0368212 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAAGA CCATCGACAT CACCGTCCCC GACCTGAGCG GGCGGCGCGC GGTCGTCACC GGGGCCAGTG ACGGCCTCGG CGTCGGCCTG GCCGGCCGGT TGGCCGCGGC CGGCGCCGAG GTGATCATGC CCGTGCGCAA CCAGCGCAAG GGCGAGGCGG CGATCGACCG GATCCGGCGG TCGGCACCGG ATGCCACCGT GTCGCTGCGC GAGCTGGACC TGTCGAGCTT GGATTCGGTG GCTGAACTCG GCCGGACGCT GACCCAGGAG GATCGGCCGA TCCACCTGCT GATCAACAAC GCGGGGGTGA TGACCCCGCC GGAACGGCAG AACACGGCCG ACGGCTTCGA GCTGCAGTTC GGGTCCAACC ACCTGGGCCA CGTCGCCCTG GTCGCGCACC TGCTGCCGCT GCTGCGGGCG GGGCAGGCCC GGGTCACCTC ACAGGTCAGC GTCGCGGCGG CCCGGGGGTC CATCAACTGG GACGACCTGA ACTGGGAACG GTCCTACGAC GGGATGAAGG CCTACCGCCA GTCCAAGATC GCGCTCGGAC TGTTCGGGCT GGAGCTGGAC CGGCGCAGCC GAGCCGCCGG CTGGGGCATC AGCAGCAACC TGGCGCACCC CGGGGTCGCC CCGACGAACC TGCTGGCCGC CCGACCCGAG GTGGGCCGGG CCAAGGACAC CCTGGGCGTG CGCGTCATTC GCGCCCTGTC CGCGCGCGGG CTCCTGGTCG GCACGGTCGC CAGCGCCGCG CTCCCGGCGG TGTACGCGGC CACCTCGCCC GACGCCCAGC CGGGGCGGCT GTACGGGCCC GGCGGGCTGG GCCACCTGGG CGGTGCGCCG GCGGAGCAGA AGCTCTACCC CACCCTGCGC GGCGACGAGC AGGCCGATCG CATCTGGCGG GTCTCACAGG AGCTGACCGC GGTGCCGTTC CCGCAGGACT GA
|
Protein sequence | MTKTIDITVP DLSGRRAVVT GASDGLGVGL AGRLAAAGAE VIMPVRNQRK GEAAIDRIRR SAPDATVSLR ELDLSSLDSV AELGRTLTQE DRPIHLLINN AGVMTPPERQ NTADGFELQF GSNHLGHVAL VAHLLPLLRA GQARVTSQVS VAAARGSINW DDLNWERSYD GMKAYRQSKI ALGLFGLELD RRSRAAGWGI SSNLAHPGVA PTNLLAARPE VGRAKDTLGV RVIRALSARG LLVGTVASAA LPAVYAATSP DAQPGRLYGP GGLGHLGGAP AEQKLYPTLR GDEQADRIWR VSQELTAVPF PQD
|
| |