Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2111 |
Symbol | |
ID | 8447722 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 2329348 |
End bp | 2330151 |
Gene Length | 804 bp |
Protein Length | 267 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645041234 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_003201478 |
Protein GI | 258652322 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.00668393 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.136098 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATACCA CCCGCCTCGC CGGCAAGAAC GTGCTCATCA CCGGTGGCGC ACAGGGCATG GGCGCCGCCG TCGCCCACTA CTACGCCGAG CAGGGCGCCA AGGTGTGCGT GGGGGACGTC AACGTGGACG GCGTCGCCCA GGTCGCGGAT GAGATCACCG CCAAGGGCGG CACGGCCACC TTCGTCAAGC TCGACGTGAC CAGCGAGCAG GACGCGGCTG CCGCCGTCGC GCACACGGTC GAACAGTTCG GCAGCATCAA CGTGCTGCTG AACAACGCGG GCATCAACAA GCCCTTGTTC TTCCTCGACA TCACCCGCGA GAACTGGCAG CGCATCATGG ATGTCAACGC CTGGGGGACG ATGAACTGCA TGCAGGCGGC GGCCCGCCAG ATGAAGGCGC AGGGCCGGCA GGACTACCCC TACAAGATCA TCAACGTCGG TTCGATCCTG TCCCGGGACG TCTTTGACGA CGTGGTTGTC TACGGCGCGT CCAAGCATGC GGTGCTGGCC CTGATCAAGG GCGGGGCCAA GGCGCTGATC GAACACAACA TCACCGTCAA CGGGTACGGC CCGGGGGTGG TGCGCACGGA GCTGTGGGAG CAACTGGACA AGGACCTGGT GGCGATCGGC AAGTTCGACG CGCCGGGTAA GTCCATGGAC CAGCTGGCCG AGACGATGAT CTTGATGAAG CGCTACTCCT ACCCCAAGGA CGTCGTGGGC ACGGCGGCCT TCCTGGCCAG CCCGGAGAGC GACTACATGA CCGGTCAGCT GCTGATGATC GACGGCGGCA TGGTGATCCA GTGA
|
Protein sequence | MDTTRLAGKN VLITGGAQGM GAAVAHYYAE QGAKVCVGDV NVDGVAQVAD EITAKGGTAT FVKLDVTSEQ DAAAAVAHTV EQFGSINVLL NNAGINKPLF FLDITRENWQ RIMDVNAWGT MNCMQAAARQ MKAQGRQDYP YKIINVGSIL SRDVFDDVVV YGASKHAVLA LIKGGAKALI EHNITVNGYG PGVVRTELWE QLDKDLVAIG KFDAPGKSMD QLAETMILMK RYSYPKDVVG TAAFLASPES DYMTGQLLMI DGGMVIQ
|
| |