Gene Information |
Locus tag | Namu_3688 |
Symbol | |
ID | 8449307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 4046223 |
End bp | 4047230 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645042752 |
Product | short chain dehydrogenase |
Protein accession | YP_003202988 |
Protein GI | 258653832 |
COG category | [R] General function prediction only |
COG ID | [COG4221] Short-chain alcohol dehydrogenase of unknown specificity |
TIGRFAM ID | |
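The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the gene length includes the stop codon; the record does not state either convention, so treat both as assumptions. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 4046223
END_BP = 4047230


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends in one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 1008 335
```

Under these assumptions the result agrees with the listed Gene Length (1008 bp) and Protein Length (335 aa).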
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.000592094 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0665627 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCCAC AGGTCGTAGT GGTCACCGGG GCCAGCGGCG GTATCGGCCG CGCGGTCGCC TCGGCGTTCG GCGCCCGCGG GGCCCGCGTC GCGATGCTGG CGCGCGGCGA GAGCGGGCTG ACGGGCGCCG CCCAAGATGT GCGTGCCGGC GGCGGCACCG CGCTGCCCAT CCCGACGGAC GTGGCCGACC AGGCGCAGGT TTTTTCGGCC GCCGACCGCG TCGAAAGCGA GCTCGGCCCC ATCGATGTCT GGGTGAATGT CGCTTTCACC TCGGTGTTCG CGCCCTTCGC GAAGATCCAA CCCGACGAAT ACCGGCGGGT GACCGAGGTG AGCTATCTGG GATACGTCTA CGGCACCATG GCCGCGCTAC AGAACATGAA ACCCCGCGAC CGGGGCACCA TCGTGCAGGT CGGGTCCGCG CTGGCCTACC GCGGCATTCC CTTACAGACG GCGTACTGCG GCGCTAAACA CGCGATCCAG GGCTTTCACG AGGCGCTGCG CTGCGAACTA CTGCATGACA AGTCGAACGT GCACGTGACG ATGGTGCAGA TGCCCGCGGT GAACACCCCG CAGTTCTCCT GGGTGCTGTC CCGGCTACCC CACCACGCCC AACCCGTCCC GCCGATCTAC CAGCCCGAGG TCGCCGCCCG CGGCGTCCTG TACGCGGCCG ACCACCCGAA GCGGCGGGAA TACTGGGTCG GCGCCAGCAC CGTCGGCACC CTGGCCGCCA ACGCCATCGC CCCGGGACTG CTGGACCGCT ACCTGGGCAA AACCGGGTTC TCCTCCCAAC AGACCAAGCA GAGGCAACCC CCCGACGCGC CGGCGAACCT GTGGAAACCG GCCGACGGAC CCGACGGCAG GGACTTCGGC ACACACGGCA TCTTCGACGA CCGAGCCAAG AACTCCGCAC CGCAACTGTG GGCGTCGCAC CACCACGGCC TGCTCGCCGC CACGGCGAGC GGTGCGCTGG CCGGCGCCGC GGCCCTGATG CTGGTCCGCC GCAGATGA
|
Protein sequence | MTPQVVVVTG ASGGIGRAVA SAFGARGARV AMLARGESGL TGAAQDVRAG GGTALPIPTD VADQAQVFSA ADRVESELGP IDVWVNVAFT SVFAPFAKIQ PDEYRRVTEV SYLGYVYGTM AALQNMKPRD RGTIVQVGSA LAYRGIPLQT AYCGAKHAIQ GFHEALRCEL LHDKSNVHVT MVQMPAVNTP QFSWVLSRLP HHAQPVPPIY QPEVAARGVL YAADHPKRRE YWVGASTVGT LAANAIAPGL LDRYLGKTGF SSQQTKQRQP PDAPANLWKP ADGPDGRDFG THGIFDDRAK NSAPQLWASH HHGLLAATAS GALAGAAALM LVRRR
|
| |
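The sequences above are printed in space-separated blocks of ten characters, so the spaces have to be removed before any analysis. The following is a minimal sketch, not the database's own method, of how the gene sequence could be cleaned, translated, and checked for GC content. It uses the standard codon assignments, which translation table 11 shares for internal codons. Table 11 also allows alternative start codons that are read as Met, which this sketch does not handle; that does not matter here because the gene starts with ATG. All function names are hypothetical.

```python
# Standard codon assignments in NCBI "TCAG" order (same as table 11
# for internal codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Unknown codons (for example ones containing N) become 'X'.
    Alternative start codons are NOT converted to Met (simplification).
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
head = "ATGACTCCAC AGGTCGTAGT GGTCACCGGG"
print(translate(head))  # MTPQVVVVTG
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.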