Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_0234 |
Symbol | |
ID | 8445814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 263062 |
End bp | 264339 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645039379 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003199654 |
Protein GI | 258650498 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 70 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCATCT GGCTGAACCT GCTGGTGGTG CTGCTTCTGC TGATCATCGG CGGCTTGTTC ACGGCGACCG AACTCGCCCT GGTCTCCCTG CGCCCGGGTG AACTGGCCGA CCTGCGGGCG CAGGGACCGC GCGGCGCCCG GGTGGCCCGG CTGGCCGGCC AGCCAACCCG GTTCCTCGGG GCCGTGCAGA TCGGCTCCAT CGTGGCCGGG TTCTTCGCCG CCGCCTACGC CACCGCGACC CTGGCCGAAC CACTCGGTGC GGCGATGGGC CGGGCGGGAC TGGGCGAGGA CGGCGGGGAG ACCCTGGCCG TGCTGATCGT CACCCTGGCG ACCACCTTCC TGGCCCTGAT CATGGCCGAG CTGACCCCGC GCCGGTACGC GATGCAGCGC CCGCAATCCG TGGCCGCCCT GCTCGGTCCG ATCCTGGACC GGCTGGCCAC GCTGCTGCGA CCGGTCATCT GGCTGTTGGA GAAGTGCACG AACGGCCTGT TGCGGCTGTT GCGGGTCGAT CCGAAGGATT CCCGCGCGGA GATGAGCGTG GAGGAGGTGC GCGAGCTGGT CCTGGCCCAT GAGGAGGTCC CGGACCAGGA GAAACAGATC ATCCGGCAGG TGTTCGCGGC CGGCGAGCGG ACCATCCGGC AGGCCATGGT CCCCCGGGCC GCGATCGACT TCCTGTCCAC CGCGGCCACC GGCGCGCAGG CCCGGCGGGC CGCCTGGGAG CACACCCACA CCCGGTACCC GGTGCTGGAC GAGGCCGGCC AGGTGGCCGG GTTCCTGCAC GTGCGGGATC TGTTCGCGCC CGAACTGGAT CCCGGCGCCC CGATCCGGGA CCTGGTCCGG CCGATCAGCG CCTACCCGCC GAACAAGAAG TTGTTGGCCG TGCTGCGCGA GATGCAGACC GGGGCCGAGA ACATCGCCGC GGTCGTGGAC GAGTACGGCC AGCTCAAGGG CATGGTCACC CTCGAGGACG TCGTCGAGGA ACTCGTCGGC GAGATGTACG ACGAGTACGA CCGCATCCCC ACCGCCACCC CCGGGGACGC CACCGTGGTC GACGGCCTGA CCGGCCTGTC CGACTTCGGC CGCCGGCTCG GTTTCGAGCT GCCGGCCGGC CGGTACGACA CGGTCGGCGG GTACCTGCAG GCCGCCCTGG ACCGGACGCC CCGCGCCGGG GACGCGGTCG AGGTCGCCGG CCACCGGCTC ACGGTCTCCT CGGTCGCCGG CTGGCGGGTC GGTCAGGTCA CCGTCGAACC GCTGTCGACG CCGGTTCAAC CGGACTGA
|
Protein sequence | MGIWLNLLVV LLLLIIGGLF TATELALVSL RPGELADLRA QGPRGARVAR LAGQPTRFLG AVQIGSIVAG FFAAAYATAT LAEPLGAAMG RAGLGEDGGE TLAVLIVTLA TTFLALIMAE LTPRRYAMQR PQSVAALLGP ILDRLATLLR PVIWLLEKCT NGLLRLLRVD PKDSRAEMSV EEVRELVLAH EEVPDQEKQI IRQVFAAGER TIRQAMVPRA AIDFLSTAAT GAQARRAAWE HTHTRYPVLD EAGQVAGFLH VRDLFAPELD PGAPIRDLVR PISAYPPNKK LLAVLREMQT GAENIAAVVD EYGQLKGMVT LEDVVEELVG EMYDEYDRIP TATPGDATVV DGLTGLSDFG RRLGFELPAG RYDTVGGYLQ AALDRTPRAG DAVEVAGHRL TVSSVAGWRV GQVTVEPLST PVQPD
|
| |