Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_1689 |
Symbol | |
ID | 8447290 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 1855308 |
End bp | 1856168 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 645040814 |
Product | protein of unknown function UPF0099 |
Protein accession | YP_003201068 |
Protein GI | 258651912 |
COG category | [S] Function unknown |
COG ID | [COG1990] Uncharacterized conserved protein |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 0.765723 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0247017 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAGG CCGGCCGGTC GAACCATCAA CGCATGCACG AGGGCGCGGT GCTCGCGCCG CTGGCCGACC GGTACGCGCG CTGGTTGGGC ATGGACGCCG CCCAGGTGCT GGCCGACCGG GACGAGGCCG CCCACGACAT CCGGGCCATG CAGCTGATCC TGCGCATGGA ACGAGCCACC CCGCCGTCCT GGCACCGGGC CGTCGCGCTG GCCGCGGCCG GCGCCACCGC ACTGTGCCTG GACGAGCGCA GCGCGCCGGG CGGCGAATGG CACGACGCGG TCGCCGCCTA CGTCCGCGGC CACATCCGCA AGGTCACACG CCGCGCCCGC GGCGCGCACT GGGCCGCCGC GCAGGAGCTG CCCGGCCTGA CCCTGGAGGC CGACGGCACC CAGGTGCGGG TGCTCGTGCC CGGTCCGGTG GTCGAGCTGG ATCCGCGCAT CGGCCGGTTG CAGGTGGGCG GCACGGACGT GCCGCCGGAC GTGCCGCCGG ACGAGCCCGG GGCGGCGTCG GGCGCGCTGC GGCTCTGGAT TCCCGAGGCC TTGCCGATGA CGGTGGGCAA GGCGATGGCC CAGGCCGGCC ACGCCGGGAT GATCTGCGCG GCCCTGCTGG CCCGTGACGA CGCCCCGCCG GCCGACCGGG AGCGGTTGGC CGGCTGGCGG GAGGCCGGTT TCCCGGTGAC GGTGCGCCGG GTCGACGCGG CCCAGTGGGC CCGGCTGGCG CAGCCGGTGG CCCGCGACCA GGACCTCGCC TGGCGGTCCG AGGGCCGGCT GGCCGTGCGC GACGCCGGAT TCACCGAGGT CGAGCCGGGC GCCATCACGA TCATCGCGAC CCGGCCCGAC CCGTCGCCCC GGTCGAGCTA A
|
Protein sequence | MSEAGRSNHQ RMHEGAVLAP LADRYARWLG MDAAQVLADR DEAAHDIRAM QLILRMERAT PPSWHRAVAL AAAGATALCL DERSAPGGEW HDAVAAYVRG HIRKVTRRAR GAHWAAAQEL PGLTLEADGT QVRVLVPGPV VELDPRIGRL QVGGTDVPPD VPPDEPGAAS GALRLWIPEA LPMTVGKAMA QAGHAGMICA ALLARDDAPP ADRERLAGWR EAGFPVTVRR VDAAQWARLA QPVARDQDLA WRSEGRLAVR DAGFTEVEPG AITIIATRPD PSPRSS
|
| |