Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2962 |
Symbol | |
ID | 8448575 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 3246670 |
End bp | 3248130 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645042047 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003202289 |
Protein GI | 258653133 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.387429 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00195258 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCGCGC AGTGGATCAT GGTTGGCGTC GGCCTCGTCC TGATCGCCGG CACCGCGCTC TACGTCGCCG CCGAGTTCAG CCTGGTCACG GCGGACCGGG CGGCCGTGGC CAAGCAGGCC GCGCAGGGCG ATCGCGGCGC CCGCTCCCTG ATGATCGGGC TGCGCTCGCT GTCCACCCAG CTGTCCGGGG CGCAGATCGG CATCACCATC ACCACCCTGG CGTTGGGCTT CGTCATGCAG CCGGCGCTGG CCGACCTGGT CGCGCCGCTA CTGGACGCGA TCGGCCTCGG CGCGGGCGTC TCGCAGACCG TCGGCGCCCT GTTCGGGCTG GTCGTGGCGA CGGTGCTGTC GATGGTGTTC GGCGAGCTGG TGCCCAAGAA CATCGCCATC GCCGAACCGC TGGACACGGC CAAGACGGTG ATCACCCCCA TGCGGGTGTC CACCATGCTG TTCAAGCCGC TGATCATCGT GCTCAACGGC ACTGCCAACG CGGTGCTGCG GGCCATCGGG GTGGAACCCC AGGAGGAGCT GCGCTCGGCC CGGTCCGCGG TGGAGCTCGA TTCCCTGGTC CGCCGTTCGG CCGCCCAGGG CACCCTGGAA CAGCCCACCG CCGGCCTGCT GGCCCGGTCG ATCTCGTTCT CCGGCAAGAC CGCCGACGAC GTGCTCACCC CCCGGGTGCG GGTCCGGTTC GTCAAGGCCA CCGACACCGC GAACGCGGTA CTCACCGCGG CCGTCGAGAC CGGGCATTCC CGGTTCCCGG TGTTCGGGGA GGACTCCGAC GACGTCGTCG GGCTGGTGCA CCTCAAACGC GCGGTGGCCA TCCCGCCCGA CGAACGCGCC GGCGTGCGGG TCGAGCAGCT GATGGTGCCG GTGCCGGTGG TCCCGGGGTC GATCCCGCTG GACGACCTGA TGGACGAGCT GCGCAGCGGG CTGCAGATGG CGGTGGTGGC CGACGAGTAC GGCGGCACGG CCGGGCTGCT CACCCTGGAG GACGTCGTCG AGGAGCTGGT CGGCGAGATC AAGGACGAGC ACGACCCGGT GGACAGCCGG GCCGAACGGC GGGCCGACGA CACCTGGCTG CTGCCGGGCA CCCTGCGGCC GGACGAGATC GTCGACATCA CCGGGGTGCG GCTGCCCGAA TCCAGCGCCT ACGAGACGGT GGCCGGCCTG CTCATCGCCC GGCTGGGCCG GATGCCCAAG GAGTTCGACG CCGTCGAGGT GGACGCGACC ATGGATGCCT CCGCCCATCT GGGCGTGCCC GACCGGCAGG TCGTGCACTC CGACGACCGC CGGATCGAGC CGGAGACCGA GGACGACCTG CCGCGGGCGG TGACCGTCCG GCTGACCGTG CACGGGCTGG CCCGGCGCCG GATCGAGGCC GTGCTGCTGT CCGCGGTCAG CCTGAACACC GGCGACGAGG ACGAGAACGA GTCCGACGAA CGCCGCGGAG ACGGCCGATG A
|
Protein sequence | MIAQWIMVGV GLVLIAGTAL YVAAEFSLVT ADRAAVAKQA AQGDRGARSL MIGLRSLSTQ LSGAQIGITI TTLALGFVMQ PALADLVAPL LDAIGLGAGV SQTVGALFGL VVATVLSMVF GELVPKNIAI AEPLDTAKTV ITPMRVSTML FKPLIIVLNG TANAVLRAIG VEPQEELRSA RSAVELDSLV RRSAAQGTLE QPTAGLLARS ISFSGKTADD VLTPRVRVRF VKATDTANAV LTAAVETGHS RFPVFGEDSD DVVGLVHLKR AVAIPPDERA GVRVEQLMVP VPVVPGSIPL DDLMDELRSG LQMAVVADEY GGTAGLLTLE DVVEELVGEI KDEHDPVDSR AERRADDTWL LPGTLRPDEI VDITGVRLPE SSAYETVAGL LIARLGRMPK EFDAVEVDAT MDASAHLGVP DRQVVHSDDR RIEPETEDDL PRAVTVRLTV HGLARRRIEA VLLSAVSLNT GDEDENESDE RRGDGR
|
| |