Gene Information |
Locus tag | Namu_2374 |
Symbol | |
ID | 8447985 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 2614358 |
End bp | 2616058 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645041494 |
Product | M6 family metalloprotease domain protein |
Protein accession | YP_003201738 |
Protein GI | 258652582 |
COG category | [S] Function unknown |
COG ID | [COG4412] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03296] M6 family metalloprotease domain |
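
The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source record: the function name `check_lengths` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon.

```python
def check_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the record's coordinate and length fields agree.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the CDS length includes one terminal stop codon.
    """
    span = end_bp - start_bp + 1           # inclusive span on the replicon
    whole_codons = gene_length_bp % 3 == 0  # a CDS should be a multiple of 3
    coding_codons = gene_length_bp // 3 - 1  # drop the stop codon
    return (
        span == gene_length_bp
        and whole_codons
        and coding_codons == protein_length_aa
    )


# Values copied from the Gene Information fields of Namu_2374.
print(check_lengths(2614358, 2616058, 1701, 566))
```

Under these assumptions the fields are consistent: 2616058 - 2614358 + 1 = 1701 bp, and 1701 / 3 = 567 codons, one of which is the stop codon, leaving 566 aa.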
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.00239302 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.000554149 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTGATC CGCAGATCCA TGCCCGAGCC ATTCACCGCG AGGCGTGTTT CGTCGCGCCG AGCCCGGAAC TGGCCGAGCG TTGGAAGGCC GACCTCGCCG GACTGCGCGG GGGCGGTGTC GGCTCCGACA TCGCCTCGGT CCTGGCGATC GCCCGCCAAC CCAGACCGCT GGGCTTCGAC GACGGCGTCA TCCTGCCGCC CGAGGAATAC CCGGTGGACA CACCGCTGAC CGCCATCCGC AATGCCGCCG CCGACCGCGC TCCGTTGCGT GGCACCGTCC GGGTCATCGT GGTGCTGGTG GACTTCAGCG ACAAGAAGAT GACCGCGACA ACCGACCACT TCCGGGACCT GTTCTTCTCC ACCGGCGTCC TGCCGCACGG CAGCGTGCGC GAGTACTACC GCGAGGTCAC CAACAATCTC GTCGACCTGG ACGGCGAGGT GGTGGGGCCG TTCCGGATGC CGCAGACGTT GGCCTGGTAC GCCAACGGGA ACTTCGGCAT CGGCCGGCCG ACGGGCACGA CCCGGGCCCG CGACATGGCC ATGGACGCGT TCCTGGCGGC CAACCCCAGT GTGAACTTCG GTCCCTACGA CAACGACGGC AACGGCTACG TCGACGCGTT CATCGTCATC CATGCCGGGA CCGGCGGCGA GGCGTCCGGC AACAGTGGCG ACATCTGGTC GCACAAATGG ACTCTCACCT CGCAGCAGAA CGCGGACGGG ACCAAGGTCT ACGGCTACCT GACCATCCCG GAGGACGCCA AGATCGGCGT CAGCGCGCAC GAGCTGGGCC ATCTGCTCTT CGGCTTCCCC GACCTGTACG ACACCGACAA CACCTCGGAG GGCATCGGGA ACTGGTGCCT GATGGCCGCC GGCTCGTGGG GCGGTGGGGG CGACGTCCCG GTCCACCCAT CGGCCTGGTG CAAGGCCAAT CAGGGCTGGG CCGCGGTCAC CAACGTGACG GCCAACGGGC CGGCCACGAT CCCCGACGTC AAGGCCAGCC ACACGGTGCA TCGCCTGTGG GAGGACGGTG CGGCCGGGCA GGAGTATTTC CTGGTCGAGA ATCGGCAACA GACCGGTTAC GACGTCAGCC TGCCGGCCGG CGGATTGCTC ATCTGGCACA TCGACGACGC GCAAACCTCC AACACCGACG AGAACCACTA CAAGGTGGCG CTCATGCAGG CCGACGGGCG ACGCGACCTG GAGCTCAACC ACAATCGGGG CGATGCCGGT GACCCGTACC CCGGATCCGC GGCCAACACC AGCTTCTCGT CGTCGTCCAC CCCGAACTCA CACTCCTACG CCGGCGCGGA CACCTGTGTG TCGGTCACCG GGATCTCGGC GGCCGGAGCC AGCATGACCG CCCAGCTCAC CGTCAGCTGC GGCAAGTCGG TGGTCAAGGA CGCCAAGGAC CACAAGGACG GCGTGAAGGA AGCCAAGGAG CCGGTCAAGG AACGGAAGGA CATCAAGGAC CACAAGGACG GCATCAAGGA TCGAAAGGAC GGCAAGGACG GCAAGGAACC CGTCAAGGAA CGCAAGGACA TCAAGGACCA CAAGGACGGC GTCAAGGACG TCAAGGAGCC GTTCAAGGAA CGCAAGGACA TCAAGGATCA CACCGAGGGC AAGGGCCCGC TGGCCGATCG GCCGCCGCTG CCCCCGGGCC GGGCCCCCTC GCGGCGTACG CCGACCAGGG CGGCGACATG A
|
Protein sequence | MADPQIHARA IHREACFVAP SPELAERWKA DLAGLRGGGV GSDIASVLAI ARQPRPLGFD DGVILPPEEY PVDTPLTAIR NAAADRAPLR GTVRVIVVLV DFSDKKMTAT TDHFRDLFFS TGVLPHGSVR EYYREVTNNL VDLDGEVVGP FRMPQTLAWY ANGNFGIGRP TGTTRARDMA MDAFLAANPS VNFGPYDNDG NGYVDAFIVI HAGTGGEASG NSGDIWSHKW TLTSQQNADG TKVYGYLTIP EDAKIGVSAH ELGHLLFGFP DLYDTDNTSE GIGNWCLMAA GSWGGGGDVP VHPSAWCKAN QGWAAVTNVT ANGPATIPDV KASHTVHRLW EDGAAGQEYF LVENRQQTGY DVSLPAGGLL IWHIDDAQTS NTDENHYKVA LMQADGRRDL ELNHNRGDAG DPYPGSAANT SFSSSSTPNS HSYAGADTCV SVTGISAAGA SMTAQLTVSC GKSVVKDAKD HKDGVKEAKE PVKERKDIKD HKDGIKDRKD GKDGKEPVKE RKDIKDHKDG VKDVKEPFKE RKDIKDHTEG KGPLADRPPL PPGRAPSRRT PTRAAT
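
The relationship between the gene sequence and the protein sequence above can be sketched as a codon-by-codon translation. This is a minimal illustration under stated assumptions, not the database's own method: the names `CODON_TABLE` and `translate` are hypothetical, the gene sequence is assumed to be listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), and the codon-to-amino-acid assignments of translation table 11 are taken to be the same as those of the standard code, differing only in which codons may act as start codons.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Any trailing partial codon is skipped. Characters other than A, C, G, T
    raise KeyError, since ambiguity codes are not handled in this sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with the record's spacing.
print(translate("ATGGCTGATC CGCAGATCCA TGCCCGAGCC"))  # MADPQIHARA
```

The printed residues match the first block of the protein sequence, and translating the final bases `GCGGCGACATG A` yields `AAT` followed by the TGA stop codon, matching the end of the protein. Because this gene starts with ATG, the alternative start codons permitted by table 11 do not need special handling here.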