Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_3055 |
Symbol | |
ID | 8448668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 3365254 |
End bp | 3368010 |
Gene Length | 2757 bp |
Protein Length | 918 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645042138 |
Product | CRISPR-associated helicase Cas3 |
Protein accession | YP_003202380 |
Protein GI | 258653224 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.107976 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000610459 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCGCTGT GGCGGCACCT GAGCGACACC GCTGAAGTCG CCGGCCTGCT GTGGGACCGA TGGATGGCAC CGTCCGTCCG TCGGATCATC GCCGATGGAA GACCGGACGA CGTCGCGCGG ACCGTGCTCG TCTGGCTGGC CGGCTCGCAC GATCTGGGCA AGGCCAGCCC GGCATTCGCT GCCCAAGTTC GTCCACTCAC GGAGCCGATG CGCGACGCGG GTCTTCCGTT CAGCCCGTTG GTGGAGCATC CGCCAAGGAC GCCGCATGCT CAGCAGTCCC ACCGGATTCT GCGACAGTGG TTGGAGAGCA ATGGGTTCGA TCGGGACACG GCTGACTCAT GGGCCATTGT GCCCGGTGGT CATCATGGCG TCGCGCCGGG CCGGGACGAC ATCACGCACG CGGGTGTGGC GGCCCAGGCC GGCACCGGGC CATGGCGCGG CGTCCAGCAG GAGCTGACTG ACTGGGTAGC CCGGAACTCC GGCATCACCG CGCATCTGGA TTCGCTCCGC ACCGCAGCGC TCGCGCCGAC GGCACAGGTG GTGATCACGG CGGTGGTGAT CGTTGCCGAC TGGATCGCCA GCGACACCGC TCGGTTCCCC TGTTTCGATA CCAGGTCGAC GGCAGAGCGA GTTCGGAGCG GTTGGCGCGA CGTCGGCCTT CTCCCCGCCT GGTCGGCCGA TGCGGGAGCG GTGGAGACGC TGTTCGGACG TCGCTTCGGC CGGGAGGCGA GACCGGCGCA AGAAGAGGCC ATCCGCGCCG CCCAGGCGAT CACGGGTCCT GGTCTGATGA TCATCGAGGC GCCGATGGGG GAGGGCAAGA CGGAGGCGGC GCTGGCCGCC GGTGAGGTGC TCGCTGAACG GTTCGGCTGC GGCGGACTGA TCTTTGCGCT GCCGACGATG GCCACTTCCG ATGCGATGGT CGACCGGTTC ACCGGGTGGC TTCGCGGTGC TGCGCAGTCG GCACAGCCCT TCTTCCTCGC CCACAGCAAG GCACCCCTGA ACCCGACATT CGCCGCCATG CGCACGCTGT CGCCCACGAC GGGGGATGGC GGGGTCGATG GATGCGGAGC CGGGGTGGCA GTTGCGCTTG AGTGGCTGAC CGGACGCAAG AAGGGCATGC TGGCGAACTT CGTGATCGGG ACGATCGACC AGGTCTTGTA TGCAGCCTTG CGTGGCCGTC ACGTGGCGTT GCGGCACCTG GCCCTGGCCG GGAAGGTCGT CGTCATCGAC GAGGTCCACG CCGCCGATGT GTACATGGCA GTCTTCCTGC GGGATGTCCT GCGTTGGCTC GGTGCCTACG GTGTCCCGGT CATCCTGCTG AGCGCCACCC TGCCAGCGGC TCGCCGGCAG GAACTTCTGA ACGCCTACAG CACCGGGGCC CGGCCGGCGA CGCCAGCCTC CCCGACGGCT CGGGCTTCTC GACGCCGTCG GCCGCGGGCA CTCACGTCGG CTGGCCCGCC ACCCCCATCC TGCGACGAGG CGACAGCATC TGGTGAAGGC CTGGCGGGGG CGGGCCCTTT GCCGTACCCC CTGATCACGA TCGCCGGGGC CGGCAGCATG GTTCAGGTGG CGCCGGCGGC AGCCGGGCTG GCCCGCACGG TGCGGGTCGA ATTCGCCTCC GACGACGAGT CCGAGCTGGT CGACCTCGTC CGCGCAGCTT CCCAGGGCGG CGGAGTTGTC GGGATCATCC GGAACACCGT CGGGCGTGCC CAGCGCTCGG CCGCGGTCCT TCGGGCGGAA TTGGGTGAGG AGGTGGTGAC GTTAACGCAC TCGCGTTACC TGGCCACGGA TCGGATGCGC AGGGACCGTG ACCTGCTTGC CGAGCTGGGG GCTCCCGGCC AGGCCCGCCG GATGCCCGGC CCTCGTATCG TGGTCGGCAC CCAGGTACTC GAGCAATCCC TCGATATCGA CTTCGATCTG TTGGTCACCG ATCTGGCCCC GGCTGACCTG ATCCTGCAGC GAATGGGTCG CTTGCACCGC CATCGACGCC ACGCTCACGA GCGAGGATTA ATGACCAAGC CCCGGTGCGT CGTGGTCGGG GCGAATTGGG ATGCCGACCC CGTCGAACCT GATCGTGGAT CGGTGGCAGT TTATGGATTG GCAGCCCTGT TGCGGGCTGC GGTCGTCCTC AGATCGGACG GATCGCCAGG CAGGGATATT GAGCTTCCAG CGGATATCCC GCGGCTGGTC GATCAGGCCT ACGCCGCGGA GGTCGAGGTA CCCGAAGCGT GGTGCGCGAC CTTCGATGAC GCCGAACAGA AGGCTCAGCA GGAGAGCGCC GGTCGCACCA CTGGGGCCGA GATGTTCAGC ATCGGGAAGG TCCCGGTGCT GGACCTGTTC GGCTGGTCGC ATGCCAATGC CGGTGACGCG GAGGGACACG ATGGTCGGGC GAAGGTGCGG GACGGCGAGG ACGGGTTGGA GGTGCTTGTC GTCCAACAGC GGGGTGACGG CTGGTTCGTG CTTCCCTGGT TGAGCGATTG CGGCGGTGCG GAATTGCCCC GGGACGCAGC CCCGGAACCG CGGCTGGCTC GGGCCGTGGC TCAATGCTCG TTGCGGCTAC CCCGACAGCT GTCGGTCTGG CGAATTGACC AGGTCATCGG CGAACTTGAA CGCCAAGGTC GGGCGTCCTG GCAGCAATCT CCCTGGGTGC GCGGAGAGTT GGTCCTGCCG CTGAGTGAGG ACCTGACCGC GGAGCTTGCT GGATTCCGGC TGCGCTACTC ACGTCGGGAC GGGCTCGTCG TCGAGGAGGC AGAATGA
|
Protein sequence | MPLWRHLSDT AEVAGLLWDR WMAPSVRRII ADGRPDDVAR TVLVWLAGSH DLGKASPAFA AQVRPLTEPM RDAGLPFSPL VEHPPRTPHA QQSHRILRQW LESNGFDRDT ADSWAIVPGG HHGVAPGRDD ITHAGVAAQA GTGPWRGVQQ ELTDWVARNS GITAHLDSLR TAALAPTAQV VITAVVIVAD WIASDTARFP CFDTRSTAER VRSGWRDVGL LPAWSADAGA VETLFGRRFG REARPAQEEA IRAAQAITGP GLMIIEAPMG EGKTEAALAA GEVLAERFGC GGLIFALPTM ATSDAMVDRF TGWLRGAAQS AQPFFLAHSK APLNPTFAAM RTLSPTTGDG GVDGCGAGVA VALEWLTGRK KGMLANFVIG TIDQVLYAAL RGRHVALRHL ALAGKVVVID EVHAADVYMA VFLRDVLRWL GAYGVPVILL SATLPAARRQ ELLNAYSTGA RPATPASPTA RASRRRRPRA LTSAGPPPPS CDEATASGEG LAGAGPLPYP LITIAGAGSM VQVAPAAAGL ARTVRVEFAS DDESELVDLV RAASQGGGVV GIIRNTVGRA QRSAAVLRAE LGEEVVTLTH SRYLATDRMR RDRDLLAELG APGQARRMPG PRIVVGTQVL EQSLDIDFDL LVTDLAPADL ILQRMGRLHR HRRHAHERGL MTKPRCVVVG ANWDADPVEP DRGSVAVYGL AALLRAAVVL RSDGSPGRDI ELPADIPRLV DQAYAAEVEV PEAWCATFDD AEQKAQQESA GRTTGAEMFS IGKVPVLDLF GWSHANAGDA EGHDGRAKVR DGEDGLEVLV VQQRGDGWFV LPWLSDCGGA ELPRDAAPEP RLARAVAQCS LRLPRQLSVW RIDQVIGELE RQGRASWQQS PWVRGELVLP LSEDLTAELA GFRLRYSRRD GLVVEEAE
|
| |