Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_3049 |
Symbol | |
ID | 8448662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 3359248 |
End bp | 3360204 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645042132 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_003202374 |
Protein GI | 258653218 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0000891408 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.000905677 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGGAAGA TCCCCGGCAC TCGTCCGCCC GAACTTCCCG AGCTCGTCCG CGCGCAGGAC CGGATCTCCT TCGTCTACCT CGAACGATGC ATCGTCCACC GCCAGGACAA CGCGATCACG GCGACGGACG AACGAGGGAC CGTCCATCTG CCGGCGGCCA CCCTCGGCGC CCTGCTCCTA GGTCCAGGAA CCCGGGTCAG TCACCAAGCG ATGGTGCTGC TGGCCGAGTC CGGTTCCACG GCAGTGTGGG TCGGCGAGCG GGGCGTCCGC TACTACGCGC ATGGTCGCAG CCTGGCTCGC TCGTCGCGGC TGCTGGAGGC GCAGGCCGCG ATCGTGAGCA ATCAGCAGCG CCGATTGGCC GTCGCCCGGG CCATGTATGC CATGCGATTT CCCGGTGAGG ATGTCGAAGG CCAAACGATG CAACAGCTTC GGGGTCGCGA GGGTGCCCGT GTCCGGCGCG TGTACCGATC GATGTCCGCC GAGACCGGTG TGGTTTGGGA CAAACGGGAC TACAACAGCG AGGATTTTGC GTCCGGGACG CTCATCAATC AGGCGCTCTC GGCGGCCCAC ACCTGCTTGT ACGGGATTGT GCACGCGGTG ATCGTCGCCC TCGGTTGCTC GCCGGGCCTC GGGGTGGTCC ACACCGGACA CGTTCGGTCA TTCGTCTTTG ACATCGCCGA TCTCTACAAG GCCGAAATTT CCATCCCGGT GGCCTTCCGA GTTGCAGCCA CTGAACCCGA GGACGTGGGC GCGGAGACGC GACGGGCGGT TCGCGACGCC GTGCACGACG GCAAGATCCT CGCCCGTTGC GCCCGAGACA TCCGTCAACT CCTCTTGCCG GACCAGGATC CGGTCGAGGA CGACGTGGAC GCCGACGTCA TCAATCTCTG GGACGGCGAT GATCGGGTCG TGTCCGGAGG GACCGGCTAC GTGGAGTCGG ACACATGGTC GTTCTGA
|
Protein sequence | MRKIPGTRPP ELPELVRAQD RISFVYLERC IVHRQDNAIT ATDERGTVHL PAATLGALLL GPGTRVSHQA MVLLAESGST AVWVGERGVR YYAHGRSLAR SSRLLEAQAA IVSNQQRRLA VARAMYAMRF PGEDVEGQTM QQLRGREGAR VRRVYRSMSA ETGVVWDKRD YNSEDFASGT LINQALSAAH TCLYGIVHAV IVALGCSPGL GVVHTGHVRS FVFDIADLYK AEISIPVAFR VAATEPEDVG AETRRAVRDA VHDGKILARC ARDIRQLLLP DQDPVEDDVD ADVINLWDGD DRVVSGGTGY VESDTWSF
|
| |