Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1281 |
Symbol | |
ID | 9245131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1588426 |
End bp | 1590147 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | CRISPR-associated protein, Cse1 family |
Protein accession | YP_003679225 |
Protein GI | 297560251 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGACTG ACGCCTCCGG GCCAACACCA CCCCCACCGC GACCCGCGCC CCCGCCGACG CTCCCGCCCT CCTTCGACCT GACCAGCCGA CCCTGGGTTC CCGTCCAGCG GCTCGACGGG ACGGAGGCCG AACTCTCCCT GACCGGGGTC TTCGAGCAGG CCGCGCGGAT CCGGCGCCTG GTCGGGGACG TGCCCACCCA GGACTTCGCC CTCCTGCGGC TGCTCCTGGC GATCCTGCAC GACGCGATCG ACGGCCCGGA GGACATCGAG GACTGGGCGG ACCTGTGGGA CGAGGGTCGG GGAGAACTCC CCGCGGACCG CGTCCGCGAC TACCTCGGCG AGCACCGCGA CCGCCTCGAC CTGCTGCACC CGACCGCGCC CTTCCTCCAG GTGGCGGACC TGCGCACGGC CAAGGGCGAG TACTCCTCCC TCGACCCGAT CGTGGCCGAC GTCCCCAACA ACGCACGCTT CTTCACCATG CGCGCGCACG GGGCCGAAAG TCTGGGCTTC GCCGAGGCCG CCCGCTGGCT CCTGCACGCC CACGCCTACG ACACCTCCGG GATCAAGTCC GGAGTGGTCG GCGACCCCCG GGTCAAGGGC GGCAAGGTCT ACCCGCAGGG GGTGGGGTGG TCCGGGAACC TCGGCGGGAT CCACATGGAG GGCGACGACC TGCGCGCGAC CCTCCTGCTC AACCTGCTGC CCCGCGACAC CGACAACCTG CGTTCGCGCC CCGACGACCG CCCGGCCTGG CGCCGGGCCC CGGCCACCGC CGAGGCACTC GGGGGAGCGG AGGCCCAGAC CCGCCCCCAC GGCCTGCGCG ACCTCTACAC CTGGCAGAGC CGACGCGTTC GGCTGCACCA CGACGGTGAA AGCGTCCACG GGGTCCTGCT CGCCTACGGT GACCCGCTCA CCCCGCGCAA CAAGCACGAC CGCGAGCCCA TGACCGCCTG GCGGCGCAGT CCCGCACAGG AGAAGAAGCT CGGTGAGGAG CAGGTCTACC TGCCCCGGGA CCACGACCCC GCCCGCAGCG CCTGGCGCGG CCTCGCCGCC CTGGTCACCG GCCGCGTCCG GGGCGCCGAA CAGCGCCGTG AGGCCGCGAA GATCGTGCGC CCCCGGGTAC TGGACTGGAT CGCGCGCCTG ACCGTGGAGG AGTACCTGGA CAAGGGCTTC CTTCTCCGCG CGCGCCTGGT CGGCGCCGTC TACGGCACTC AGCAGTCCGT CATCGACGAG ATCGTGGACG ACACCGTCGC CATGCCCGTG GTCCTGCTGC ACGATCAGGA CCGCGCCCTG GGCCAGACCG CGGTCGACGC GGTCAACGAC GCCGAGGAGG CGGTCATGGT CCTCGGGGAC CTGGCCACCG CCCTGGCCGA GGCGGCGGGC GCGGAGACCG AGGCCCCCCG GGCCGCCGCC CGCGACCGGG GCTTCGCGGA GCTGGACGAG CCCTTCCGCA AGTGGCTGCG CGACCTGCGC CCCTCAGAGG ACCCCCTCTA CCCGGACGAG CAGCGCCGCG TCTGGCAGCT CAGGGCACAC CGGATCGTGT CCCAGCTCGG CGCCGAACTC ATGGACACGG CAGGGGAGGC CGCCTGGACG GGCCGGGTCG TGGCCACCAA GAACGGCTCG GTCTGGCTCA CCGCCTCCCG GGCCGACCTG CGCTTCCGCT CCGCCCTGCG CAGGGCGCTC CCGCTCACCA ACACCGACCA CTCCACGGAG GAAGAGCAGT GA
|
Protein sequence | MPTDASGPTP PPPRPAPPPT LPPSFDLTSR PWVPVQRLDG TEAELSLTGV FEQAARIRRL VGDVPTQDFA LLRLLLAILH DAIDGPEDIE DWADLWDEGR GELPADRVRD YLGEHRDRLD LLHPTAPFLQ VADLRTAKGE YSSLDPIVAD VPNNARFFTM RAHGAESLGF AEAARWLLHA HAYDTSGIKS GVVGDPRVKG GKVYPQGVGW SGNLGGIHME GDDLRATLLL NLLPRDTDNL RSRPDDRPAW RRAPATAEAL GGAEAQTRPH GLRDLYTWQS RRVRLHHDGE SVHGVLLAYG DPLTPRNKHD REPMTAWRRS PAQEKKLGEE QVYLPRDHDP ARSAWRGLAA LVTGRVRGAE QRREAAKIVR PRVLDWIARL TVEEYLDKGF LLRARLVGAV YGTQQSVIDE IVDDTVAMPV VLLHDQDRAL GQTAVDAVND AEEAVMVLGD LATALAEAAG AETEAPRAAA RDRGFAELDE PFRKWLRDLR PSEDPLYPDE QRRVWQLRAH RIVSQLGAEL MDTAGEAAWT GRVVATKNGS VWLTASRADL RFRSALRRAL PLTNTDHSTE EEQ
|
| |