Gene Ndas_1281 details


Gene Information

Locus tag: Ndas_1281
Symbol:
ID: 9245131
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1588426
End bp: 1590147
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: CRISPR-associated protein, Cse1 family
Protein accession: YP_003679225
Protein GI: 297560251
COG category:
COG ID:
TIGRFAM ID:
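
The Start bp, End bp, Gene Length and Protein Length values above agree with one another if the coordinates are read as 1-based and inclusive, and if the final codon of the CDS is a stop codon. The Python sketch below shows that arithmetic. The function names are illustrative and not part of the source database, and the inclusive-coordinate convention is an assumption.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(1588426, 1590147)
print(length)                  # 1722
print(protein_length(length))  # 573
```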


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCGACTG ACGCCTCCGG GCCAACACCA CCCCCACCGC GACCCGCGCC CCCGCCGACG 
CTCCCGCCCT CCTTCGACCT GACCAGCCGA CCCTGGGTTC CCGTCCAGCG GCTCGACGGG
ACGGAGGCCG AACTCTCCCT GACCGGGGTC TTCGAGCAGG CCGCGCGGAT CCGGCGCCTG
GTCGGGGACG TGCCCACCCA GGACTTCGCC CTCCTGCGGC TGCTCCTGGC GATCCTGCAC
GACGCGATCG ACGGCCCGGA GGACATCGAG GACTGGGCGG ACCTGTGGGA CGAGGGTCGG
GGAGAACTCC CCGCGGACCG CGTCCGCGAC TACCTCGGCG AGCACCGCGA CCGCCTCGAC
CTGCTGCACC CGACCGCGCC CTTCCTCCAG GTGGCGGACC TGCGCACGGC CAAGGGCGAG
TACTCCTCCC TCGACCCGAT CGTGGCCGAC GTCCCCAACA ACGCACGCTT CTTCACCATG
CGCGCGCACG GGGCCGAAAG TCTGGGCTTC GCCGAGGCCG CCCGCTGGCT CCTGCACGCC
CACGCCTACG ACACCTCCGG GATCAAGTCC GGAGTGGTCG GCGACCCCCG GGTCAAGGGC
GGCAAGGTCT ACCCGCAGGG GGTGGGGTGG TCCGGGAACC TCGGCGGGAT CCACATGGAG
GGCGACGACC TGCGCGCGAC CCTCCTGCTC AACCTGCTGC CCCGCGACAC CGACAACCTG
CGTTCGCGCC CCGACGACCG CCCGGCCTGG CGCCGGGCCC CGGCCACCGC CGAGGCACTC
GGGGGAGCGG AGGCCCAGAC CCGCCCCCAC GGCCTGCGCG ACCTCTACAC CTGGCAGAGC
CGACGCGTTC GGCTGCACCA CGACGGTGAA AGCGTCCACG GGGTCCTGCT CGCCTACGGT
GACCCGCTCA CCCCGCGCAA CAAGCACGAC CGCGAGCCCA TGACCGCCTG GCGGCGCAGT
CCCGCACAGG AGAAGAAGCT CGGTGAGGAG CAGGTCTACC TGCCCCGGGA CCACGACCCC
GCCCGCAGCG CCTGGCGCGG CCTCGCCGCC CTGGTCACCG GCCGCGTCCG GGGCGCCGAA
CAGCGCCGTG AGGCCGCGAA GATCGTGCGC CCCCGGGTAC TGGACTGGAT CGCGCGCCTG
ACCGTGGAGG AGTACCTGGA CAAGGGCTTC CTTCTCCGCG CGCGCCTGGT CGGCGCCGTC
TACGGCACTC AGCAGTCCGT CATCGACGAG ATCGTGGACG ACACCGTCGC CATGCCCGTG
GTCCTGCTGC ACGATCAGGA CCGCGCCCTG GGCCAGACCG CGGTCGACGC GGTCAACGAC
GCCGAGGAGG CGGTCATGGT CCTCGGGGAC CTGGCCACCG CCCTGGCCGA GGCGGCGGGC
GCGGAGACCG AGGCCCCCCG GGCCGCCGCC CGCGACCGGG GCTTCGCGGA GCTGGACGAG
CCCTTCCGCA AGTGGCTGCG CGACCTGCGC CCCTCAGAGG ACCCCCTCTA CCCGGACGAG
CAGCGCCGCG TCTGGCAGCT CAGGGCACAC CGGATCGTGT CCCAGCTCGG CGCCGAACTC
ATGGACACGG CAGGGGAGGC CGCCTGGACG GGCCGGGTCG TGGCCACCAA GAACGGCTCG
GTCTGGCTCA CCGCCTCCCG GGCCGACCTG CGCTTCCGCT CCGCCCTGCG CAGGGCGCTC
CCGCTCACCA ACACCGACCA CTCCACGGAG GAAGAGCAGT GA
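
The GC content reported for this gene can be recomputed from the sequence listing. The Python sketch below shows one way to do so. It ignores the spaces and line breaks used to group the bases in blocks of ten. The function name `gc_percent` is illustrative, and the example runs only on the first line of the listing, not the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are ignored, so the listing can be
    pasted in as shown, grouping spaces included.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence listing (60 bases).
first_line = "GTGCCGACTG ACGCCTCCGG GCCAACACCA CCCCCACCGC GACCCGCGCC CCCGCCGACG"
print(round(gc_percent(first_line), 1))  # 80.0
```

Passing the full 1722 bp listing to the same function gives a figure that can be compared with the reported 73%.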
 
Protein sequence
MPTDASGPTP PPPRPAPPPT LPPSFDLTSR PWVPVQRLDG TEAELSLTGV FEQAARIRRL 
VGDVPTQDFA LLRLLLAILH DAIDGPEDIE DWADLWDEGR GELPADRVRD YLGEHRDRLD
LLHPTAPFLQ VADLRTAKGE YSSLDPIVAD VPNNARFFTM RAHGAESLGF AEAARWLLHA
HAYDTSGIKS GVVGDPRVKG GKVYPQGVGW SGNLGGIHME GDDLRATLLL NLLPRDTDNL
RSRPDDRPAW RRAPATAEAL GGAEAQTRPH GLRDLYTWQS RRVRLHHDGE SVHGVLLAYG
DPLTPRNKHD REPMTAWRRS PAQEKKLGEE QVYLPRDHDP ARSAWRGLAA LVTGRVRGAE
QRREAAKIVR PRVLDWIARL TVEEYLDKGF LLRARLVGAV YGTQQSVIDE IVDDTVAMPV
VLLHDQDRAL GQTAVDAVND AEEAVMVLGD LATALAEAAG AETEAPRAAA RDRGFAELDE
PFRKWLRDLR PSEDPLYPDE QRRVWQLRAH RIVSQLGAEL MDTAGEAAWT GRVVATKNGS
VWLTASRADL RFRSALRRAL PLTNTDHSTE EEQ
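
The gene sequence begins with GTG, yet the protein sequence begins with M. This fits translation table 11, in which alternative start codons such as GTG are read as methionine when they initiate translation. The Python sketch below is a minimal illustration of that rule, not the database's own pipeline. It builds the standard codon-to-amino-acid mapping, which table 11 shares for all non-initiating codons. The `START_CODONS` set is a simplifying assumption that covers only ATG, GTG and TTG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: only the three most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an initiating start codon as M."""
    dna = "".join(seq.split()).upper()  # drop grouping spaces and newlines
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


print(translate_cds("GTGCCGACTG ACGCCTCC"))  # MPTDAS (first six codons)
print(translate_cds("GAAGAGCAGT GA"))        # EEQ (last three codons + stop)
```

The two fragments are the first 18 and last 12 bases of the gene sequence above, and their translations match the first and last residues of the protein sequence.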