Gene Ndas_1283 details

Gene Information

Locus tag: Ndas_1283
Symbol:
ID: 9245133
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1590960
End bp: 1592147
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: CRISPR-associated protein, Cse4 family
Protein accession: YP_003679227
Protein GI: 297560253
COG category:
COG ID:
TIGRFAM ID:
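
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 1590960
end_bp = 1592147

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1188
print(protein_length)  # 395
```

Both values agree with the Gene Length (1188 bp) and Protein Length (395 aa) fields.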


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGCA CCATTCTCGA CGTGCACGTG TTGCAGACCG TGCCGCCCAG CAACCTCAAC 
CGCGACGACA CCGGCGCACC CAAGACGGCC GTCTACGGCG GCGTGCGCCG CTCCCGGGTC
TCCAGCCAGG CCTGGAAGCG CGCCACCCGC CTGGCCTTCG ACGCCCTGCT CGACCCCCGG
GAGCTGGGCA CCCGCACCAA GCGCGTCGCC GAGCTGGTCG CCTCCCGCAT CCGTGACCTG
GACACCTCCA TCGAGGAGGC CGAGGCGCTG ACCCTGGCCG CCGAGACCGT CCAGGTCGCC
ACCGGCTCCA GGATCGAGGT GCCCAAGCGC AAGGCCGACG CGGCCAAGAA GAACGGCACC
AAGGAACCCG CTCCCGAGTC CGCCTACCTG ATGTTCCTCA GCGCCCGCCA GCGCGACGGA
CTGGCCGCAC TCGCCGTGGA GGGCCGTGAG GACATCAAGG CCTTCCTCAA GGAGAAGGAG
AACAAGGCGC GCGCCAAGGC GGTCGCCGAC ACCCGCCACT CGGTGGACAT CGCCCTGTTC
GGCCGCATGG TCGCCGACGG CGCCGACGTC AACGTGGACG CCGCCGCCCA GGTCGCGCAC
GCCCTCAGCG TGCACGCCGT GGAGAACGAG TCCGACTACT ACACGGCCGT GGACGACCGC
AACCCCGAGG AGGAGACCGG CGCGGGCATG ATCGGCACCG TCGAGTTCAA CTCCGCCACC
CTGTACCGCT ACGCCGCCGT GGACGTGGAC CTGCTCCGCA GGAACCTGGG GGAGGGGCTG
CGCGAGGACG AGCCGGTCAC CGAACCGCTG CGCCGGGCCG TGGAGGCGTT CGTGCGCGGG
TTCGTGGAGT CGATGCCCAC CGGCAAGGTG AACACCTTCG GCAACCACAC CCTGCCCGAC
GCGGTCGTGG TCAAGCTGCG CGGCGCGCGC CCGATCAGCT TCGTGGGGGC CTTCGAGGAG
CCGGTGGAGG CGGGCGCGGG GCACGTGGCC CAGGCCAGCG CGCGCCTGGC CGAGTACGTG
CCGCAGGTGG AGCGGGCGTT CGGGGCGGCC GACGACGTCG ACACCTGGGT GGTGCGGGTG
GGGGAGCGCA CCGCCAAGCT CTCCGGCCTG GGCGAGGAGG TCACCCTCCC CGAACTGGTC
GAGCGGGTCG GCGCGGCCGT CGCCCAGCGC CAGGCGCCGC GGGCATGA
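
The gene sequence is printed in space-separated blocks of 10 bases, so the spaces and line breaks must be stripped before any computation. The following is a minimal sketch of how a GC percentage such as the one in the GC content field could be computed; the helper names clean_sequence and gc_fraction are hypothetical, and only the first line of the sequence is used as a short example.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as a short example.
first_line = "ATGAGCCGCA CCATTCTCGA CGTGCACGTG TTGCAGACCG TGCCGCCCAG CAACCTCAAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq) * 100))
```

Passing the full sequence instead of first_line gives the GC percentage for the whole gene.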
 
Protein sequence
MSRTILDVHV LQTVPPSNLN RDDTGAPKTA VYGGVRRSRV SSQAWKRATR LAFDALLDPR 
ELGTRTKRVA ELVASRIRDL DTSIEEAEAL TLAAETVQVA TGSRIEVPKR KADAAKKNGT
KEPAPESAYL MFLSARQRDG LAALAVEGRE DIKAFLKEKE NKARAKAVAD TRHSVDIALF
GRMVADGADV NVDAAAQVAH ALSVHAVENE SDYYTAVDDR NPEEETGAGM IGTVEFNSAT
LYRYAAVDVD LLRRNLGEGL REDEPVTEPL RRAVEAFVRG FVESMPTGKV NTFGNHTLPD
AVVVKLRGAR PISFVGAFEE PVEAGAGHVA QASARLAEYV PQVERAFGAA DDVDTWVVRV
GERTAKLSGL GEEVTLPELV ERVGAAVAQR QAPRA
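
The protein sequence can be checked against the gene sequence by translating it. The sketch below builds the codon table in the usual NCBI base order (T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard table and differs only in the start codons it permits, which does not matter here because this gene starts with ATG. The names CODON_TABLE and translate are illustrative.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence above.
print(translate("ATGAGCCGCA CCATTCTCGA CGTGCACGTG"))  # MSRTILDVHV
print(translate("CAGGCGCCGC GGGCATGA"))               # QAPRA
```

The two fragments reproduce the first ten and last five residues of the protein sequence listed above.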