Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1283 |
Symbol | |
ID | 9245133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1590960 |
End bp | 1592147 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | CRISPR-associated protein, Cse4 family |
Protein accession | YP_003679227 |
Protein GI | 297560253 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGCA CCATTCTCGA CGTGCACGTG TTGCAGACCG TGCCGCCCAG CAACCTCAAC CGCGACGACA CCGGCGCACC CAAGACGGCC GTCTACGGCG GCGTGCGCCG CTCCCGGGTC TCCAGCCAGG CCTGGAAGCG CGCCACCCGC CTGGCCTTCG ACGCCCTGCT CGACCCCCGG GAGCTGGGCA CCCGCACCAA GCGCGTCGCC GAGCTGGTCG CCTCCCGCAT CCGTGACCTG GACACCTCCA TCGAGGAGGC CGAGGCGCTG ACCCTGGCCG CCGAGACCGT CCAGGTCGCC ACCGGCTCCA GGATCGAGGT GCCCAAGCGC AAGGCCGACG CGGCCAAGAA GAACGGCACC AAGGAACCCG CTCCCGAGTC CGCCTACCTG ATGTTCCTCA GCGCCCGCCA GCGCGACGGA CTGGCCGCAC TCGCCGTGGA GGGCCGTGAG GACATCAAGG CCTTCCTCAA GGAGAAGGAG AACAAGGCGC GCGCCAAGGC GGTCGCCGAC ACCCGCCACT CGGTGGACAT CGCCCTGTTC GGCCGCATGG TCGCCGACGG CGCCGACGTC AACGTGGACG CCGCCGCCCA GGTCGCGCAC GCCCTCAGCG TGCACGCCGT GGAGAACGAG TCCGACTACT ACACGGCCGT GGACGACCGC AACCCCGAGG AGGAGACCGG CGCGGGCATG ATCGGCACCG TCGAGTTCAA CTCCGCCACC CTGTACCGCT ACGCCGCCGT GGACGTGGAC CTGCTCCGCA GGAACCTGGG GGAGGGGCTG CGCGAGGACG AGCCGGTCAC CGAACCGCTG CGCCGGGCCG TGGAGGCGTT CGTGCGCGGG TTCGTGGAGT CGATGCCCAC CGGCAAGGTG AACACCTTCG GCAACCACAC CCTGCCCGAC GCGGTCGTGG TCAAGCTGCG CGGCGCGCGC CCGATCAGCT TCGTGGGGGC CTTCGAGGAG CCGGTGGAGG CGGGCGCGGG GCACGTGGCC CAGGCCAGCG CGCGCCTGGC CGAGTACGTG CCGCAGGTGG AGCGGGCGTT CGGGGCGGCC GACGACGTCG ACACCTGGGT GGTGCGGGTG GGGGAGCGCA CCGCCAAGCT CTCCGGCCTG GGCGAGGAGG TCACCCTCCC CGAACTGGTC GAGCGGGTCG GCGCGGCCGT CGCCCAGCGC CAGGCGCCGC GGGCATGA
|
Protein sequence | MSRTILDVHV LQTVPPSNLN RDDTGAPKTA VYGGVRRSRV SSQAWKRATR LAFDALLDPR ELGTRTKRVA ELVASRIRDL DTSIEEAEAL TLAAETVQVA TGSRIEVPKR KADAAKKNGT KEPAPESAYL MFLSARQRDG LAALAVEGRE DIKAFLKEKE NKARAKAVAD TRHSVDIALF GRMVADGADV NVDAAAQVAH ALSVHAVENE SDYYTAVDDR NPEEETGAGM IGTVEFNSAT LYRYAAVDVD LLRRNLGEGL REDEPVTEPL RRAVEAFVRG FVESMPTGKV NTFGNHTLPD AVVVKLRGAR PISFVGAFEE PVEAGAGHVA QASARLAEYV PQVERAFGAA DDVDTWVVRV GERTAKLSGL GEEVTLPELV ERVGAAVAQR QAPRA
|
| |