Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1123 |
Symbol | |
ID | 5055810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1019813 |
End bp | 1020697 |
Gene Length | 885 bp |
Protein Length | 294 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640468679 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001153353 |
Protein GI | 145591351 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0138486 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGTGG CGGTCGCTAG CTACGGGGCC AGAATAAGGG CGAGGAGGGG GCTACTGGTC GTGGAGGGGA GGGAGGGGAG GCGGGAGTAC CCCCTCCACC AGGTTGACGA GGTCCTGCTC CTCACCGGCG GCATCTCCAT CTCGGCGAGG GCGTTGAGGA TGTTGCTGAG GGCCGGGGCC ACGGTGGTTG TCATGGACGC AAGGGGGGAG CCCCTAGGCG TCTTCATGAA GCCCGTGGGC GACGCCACGG GGGCCAAGAG GCTGTGCCAG TACCAAGCCG CCGTCAGCGG CAGGGGGCTG GAAATCGCGA AGGGGTGGAT CTGGAGGAAA ATACGGGGCC AGCTGGAAAA CGTCAAGCGG TGGAGGCGCC GCCTCGCCAA GTACAGGGAA TACGCCGAGA GGATCTCCGC CGCGGCCGAC GCCCTCAGAC ACGCCGCGGA GCCCCACGAC GTCCTCGAGG CCGAGGCGGC CGCCGCGGAG GCCTACTGGT CCGCATATAG GGAGGTGACG GGCTTCCCGG GCCGCGACCA GGAGGGCGGC GACCCGGTGA ACGCCGCCCT TAACTACGGC TACGGCGTTT TGAAGGCTCT CTGCTTCAAG TCTCTGCTAA TAGCCGGCCT CGACCCCTAC GTCGGCTTCC TCCACGCCGA GAAGTCCGGC CGGCCGTCGC TTGTGCTCGA CTTCATGGAG CAGTGGAGGC CGAGGGTAGA CGCCGTGGTG GCCAAGATAT ACGACGGGCT TGAAACCGAG GGCGGACTAC TCACCCACCA GTCGAGGCTG AGGGTCGCCG CCGCGGTGCT GGAGGAGCTC GGCGCCGCCG GCAAGCCGCT GTCAGCAGAA ATACACAGAG AGGCCAGGGC CTTGGCCAGG TCCATATGTA CCTAG
|
Protein sequence | MQVAVASYGA RIRARRGLLV VEGREGRREY PLHQVDEVLL LTGGISISAR ALRMLLRAGA TVVVMDARGE PLGVFMKPVG DATGAKRLCQ YQAAVSGRGL EIAKGWIWRK IRGQLENVKR WRRRLAKYRE YAERISAAAD ALRHAAEPHD VLEAEAAAAE AYWSAYREVT GFPGRDQEGG DPVNAALNYG YGVLKALCFK SLLIAGLDPY VGFLHAEKSG RPSLVLDFME QWRPRVDAVV AKIYDGLETE GGLLTHQSRL RVAAAVLEEL GAAGKPLSAE IHREARALAR SICT
|
| |