Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1133 |
Symbol | |
ID | 5055935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1028634 |
End bp | 1029512 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640468689 |
Product | CRISPR-associated Cas family protein |
Protein accession | YP_001153363 |
Protein GI | 145591361 |
COG category | [S] Function unknown |
COG ID | [COG5551] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01877] CRISPR-associated endoribonuclease Cas6 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.282696 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.252203 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCCGA GGTTTGAGCG GCGGCCCATC TTGATGGCAT CTTCAGGGGG AGTCGCGTAT TTAAGGGTGC GACCTCTTTG GGCTGTGTCT GTGGTGTTGG TGCGGGGGAG GCCCACTGAG GCCGTCGCCA TGGTTGGCTT CACCGGGACG GTGGCCCAGT CACTAGTGGT GTCCCTCCTC GGCGGGGAGC TCCACGACGC TAGGCCGAAG TCCTTCTCCG TCACGCCGTT CTTCGTCAAC GGTAGGCCGG CGGTGGACAA GGCCGTGGCG GGGCCCGGCG ACATTCTGGA GCTCCGCGCC GCCTTCGCCC AGAGGGAGCT CGCCGAGAGG TTCATAGCGG AGGTGGCCAA GGGCTACACC CTCTTCGGGA GGAGGGTCGT GGTGGAGGAG CTGGAGTTCT ACGACGTGTT CTCCCAGCCC CTCCCAGAGG CGCAGTGCTT CAAGCTGGAG TTCCTCACCC CGCTGAGATT CGCCGTGAAG CCCCTCTACA GGCGAAGCCG CGCCGTGTTC GACTTTCTCC CGCGGCCCCT CTCGGTGTTT AAATCCGCCG TGAGACACGG CAGGGCTCTT GGCCTATTGA AGCTGGGCGC CCCGTTCCTC CGCTGGGTGC ATACCTACGT GGCGCTCACC GACTTCGGGT GCCGCGGCAG GTGCGTGGTC ACCGTCAAGC TACCCAACGG CGGGGTGGCG AGGGGCTTCG TGGGCTGGGC GCTGTACCGC TCCTTCGGCA AGAGGAGGAT CGCCGACCTG TGGAGGGCCC TCCGCGTGGC AGAGGCCTTC AACCTCGGCA CCGGCCGAGG GATGGGCCTC GGCGTTGTGA GGGTGACCCC TCTTGACTGT CCAGGTAACG GCCCCGCGGC TCAGCGCGGG GACGCATAG
|
Protein sequence | MEPRFERRPI LMASSGGVAY LRVRPLWAVS VVLVRGRPTE AVAMVGFTGT VAQSLVVSLL GGELHDARPK SFSVTPFFVN GRPAVDKAVA GPGDILELRA AFAQRELAER FIAEVAKGYT LFGRRVVVEE LEFYDVFSQP LPEAQCFKLE FLTPLRFAVK PLYRRSRAVF DFLPRPLSVF KSAVRHGRAL GLLKLGAPFL RWVHTYVALT DFGCRGRCVV TVKLPNGGVA RGFVGWALYR SFGKRRIADL WRALRVAEAF NLGTGRGMGL GVVRVTPLDC PGNGPAAQRG DA
|
| |