Gene Information |
Locus tag | Pcal_0272 |
Symbol | |
ID | 4908765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum calidifontis JCM 11548 |
Kingdom | Archaea |
Replicon accession | NC_009073 |
Strand | - |
Start bp | 267423 |
End bp | 270230 |
Gene Length | 2808 bp |
Protein Length | 935 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640124024 |
Product | CRISPR-associated Cmr2 family protein |
Protein accession | YP_001055175 |
Protein GI | 126458897 |
COG category | [R] General function prediction only |
COG ID | [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) |
TIGRFAM ID | [TIGR02577] CRISPR-associated protein, Cmr2 family |
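The coordinate and length fields above are related by simple arithmetic, which the short Python sketch below reproduces. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 267423
end_bp = 270230

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon, which is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 2808
print(protein_length)   # 935
```

Under these assumptions the computed values agree with the Gene Length (2808 bp) and Protein Length (935 aa) fields.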
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTTGC GCGAGTTCTT TTTGACAAAG GCGTACGCCC TCTTCCACGA CCCGCCCGAC AAAATGTGGG GGCTAAAGGG CCACGAGGAG CGGGCCTGGG ACATGTTCAA CAAGGTGCGC CGCGGGAGCC CGCTAGACGG CGAACTGCCG GAGTGGGCGA GGGAGGTGGT CAAGCTGGCG GACCGCATGG CCGCCTCCAT GGACAGATAC GCCTTCTACT CGGGGGGCGG GGAGGGCGTG AAGTACGACA AGCTCCACAA CCTCTTCAAC CCACGCCTGT GGCGCGCCCT CGCGCCGCCG GACCCGGGGA AAGTCGCCGA GTACGTCGAG AATCTGGGCG GCCGAGTGAG GGCGGCTAAG GGGCCTAGAG AGGCGTACCA CGTCCTCTAC GCCCTATTCG AGGCCCTGTG GATTGACGAG GGGCTTGGGC CAAGCCTCGC AGACCCGAGG GCCCCCACCC ACGACGTGTT TGACCACTTG TACGCGACGG CCATGGTGGC CAACTGGCTC CTCGCCGGGG GCAAGCCCTC CGGCTACTTC GTCACCTTAG ACGTGCCGGG GGTGCAAGAC TTCGTCAAGG CTGGGAGAAA GGCCGGCGAC ATTTGGGCAG GTAGCTGGGC CTTGTCCATG GCGGTGTGGC TCACCGTGTG GCCCTTTGCC TGGGAGTACG GCCCCGACGT GCTCCTGAGG CCCACGGCGA GGCTCAACCC CTACTACTAC GCCTTCGCCG CCGCCCGGGG GCTGGGGGTG GAACACGGCG TCTTTGCCAA ATTGCTGTCC CCCTACGTCA AAGGAGAGTC GTTTTACGAC TTGGTCAACT TTGCCGTTGT CCAGCCGCTG ATTGGGGAGA GGGCCCAGAT TGTCCTGCCG CCGTACCGCG TCGAGGGGGA CAAAATTGCC AGCTGGGGCG GGGCCGAGGA GGTGAGAGAG GCTGTCCGAG ACAGGTTCAA GAGGGCCGTC GAGTGTCTAA CCCATTTCGC AAAGGGGCAA CGGGGCGCGG GGGATGAGTA CTGCGAAGAC TTCTTCAAAG TGGTAGAGGA GGGGCGCGGA GAGGGAAGAG GCGGAGGCGA GGTGGAAAAA ATTGCGGCAT GGTTGCAGAA GTACGCAGAG CTTAGGCTCC CGCTGAGAGT GGAGGTGGTG GACGTCGGCG CTGTGTATGA CAGCATAAAG TGTCCAAGGG AGGTGGCGAA GGCCATAGAA GAGGCGTTGG GGGACGGCGA GGAGCGCTGC AAAATCCTCT TCCTCTTCGA CGCCGTATTG AGAGGGGGGA GGGAGGGGAC AAAGCTCGTG GAACCCCAGA GGCGCAGGTT CATCCCCACG CGAGGCCCCT GGTTTAAGCC GGGCGGATAC GACGACCTCA ACCCCCACTT CAACTTCCCA CTAGACCCAA ACTGGAGGGT CTGTAGTCTC TGCGGCTCTG AGCCCGCCGT CGTCGGAGTG AGGAAGGTGG CGAGGGAGGG GAGGGAGGAC TACAACGAGG AGGACTTGAG GCGGATATCT GCAGAAGTGG GGATAGAAGT GAGTAGACTG AAGAACCACT TGCGCAAAGT GTTGCGGCCC GGCGAGTACC TAGGCCCAGT GTGCTTGGCC AAGAGGCTGA TATACCTCAG AGCCGCCGAC CGGGAGCTCA TAAAATTTGA AAGCACAGAG GACGTGGCAG TGGTAAAGCT GGCTGAGAGT CACGAGCAAC ACTATGCAGA GTTAGATAGA GTACAGGAGT GTGTTAAGGT GGTCAACTAT TTGAGGACAA CGGGGGGCAG AGACCTTGAG GTCATTTGGG GAAACCCAGA GGCCATGAGA 
CGGGACTGGG ACAACTGCTT TAAAAAATTC GGCAGGCTCC GGGGGGTAGA GAGACTCGTG AGAGAGATCT TTGGCGTAGA GGGGGACGTG GGGAAGGCTG TTGAGGAGTT CGCGGGCCCC AGGCTCTTCT ACGCGGTGGT GAGGGCGGAC GGGGACAGCG TTGGGAAGTT GCTGGAGGGC CACCTCCCGG TCAACTGGTA TGGGGCGGTG GCGGAGGTGG TGGAAGGCGT GGGGGCGGAG GGCCGCGAGA AGGCGCTGGA GGTGCTCCGC GCCGTAGAAG AGCACATGCG CTCCTTCGGC GTGCCGCCCC CCGTGGTGAT TACCCCCACC TACCGCGTGG CGGTGAACCG CGCCATGGTG CTCACCTCGC TGAAGGACTT GGCCACCACG GAGAAGCACC GCGGCCTGTT GATATACGCC GGCGGCGACG ACGTAGTGGC GCTGTTGCCC GTGGAGACTG CGCTAGACGC CGCCGCGGAG TACAGAGAGA ACTACTGGGG AGAAGGGGGC TTCCACATTG TGGACAACTA CCCCGTGCCA GCCCTGGCGG CGTATGGGAG AAGCACGGCG GTGCGCTTCG TCCACCTAAT GGACCTAATG TCTGAGGAGC TTGGGAAGTC CTACCACGAC CTAGAGCACT TGGCCAAAGG GGGCAGGTGG GACTGCTTTG AAAAAGACTC CCTTACGATC ACCAGCTCGA GGGTTGAGGC AAAGGCGGTC CTCCCCTTTA GGCGGCCCAG AGAGGCCGTG GAGAGGCTGA AGGAGCTCTG GCTCCTCATG GCCCTTGGGC GCCTCAGCAA GAACGCGCCC CACGACCTCG ACGCCTACGA GGATCTGAAG AGGGACTTAG CCGCGTATTT AAAGGCGTGG CGCTACGCCC TAAAGAGAAA CGCCAGGGAG CTCTCCCCCG AGACTCTAAC CCGCCTCCTC TGCTTCATTG AGAAATACGG CGCAGAGGCA GGCGAGGGGC TGAAAGAGGC CATGAAGATA CTCAGGAGGT TGCCATGA
|
Protein sequence | MDLREFFLTK AYALFHDPPD KMWGLKGHEE RAWDMFNKVR RGSPLDGELP EWAREVVKLA DRMAASMDRY AFYSGGGEGV KYDKLHNLFN PRLWRALAPP DPGKVAEYVE NLGGRVRAAK GPREAYHVLY ALFEALWIDE GLGPSLADPR APTHDVFDHL YATAMVANWL LAGGKPSGYF VTLDVPGVQD FVKAGRKAGD IWAGSWALSM AVWLTVWPFA WEYGPDVLLR PTARLNPYYY AFAAARGLGV EHGVFAKLLS PYVKGESFYD LVNFAVVQPL IGERAQIVLP PYRVEGDKIA SWGGAEEVRE AVRDRFKRAV ECLTHFAKGQ RGAGDEYCED FFKVVEEGRG EGRGGGEVEK IAAWLQKYAE LRLPLRVEVV DVGAVYDSIK CPREVAKAIE EALGDGEERC KILFLFDAVL RGGREGTKLV EPQRRRFIPT RGPWFKPGGY DDLNPHFNFP LDPNWRVCSL CGSEPAVVGV RKVAREGRED YNEEDLRRIS AEVGIEVSRL KNHLRKVLRP GEYLGPVCLA KRLIYLRAAD RELIKFESTE DVAVVKLAES HEQHYAELDR VQECVKVVNY LRTTGGRDLE VIWGNPEAMR RDWDNCFKKF GRLRGVERLV REIFGVEGDV GKAVEEFAGP RLFYAVVRAD GDSVGKLLEG HLPVNWYGAV AEVVEGVGAE GREKALEVLR AVEEHMRSFG VPPPVVITPT YRVAVNRAMV LTSLKDLATT EKHRGLLIYA GGDDVVALLP VETALDAAAE YRENYWGEGG FHIVDNYPVP ALAAYGRSTA VRFVHLMDLM SEELGKSYHD LEHLAKGGRW DCFEKDSLTI TSSRVEAKAV LPFRRPREAV ERLKELWLLM ALGRLSKNAP HDLDAYEDLK RDLAAYLKAW RYALKRNARE LSPETLTRLL CFIEKYGAEA GEGLKEAMKI LRRLP
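The relationship between the gene sequence and the protein sequence can be spot-checked with a minimal translation sketch in Python. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares with table 1 for internal codons. The `translate` helper and the variable names are hypothetical, written only for this illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino-acid assignments as table 1
# and differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Codons containing unexpected characters are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in the record.
first_30 = "ATGGACTTGC GCGAGTTCTT TTTGACAAAG"
last_15 = "AGGAGGTTGC CATGA"

print(translate(first_30))  # MDLREFFLTK
print(translate(last_15))   # RRLP
```

The first ten residues and the last four residues obtained this way match the start (MDLREFFLTK) and end (RRLP) of the listed protein sequence. Passing the full gene sequence to the same helper would be the natural next check.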