Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_1152 |
Symbol | |
ID | 6166036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 1043300 |
End bp | 1045567 |
Gene Length | 2268 bp |
Protein Length | 755 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641668303 |
Product | CRISPR-associated Csm1 family protein |
Protein accession | YP_001794528 |
Protein GI | 171185609 |
COG category | [R] General function prediction only |
COG ID | [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) |
TIGRFAM ID | [TIGR02578] CRISPR-associated protein, Csm1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00402598 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000176122 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTGGCT ATAGGGAGTA CGTAGTTGCG GCCCTCCTCC ACGACGTTGG GAAGCTCATA AGGAGGGCGA AGCTGTGCCG TGGGGAGCCG GCTAGGCGCC ACGTGGAGGA GAGCGCGGAT TTCGTAGACG TCGTCGCGCC GGCGCTTAAG GCCGCTGGGG TTGATCCGCA GGCGGTGAGG GAGCTGGTTC TGAGGCACCA CGAGGGGGGC TGGGGGGTGG GGCCCTACGA CAGAGCCGCG GCTCTGGAGA GGGCGCCGGG CGACGAGGAG TCGGGCCAGG GCCTGGCCAT GCCGGGGCGG CGCGAGCACG AGATACCTCT GAGGTTGCCC ACGGGGGTGT ACGTCCCGCC GTGTCCAACG CCGCAGAGCC TCAGCGAGAG GCTCATCCCG TCTACGGAGC CGCCGAGCCC GGAGGAGGTG TGTAGGTGCT ACCGGAGGGC CTACGAGGAG CTGATGAGGC TGGCGGCGAA GGCGGCTCAG AGGAGGATGG GCTTCAAGGA GCTCGTGGAG ACGCTGGTAA ACGTGCTGAA GGCGACGGCG TCTTTTGTTC CAGCGGCTGT ATACGGCGTG AGGGAGCCAG ACACATCCCT CTACGCCCAC TCCCTCCTGG CCGCCGCCCT CGCCTCTACG GGGGGCGAGT TCTACCTGGT GTCTATAGAC GTGGGGAGGA TCCAGGAGTA CATCTCGAGG GCCGGCGCCA CCAAGGCCGC CATGGCCATC CTCAGGGGGC GCTCCCTCCG GATAAACGCC CTCCAGAGGG CCGCCGTAAG GTGGCTCATA GACAGGGTGG AGACGGCAAC ATACGCCAAC GTCCTCCTGG ACACCGGCGG GGAGGCCCTC CTCCTCCTGC CCAAGTTCGA CCTGGCGCTC CTCGACCAGC TGGAGAGTAG AGTGCTCCGG GAGACGGAGG GGGCCCTCGC CCTGTATGCC GCCGCGGCTG GGCCCTACAG GCTGGAGGAC GTGGCAAGGT TCAAAGACCT CATGAGGGAG CTCTCCCAGA GGGTCACAGA GCGTAAGTTC ATCTACCGCG ACTACGGAGC CCCGGCGGGG CCCGCCGCGA AGTGCCAGTT CTGCGGCCGG CTGTCCGCCA AGGTGACGCC GGAGAGGCTG AAAAGCGGGG AGGTCGTCGA CCTTTGCCAC CTCTGTAGAG ACGAGCTACA CATCGGCCGG GCCGCCCGCA ACCTGAAATA CATCGCCTTT CTCCCCAGGG GGGCACTGCC GCCCGGCTTG GCGGGGGCCG AGCGCGGGGA TGACACGGCT GTGGTGAACA TTCTGGACTA CGCCGTGGTC TTCAGCGGCC AGATGTCCAA GATGCCCCCC GCCTCCCACG CCGTCTACGC CACCAACAGG AGGGATTTCA TCCTCGACGC CGACGGGCCG GCGTACGGCA TGTGGTTCAC CAACACCCAC ATCTACTACA GGGAGGGCGA GGACTCCTCC CTCGACGCGG CCGGGAGGTA CGCCGCCTTC GTCAAGATGG ACGCCAACAG TATGGGGAGG CTGAAGGAGG CCGCCTCGCG GACCCCCTCG GCGCTGATCA CCTTCTCCCT CGCCGTCTCC ACCGCCTACG AGCTCTACCC AGCCCTCCTT GCGGACGAGA GGTACCGCGA GGTGCCCATC TTTGTGATCT ACGCGGGCGG CGACGATGCG GTGCTGGCCG GCAACCTCGA GGCGCTTCGG TACGCCGCCA GCGTGGCGAC CTACGCCGAG AAGTGGGGCT TCAAGACGGC GATTGGCGCC AAGATAGACA AGCCTCAGTA CCCCATCTAC TTCGCCTTCG CAGACACCGA GGAAAGGCTA GAGAGGGCGA AGGGGATAGA CAGGGGGCGG AGCATCGCCG TGTTGATAGC GGAGCCCGTC ACGATATACG AAGAGGCCGC GGAGCTCGAG AATGACTTGG AAAAAATCCC CAGATACAGG GAGGACGAGG AGACACGGCG CATGGGGGCC TTCGAGCGGA AGGTGTACGA GAGGCTCTTC GCCGCGTACG CCACCGCTGC CGTGGACGGC AAAGTGGACA AGAGGGTGGT CAAGAGGGCG CTGGCGAAGA TCGCCGTGGA GCTGGTCTAC ATGCTCAAGA GGCGCGAGGG GGATAAGGAG ACCACGGGGG TGCTTGAGGA GGTGGCTGGG CCTCTGTTCG CCAGCGCGGA GGGGGTTGGG TCCTTCTTCG CCGATCTAAT GGCGGGGAAA GGGCGTTTGG ACGAGCTGAG GCGCGCTGTG CTCCGGCTGT ATCTCCACCA CATCGCGCTG GCGTGGGCTC CCGAATGA
|
Protein sequence | MSGYREYVVA ALLHDVGKLI RRAKLCRGEP ARRHVEESAD FVDVVAPALK AAGVDPQAVR ELVLRHHEGG WGVGPYDRAA ALERAPGDEE SGQGLAMPGR REHEIPLRLP TGVYVPPCPT PQSLSERLIP STEPPSPEEV CRCYRRAYEE LMRLAAKAAQ RRMGFKELVE TLVNVLKATA SFVPAAVYGV REPDTSLYAH SLLAAALAST GGEFYLVSID VGRIQEYISR AGATKAAMAI LRGRSLRINA LQRAAVRWLI DRVETATYAN VLLDTGGEAL LLLPKFDLAL LDQLESRVLR ETEGALALYA AAAGPYRLED VARFKDLMRE LSQRVTERKF IYRDYGAPAG PAAKCQFCGR LSAKVTPERL KSGEVVDLCH LCRDELHIGR AARNLKYIAF LPRGALPPGL AGAERGDDTA VVNILDYAVV FSGQMSKMPP ASHAVYATNR RDFILDADGP AYGMWFTNTH IYYREGEDSS LDAAGRYAAF VKMDANSMGR LKEAASRTPS ALITFSLAVS TAYELYPALL ADERYREVPI FVIYAGGDDA VLAGNLEALR YAASVATYAE KWGFKTAIGA KIDKPQYPIY FAFADTEERL ERAKGIDRGR SIAVLIAEPV TIYEEAAELE NDLEKIPRYR EDEETRRMGA FERKVYERLF AAYATAAVDG KVDKRVVKRA LAKIAVELVY MLKRREGDKE TTGVLEEVAG PLFASAEGVG SFFADLMAGK GRLDELRRAV LRLYLHHIAL AWAPE
|
| |