Gene Tneu_1152 details


Gene Information

Locus tag: Tneu_1152
Symbol:
ID: 6166036
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 1043300
End bp: 1045567
Gene Length: 2268 bp
Protein Length: 755 aa
Translation table: 11
GC content: 66%
IMG OID: 641668303
Product: CRISPR-associated Csm1 family protein
Protein accession: YP_001794528
Protein GI: 171185609
COG category: [R] General function prediction only
COG ID: [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs)
TIGRFAM ID: [TIGR02578] CRISPR-associated protein, Csm1 family
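
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the record does not state either explicitly); the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1043300
end_bp = 1045567

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.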


Plasmid Coverage information

Number of covering plasmid clones:
Plasmid unclonability p-value: 0.00402598
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Number of covering fosmid clones:
Fosmid unclonability p-value: 0.000176122
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGTGGCT ATAGGGAGTA CGTAGTTGCG GCCCTCCTCC ACGACGTTGG GAAGCTCATA 
AGGAGGGCGA AGCTGTGCCG TGGGGAGCCG GCTAGGCGCC ACGTGGAGGA GAGCGCGGAT
TTCGTAGACG TCGTCGCGCC GGCGCTTAAG GCCGCTGGGG TTGATCCGCA GGCGGTGAGG
GAGCTGGTTC TGAGGCACCA CGAGGGGGGC TGGGGGGTGG GGCCCTACGA CAGAGCCGCG
GCTCTGGAGA GGGCGCCGGG CGACGAGGAG TCGGGCCAGG GCCTGGCCAT GCCGGGGCGG
CGCGAGCACG AGATACCTCT GAGGTTGCCC ACGGGGGTGT ACGTCCCGCC GTGTCCAACG
CCGCAGAGCC TCAGCGAGAG GCTCATCCCG TCTACGGAGC CGCCGAGCCC GGAGGAGGTG
TGTAGGTGCT ACCGGAGGGC CTACGAGGAG CTGATGAGGC TGGCGGCGAA GGCGGCTCAG
AGGAGGATGG GCTTCAAGGA GCTCGTGGAG ACGCTGGTAA ACGTGCTGAA GGCGACGGCG
TCTTTTGTTC CAGCGGCTGT ATACGGCGTG AGGGAGCCAG ACACATCCCT CTACGCCCAC
TCCCTCCTGG CCGCCGCCCT CGCCTCTACG GGGGGCGAGT TCTACCTGGT GTCTATAGAC
GTGGGGAGGA TCCAGGAGTA CATCTCGAGG GCCGGCGCCA CCAAGGCCGC CATGGCCATC
CTCAGGGGGC GCTCCCTCCG GATAAACGCC CTCCAGAGGG CCGCCGTAAG GTGGCTCATA
GACAGGGTGG AGACGGCAAC ATACGCCAAC GTCCTCCTGG ACACCGGCGG GGAGGCCCTC
CTCCTCCTGC CCAAGTTCGA CCTGGCGCTC CTCGACCAGC TGGAGAGTAG AGTGCTCCGG
GAGACGGAGG GGGCCCTCGC CCTGTATGCC GCCGCGGCTG GGCCCTACAG GCTGGAGGAC
GTGGCAAGGT TCAAAGACCT CATGAGGGAG CTCTCCCAGA GGGTCACAGA GCGTAAGTTC
ATCTACCGCG ACTACGGAGC CCCGGCGGGG CCCGCCGCGA AGTGCCAGTT CTGCGGCCGG
CTGTCCGCCA AGGTGACGCC GGAGAGGCTG AAAAGCGGGG AGGTCGTCGA CCTTTGCCAC
CTCTGTAGAG ACGAGCTACA CATCGGCCGG GCCGCCCGCA ACCTGAAATA CATCGCCTTT
CTCCCCAGGG GGGCACTGCC GCCCGGCTTG GCGGGGGCCG AGCGCGGGGA TGACACGGCT
GTGGTGAACA TTCTGGACTA CGCCGTGGTC TTCAGCGGCC AGATGTCCAA GATGCCCCCC
GCCTCCCACG CCGTCTACGC CACCAACAGG AGGGATTTCA TCCTCGACGC CGACGGGCCG
GCGTACGGCA TGTGGTTCAC CAACACCCAC ATCTACTACA GGGAGGGCGA GGACTCCTCC
CTCGACGCGG CCGGGAGGTA CGCCGCCTTC GTCAAGATGG ACGCCAACAG TATGGGGAGG
CTGAAGGAGG CCGCCTCGCG GACCCCCTCG GCGCTGATCA CCTTCTCCCT CGCCGTCTCC
ACCGCCTACG AGCTCTACCC AGCCCTCCTT GCGGACGAGA GGTACCGCGA GGTGCCCATC
TTTGTGATCT ACGCGGGCGG CGACGATGCG GTGCTGGCCG GCAACCTCGA GGCGCTTCGG
TACGCCGCCA GCGTGGCGAC CTACGCCGAG AAGTGGGGCT TCAAGACGGC GATTGGCGCC
AAGATAGACA AGCCTCAGTA CCCCATCTAC TTCGCCTTCG CAGACACCGA GGAAAGGCTA
GAGAGGGCGA AGGGGATAGA CAGGGGGCGG AGCATCGCCG TGTTGATAGC GGAGCCCGTC
ACGATATACG AAGAGGCCGC GGAGCTCGAG AATGACTTGG AAAAAATCCC CAGATACAGG
GAGGACGAGG AGACACGGCG CATGGGGGCC TTCGAGCGGA AGGTGTACGA GAGGCTCTTC
GCCGCGTACG CCACCGCTGC CGTGGACGGC AAAGTGGACA AGAGGGTGGT CAAGAGGGCG
CTGGCGAAGA TCGCCGTGGA GCTGGTCTAC ATGCTCAAGA GGCGCGAGGG GGATAAGGAG
ACCACGGGGG TGCTTGAGGA GGTGGCTGGG CCTCTGTTCG CCAGCGCGGA GGGGGTTGGG
TCCTTCTTCG CCGATCTAAT GGCGGGGAAA GGGCGTTTGG ACGAGCTGAG GCGCGCTGTG
CTCCGGCTGT ATCTCCACCA CATCGCGCTG GCGTGGGCTC CCGAATGA
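
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any per-base statistic such as GC content is computed. The sketch below shows one way this might be done; `gc_fraction` is a hypothetical helper, not part of any database tool, and it is applied here only to the first row of the sequence, so its result will differ from the whole-gene GC content field.

```python
def gc_fraction(blocked_sequence: str) -> float:
    """Return the fraction of G and C bases in a whitespace-blocked DNA string."""
    # Remove the spaces and line breaks used for display.
    seq = "".join(blocked_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGAGTGGCT ATAGGGAGTA CGTAGTTGCG GCCCTCCTCC ACGACGTTGG GAAGCTCATA"
print(f"GC fraction of first row: {gc_fraction(first_row):.2f}")
```

Passing the complete gene sequence to the same function would give the whole-gene value to compare against the GC content field.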
 
Protein sequence
MSGYREYVVA ALLHDVGKLI RRAKLCRGEP ARRHVEESAD FVDVVAPALK AAGVDPQAVR 
ELVLRHHEGG WGVGPYDRAA ALERAPGDEE SGQGLAMPGR REHEIPLRLP TGVYVPPCPT
PQSLSERLIP STEPPSPEEV CRCYRRAYEE LMRLAAKAAQ RRMGFKELVE TLVNVLKATA
SFVPAAVYGV REPDTSLYAH SLLAAALAST GGEFYLVSID VGRIQEYISR AGATKAAMAI
LRGRSLRINA LQRAAVRWLI DRVETATYAN VLLDTGGEAL LLLPKFDLAL LDQLESRVLR
ETEGALALYA AAAGPYRLED VARFKDLMRE LSQRVTERKF IYRDYGAPAG PAAKCQFCGR
LSAKVTPERL KSGEVVDLCH LCRDELHIGR AARNLKYIAF LPRGALPPGL AGAERGDDTA
VVNILDYAVV FSGQMSKMPP ASHAVYATNR RDFILDADGP AYGMWFTNTH IYYREGEDSS
LDAAGRYAAF VKMDANSMGR LKEAASRTPS ALITFSLAVS TAYELYPALL ADERYREVPI
FVIYAGGDDA VLAGNLEALR YAASVATYAE KWGFKTAIGA KIDKPQYPIY FAFADTEERL
ERAKGIDRGR SIAVLIAEPV TIYEEAAELE NDLEKIPRYR EDEETRRMGA FERKVYERLF
AAYATAAVDG KVDKRVVKRA LAKIAVELVY MLKRREGDKE TTGVLEEVAG PLFASAEGVG
SFFADLMAGK GRLDELRRAV LRLYLHHIAL AWAPE
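
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last few codons only. It assumes the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard genetic code (table 11 differs in the permitted start codons); `translate` and `CODON_TABLE` are hypothetical names chosen for this illustration.

```python
from itertools import product

# Codon table built in the conventional TCAG ordering; the amino acid
# string is the one used for the standard code and for table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a whitespace-blocked DNA string, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence and the final three codons, from the record.
gene_start = "ATGAGTGGCT ATAGGGAGTA CGTAGTTGCG GCCCTCCTCC ACGACGTTGG GAAGCTCATA"
gene_end = "CCCGAATGA"
print(translate(gene_start))
print(translate(gene_end))
```

The first call reproduces the opening residues of the protein sequence, and the second shows the final residues followed by the TGA stop codon, which is not emitted.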