Gene Rcas_1977 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1977 
Symbol 
ID5539455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2532265 
End bp2534517 
Gene Length2253 bp 
Protein Length750 aa 
Translation table11 
GC content66% 
IMG OID640894112 
ProductCRISPR-associated Csm1 family protein 
Protein accessionYP_001432083 
Protein GI156741954 
COG category[R] General function prediction only 
COG ID[COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) 
TIGRFAM ID[TIGR02578] CRISPR-associated protein, Csm1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.757339 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.306221 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGAGGTG GTATGCACGA ACTTGCGCAT CGCGCAGCGC TCGAGGCGCT GCGCTTCTGG 
GTGGCGGCGG CGAACAGGGA ACATGCGCCC GCAGCGCCGG ACGCTGCGTG TGATGTGGTG
GATCGCGCTG CGCGGTTCGT CTGTGGCGGC GATGCTCCGC CGTGCCTTGA TCTGACACGA
CCGCTCGCAA CGGTCTTTGG GAGTCTTCAA GGGGCGTCGG CGGCATATGT GCGCCGCGCG
CCGCTGGCGC TGACCGATGA CATCCTCTTC CCGTCGACGG ACGCGCGCGT TGACACGGCA
GCCACCGACC GGCTGCGCGC GGCGTTGCGC GCAGCCGACA ATCAGAGCGC GCACCTGCCG
CTCGCGCCAC GCATCGAGGC GCTCTTCTTC GCGTTTCAGC GGTTCGCCTG GTGTCTCCCT
TCGCCCCTTG CCGGCGTGTC GCTGTACGAC GCGGCGCGTA TCCACGCAGC CGTCGCGGCG
GCGCTGACCG CCGATTCCGG TCTGCTCCTC GTTGGCGGTG ATGTGTCCGG GGTGCAGGAG
TTTATCTACT CGATCAGCGC TGCCGGTGCG ACCCGCCAGC TGCGCGGGCG GTCATTCTAT
CTCCAATTGC TGACCGAGGC ATGCGCGCAC TATGTGTTGC ACCAGGCGGG GATGCCGTTC
TGTAATCTGT TGTACGCCGG CGGCGGTCGC TTTTATGTGC TGCTTCCTGC GTCGTTCGAG
TCGCGTCTTG GCGAATGGCG GCGCGCCATC GGCAGACTGC TCCTTGATGC GCACGGCGGC
GCGCTCTACC TGGCAATCGG CGGAGTGCGC TTCGCGCCGC ATGACTATAG CGAATCCACC
TGGCAGGAGT TGACGCGCCG AATCGACGAC GTGAAACGTC GCCGTTTTGC CGACCTCGAT
GATGCGACGT TCGCCCGGTT GTTCGAGCCG ACGCAGCCGG AGCCGCCGCC GGACGTTGAA
GAGCAATCCG CCGAACCGCT CGACGCCATG GGCGAGTCGC TCGCAGACCT GGGGCGGCAA
CTGGCGCGTG TGGCGCTGCT GTCGGTCGAG CCGGTGGAAC CGCGCGCCGT CTCCTTTGAT
CGCGGCAAAG CGGGCTGGAA CGACGTGCTG CGCGTGCTCG GCTTGAACGT CGAGCTGCTC
GACCATCTGT GTGCTTATCG CGTTGACCAC TCCCGGCGTC GGCGGGTGCT GCTGATCGAT
GACCGTGATC CGCAGTCTAT CGCCCTCGGA CCACAGGACG TCGTCGGGAC GCGCTACACG
GTCGCAGTGG CGCAACTGGC GACGGATCGC GATGTTGCGC AGTATCAGGC GCTCGAACGC
GATGCTGGCG ATGAGCAGAC GTTGCGCGTC GGCGATGTCA AGCCGTTCAA TCTGCTGGCG
GAACAGAGCA TCGGCGCGCG GCGGATGGGC GTGCTGCGCA TGGATGTGGA CGATCTCGGC
GATCTGTTTG GGCGACGCCT GAGTCGCCCG TCGGGTCTGG CGGGTCTGGC TGTGACGGCA
GCGCTCAGCA CAACGTTGAG CCGCTACTTC GAGGGATGGG TCGGTGAACT GTGTCGCCGC
GCCAATGATG ATGGCGGCGC CGGCGGGGTA TATGCCGTCT ACAGCGGCGG CGACGATCTC
TTCCTGGTGG GATCGTGGCA TCGGATGCCA CGCCTGGCGC AGCAGATTCG CAACGATTTT
GCGCGCTACG TCTTGGGGCG CGCGCCCAAT GCCGGCGAGA CGCTGCCGAT CACGCTGTCG
GGCGGGATCA CGCTGCACGC GGCGCGCTAT CCGCTCTACC AGGCTGCCGA TGATGCTGCT
GAAGCGCTCG ATGCCGCGAA ACGCCATGCG CGTCCCGACC GGCACGCCAA GGATGCGGTG
ACTTTCCTGG GACGCACTCT GGGCTGGGAG CATTTCGGCG AAGCGGCGGA CCTGTGCGCT
GCCCTCGTCG ATCTGGTGCA GGCGCAGGGT GTGCCGCGCA GCCTGCTGAT GGTCATTCAA
ACGCTCGACG CGCGCTTTCG GCAGGAGCAG CGCCGCAACC GCAGTGGCGC CGCGCAGTTC
GCCTATGGTC CGTGGGTATG GCAGGGGGCA TACCAGTTGA CGCGCGTTGC CGAACGATCA
CCAAACGGGG TCAAAGCGCA GATTGAGCGC CTGCGCGACC GGATCGTCGG CAATGAAGGC
GTGCCGCAAC GGTTCATCGA GCGGGCGGGA CTCGCTGCGC GCTGGGCACA GTTACTGGTT
CGTGAACGCA GCAATGCAAA GGAGGAACGA TGA
 
Protein sequence
MGGGMHELAH RAALEALRFW VAAANREHAP AAPDAACDVV DRAARFVCGG DAPPCLDLTR 
PLATVFGSLQ GASAAYVRRA PLALTDDILF PSTDARVDTA ATDRLRAALR AADNQSAHLP
LAPRIEALFF AFQRFAWCLP SPLAGVSLYD AARIHAAVAA ALTADSGLLL VGGDVSGVQE
FIYSISAAGA TRQLRGRSFY LQLLTEACAH YVLHQAGMPF CNLLYAGGGR FYVLLPASFE
SRLGEWRRAI GRLLLDAHGG ALYLAIGGVR FAPHDYSEST WQELTRRIDD VKRRRFADLD
DATFARLFEP TQPEPPPDVE EQSAEPLDAM GESLADLGRQ LARVALLSVE PVEPRAVSFD
RGKAGWNDVL RVLGLNVELL DHLCAYRVDH SRRRRVLLID DRDPQSIALG PQDVVGTRYT
VAVAQLATDR DVAQYQALER DAGDEQTLRV GDVKPFNLLA EQSIGARRMG VLRMDVDDLG
DLFGRRLSRP SGLAGLAVTA ALSTTLSRYF EGWVGELCRR ANDDGGAGGV YAVYSGGDDL
FLVGSWHRMP RLAQQIRNDF ARYVLGRAPN AGETLPITLS GGITLHAARY PLYQAADDAA
EALDAAKRHA RPDRHAKDAV TFLGRTLGWE HFGEAADLCA ALVDLVQAQG VPRSLLMVIQ
TLDARFRQEQ RRNRSGAAQF AYGPWVWQGA YQLTRVAERS PNGVKAQIER LRDRIVGNEG
VPQRFIERAG LAARWAQLLV RERSNAKEER