Gene Information |
Locus tag | Rcas_1977 |
Symbol | |
ID | 5539455 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2532265 |
End bp | 2534517 |
Gene Length | 2253 bp |
Protein Length | 750 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640894112 |
Product | CRISPR-associated Csm1 family protein |
Protein accession | YP_001432083 |
Protein GI | 156741954 |
COG category | [R] General function prediction only |
COG ID | [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) |
TIGRFAM ID | [TIGR02578] CRISPR-associated protein, Csm1 family |
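The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene span includes the stop codon. The record does not state either convention, but both match its numbers. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record for Rcas_1977.
length_bp = gene_length(2532265, 2534517)
length_aa = protein_length(length_bp)
print(length_bp, "bp ->", length_aa, "aa")  # 2253 bp -> 750 aa
```

The result agrees with the Gene Length (2253 bp) and Protein Length (750 aa) fields.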
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.757339 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.306221 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGAGGTG GTATGCACGA ACTTGCGCAT CGCGCAGCGC TCGAGGCGCT GCGCTTCTGG GTGGCGGCGG CGAACAGGGA ACATGCGCCC GCAGCGCCGG ACGCTGCGTG TGATGTGGTG GATCGCGCTG CGCGGTTCGT CTGTGGCGGC GATGCTCCGC CGTGCCTTGA TCTGACACGA CCGCTCGCAA CGGTCTTTGG GAGTCTTCAA GGGGCGTCGG CGGCATATGT GCGCCGCGCG CCGCTGGCGC TGACCGATGA CATCCTCTTC CCGTCGACGG ACGCGCGCGT TGACACGGCA GCCACCGACC GGCTGCGCGC GGCGTTGCGC GCAGCCGACA ATCAGAGCGC GCACCTGCCG CTCGCGCCAC GCATCGAGGC GCTCTTCTTC GCGTTTCAGC GGTTCGCCTG GTGTCTCCCT TCGCCCCTTG CCGGCGTGTC GCTGTACGAC GCGGCGCGTA TCCACGCAGC CGTCGCGGCG GCGCTGACCG CCGATTCCGG TCTGCTCCTC GTTGGCGGTG ATGTGTCCGG GGTGCAGGAG TTTATCTACT CGATCAGCGC TGCCGGTGCG ACCCGCCAGC TGCGCGGGCG GTCATTCTAT CTCCAATTGC TGACCGAGGC ATGCGCGCAC TATGTGTTGC ACCAGGCGGG GATGCCGTTC TGTAATCTGT TGTACGCCGG CGGCGGTCGC TTTTATGTGC TGCTTCCTGC GTCGTTCGAG TCGCGTCTTG GCGAATGGCG GCGCGCCATC GGCAGACTGC TCCTTGATGC GCACGGCGGC GCGCTCTACC TGGCAATCGG CGGAGTGCGC TTCGCGCCGC ATGACTATAG CGAATCCACC TGGCAGGAGT TGACGCGCCG AATCGACGAC GTGAAACGTC GCCGTTTTGC CGACCTCGAT GATGCGACGT TCGCCCGGTT GTTCGAGCCG ACGCAGCCGG AGCCGCCGCC GGACGTTGAA GAGCAATCCG CCGAACCGCT CGACGCCATG GGCGAGTCGC TCGCAGACCT GGGGCGGCAA CTGGCGCGTG TGGCGCTGCT GTCGGTCGAG CCGGTGGAAC CGCGCGCCGT CTCCTTTGAT CGCGGCAAAG CGGGCTGGAA CGACGTGCTG CGCGTGCTCG GCTTGAACGT CGAGCTGCTC GACCATCTGT GTGCTTATCG CGTTGACCAC TCCCGGCGTC GGCGGGTGCT GCTGATCGAT GACCGTGATC CGCAGTCTAT CGCCCTCGGA CCACAGGACG TCGTCGGGAC GCGCTACACG GTCGCAGTGG CGCAACTGGC GACGGATCGC GATGTTGCGC AGTATCAGGC GCTCGAACGC GATGCTGGCG ATGAGCAGAC GTTGCGCGTC GGCGATGTCA AGCCGTTCAA TCTGCTGGCG GAACAGAGCA TCGGCGCGCG GCGGATGGGC GTGCTGCGCA TGGATGTGGA CGATCTCGGC GATCTGTTTG GGCGACGCCT GAGTCGCCCG TCGGGTCTGG CGGGTCTGGC TGTGACGGCA GCGCTCAGCA CAACGTTGAG CCGCTACTTC GAGGGATGGG TCGGTGAACT GTGTCGCCGC GCCAATGATG ATGGCGGCGC CGGCGGGGTA TATGCCGTCT ACAGCGGCGG CGACGATCTC TTCCTGGTGG GATCGTGGCA TCGGATGCCA CGCCTGGCGC AGCAGATTCG CAACGATTTT GCGCGCTACG TCTTGGGGCG CGCGCCCAAT GCCGGCGAGA CGCTGCCGAT CACGCTGTCG GGCGGGATCA CGCTGCACGC GGCGCGCTAT CCGCTCTACC AGGCTGCCGA TGATGCTGCT 
GAAGCGCTCG ATGCCGCGAA ACGCCATGCG CGTCCCGACC GGCACGCCAA GGATGCGGTG ACTTTCCTGG GACGCACTCT GGGCTGGGAG CATTTCGGCG AAGCGGCGGA CCTGTGCGCT GCCCTCGTCG ATCTGGTGCA GGCGCAGGGT GTGCCGCGCA GCCTGCTGAT GGTCATTCAA ACGCTCGACG CGCGCTTTCG GCAGGAGCAG CGCCGCAACC GCAGTGGCGC CGCGCAGTTC GCCTATGGTC CGTGGGTATG GCAGGGGGCA TACCAGTTGA CGCGCGTTGC CGAACGATCA CCAAACGGGG TCAAAGCGCA GATTGAGCGC CTGCGCGACC GGATCGTCGG CAATGAAGGC GTGCCGCAAC GGTTCATCGA GCGGGCGGGA CTCGCTGCGC GCTGGGCACA GTTACTGGTT CGTGAACGCA GCAATGCAAA GGAGGAACGA TGA
|
Protein sequence | MGGGMHELAH RAALEALRFW VAAANREHAP AAPDAACDVV DRAARFVCGG DAPPCLDLTR PLATVFGSLQ GASAAYVRRA PLALTDDILF PSTDARVDTA ATDRLRAALR AADNQSAHLP LAPRIEALFF AFQRFAWCLP SPLAGVSLYD AARIHAAVAA ALTADSGLLL VGGDVSGVQE FIYSISAAGA TRQLRGRSFY LQLLTEACAH YVLHQAGMPF CNLLYAGGGR FYVLLPASFE SRLGEWRRAI GRLLLDAHGG ALYLAIGGVR FAPHDYSEST WQELTRRIDD VKRRRFADLD DATFARLFEP TQPEPPPDVE EQSAEPLDAM GESLADLGRQ LARVALLSVE PVEPRAVSFD RGKAGWNDVL RVLGLNVELL DHLCAYRVDH SRRRRVLLID DRDPQSIALG PQDVVGTRYT VAVAQLATDR DVAQYQALER DAGDEQTLRV GDVKPFNLLA EQSIGARRMG VLRMDVDDLG DLFGRRLSRP SGLAGLAVTA ALSTTLSRYF EGWVGELCRR ANDDGGAGGV YAVYSGGDDL FLVGSWHRMP RLAQQIRNDF ARYVLGRAPN AGETLPITLS GGITLHAARY PLYQAADDAA EALDAAKRHA RPDRHAKDAV TFLGRTLGWE HFGEAADLCA ALVDLVQAQG VPRSLLMVIQ TLDARFRQEQ RRNRSGAAQF AYGPWVWQGA YQLTRVAERS PNGVKAQIER LRDRIVGNEG VPQRFIERAG LAARWAQLLV RERSNAKEER
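The gene sequence begins with GTG, while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted alternative start codons and is read as methionine when it initiates translation. The following is a minimal sketch of that rule, not the pipeline that produced this record. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical, and the function assumes an unspliced CDS on the + strand, as stated in the Gene Information table.

```python
# Amino-acid assignments of NCBI translation table 11, with codons ordered
# by base 1, base 2, base 3 over the alphabet T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons allowed by table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is always methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above, spacing kept as displayed.
print(translate_cds("GTGGGAGGTG GTATGCACGA ACTTGCGCAT"))  # MGGGMHELAH
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence, with its display spaces and line break left in, should reproduce the whole 750-residue protein if the record is internally consistent.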