Gene RoseRS_1868 details


Gene Information

Locus tag: RoseRS_1868
Symbol:
ID: 5208828
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 2310911
End bp: 2313262
Gene Length: 2352 bp
Protein Length: 783 aa
Translation table: 11
GC content: 61%
IMG OID: 640595476
Product: CRISPR-associated helicase Cas3
Protein accession: YP_001276207
Protein GI: 148656002
COG category: [R] General function prediction only
COG ID: [COG1203] Predicted helicases
TIGRFAM ID: [TIGR01587] CRISPR-associated helicase Cas3
            [TIGR01596] CRISPR-associated endonuclease Cas3-HD
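
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2310911
end_bp = 2313262

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the reported gene length of 2352 bp and protein length of 783 aa.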


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACCCT TACCGTGGCC TGATTGGATG GATCATCTGC TGGCAAAGAG TAAGCGGTAT 
GGCGGTGAGA CGTTAGCCGG GCATACCTGG GATGTGCTCT GTCGGCTCGC CGACCTCTAC
CGCCTGCGCC CGCACCTCAC AGACGATACG CGGCTCTGGC ACTGTCTGTA TTGGTCCGCA
TTTCTGCACG ATTTCGGCAA GGCGGCGCAG GGCTTTCAAC GTCGGTTGCG TGGCGGTCCG
AAATGGGCGC ACCGCCACGA AGTGCTCTCG CTGGCGTTTG TTGATGCGAT TGCCCACGAC
CTGAATGAAG CAGAACAGCG CTGGGTGGTG GCGGCAATTG TCTCACACCA CCGCGATGCA
ACCGAGATCG CCGAACTCTA CCCCTCCGGT CTGCGCAGGG ATCCGCTGAT CGATCTCTGC
AATGAACTTG ATGCCGAAAC AGTTGCCCAT TTGCTGCGCT GGATCGCCGA ATGCGCCAAT
CGGTGGCGCG ACCTGCTGGG GCTGACCGCA GACGTCGGTG CTATCGATCC GCAGCCGTGT
TCGGTGGGCG CGCAACGGGT GCGGTACTGG TTACGGATGT ACCACGACTG GGTTGACACC
CTGGACGGCG AGGGGTCTGA TGCCCGTCGC CCCGGCATTC TGCTACGCGG ACTGATCACC
AGCGCCGATC ATATGGCCTC GGCGCATCTG CGTCGTCTGC CGCCGCCTAT CCGCCTGTCG
TGGCAGGATG TGGCCATGAG GTGCAATCTG TCGCCGGATC AGGTCTACCC GCATCAGCGC
CAGAGCGCCG GGCAGAGCGC ACAATCGGCG CTCCTGATGG TGCCGACGGG TAGCGGCAAG
ACCGAAGCGG CGCTCTCATG GGCGCTCGGC GACGGGCGCA ACCCGCCGGC GCGCATTTTC
TATACCTTGC CGTATCAGGC CAGCATGAAT GCGATGTACG ACCGGTTACG GCAGACCTTT
GGCGATGATG TGGTCGGCCT GCAGCACGGT CGAGCGACAC AGGCGCTCTA CGCCAGGTTT
CGCGAAGGCG ATGAGTGGTC GGCGACTGCG GCACGGCGCG CGCAATGGGA AAAGAATCTC
AATCTGTTGC ACGCCCGTCC GCTGAAGGTG CTCAGCCCCT ACCAGTTGCT CAAAGCGCTC
TTTCAGCTGC GCGGCTTTGA GGCGATGCTG ACCGACTACG CGCATGCGGC GTTTATTTTC
GACGAGATTC ATGCCTACGA ACCGGAGCGC CTGGCGCTCA TAACCGGGCT GATGCGCTAT
CTGCGGGAAC ACTTTGCCGC CCGTTTCTTC GTCATGTCAG CAACGTTCCC TGAACTGATT
CGCAGTCACC TGCGCACCGC CCTGGGCGAT ACGCCCCTGA TCCGCGCAAC TCCAGACATT
TTTGACCGCT TCCGTCGTCA TCAGTTGTGC CTGCGTGATG GGGAACTGAC CGATCCGGCA
ACTATCACCG AGATCGTCGA AGCGGTGCGC AACGGCAAGC AGGTGCTGGT CTGCGCCAAC
ACGGTGGCAC GGGCGCAAAT GGTGCGCGAT CTTCTGACGC ACGCCGGTCT GACCGATGAA
CACCTGATCC TGATCCATAG CCGCTTCACC TACGGCGACC GTAGCCGGCT TGAGCAGGCG
ATAGGCGCGC GCTGCCGCAG CAATAGCGCG AATCGATCAC CGCTGGCGCT GGTTGCCACT
CAGGTGGTTG AAGTCAGCCT CGACATTGAT CTCGATGTGC TCTACAGCGA TCCGGCGCCG
CTCGAAGCGC TCCTGCAACG CTTCGGTCGG GTCAATCGCA AAGCGGCGAA AGGAATCTGC
CCGGTCTACG TCTTCCGCCA GCCTGACGAT GGACAGGGCG TGTACGGGCG CGACCGCGAC
CCGGAACGAT CCGGTCGTAT TGTGCGGGTA ACGCTCGCCG AACTCGAACA GCACGATGGC
GCGATCATCG ACGAAGCGAC CATCAATGAA TGGCTTGACC GGATCTACGC TGATCCCTTG
TTGAATCAGC AATGGCGCGA CGCCTATCAG CGGATGGCGC ACCAGGTCGA CCTGATCATT
CAGGGGATCC GCCCGTTCCA GAGCGACGAA CAGCGTGAAG ACGAATTCGA GCGCATGTTC
GATGGGGTCG ATGTGGTGCC GCAATGCTTT GAACGCGACT ACGTTCATCT GCTCGTCGAA
GAGCGCTTTA TCGAAGCGAA TGACTATCTG GTGAGCATCA GCAAACAACG TTTTGCGATT
CTGCGCAACC AGGGCAAACT GCGACCGGCG GAAGAGACCG ATCAGCGACG TGTCTGGGTT
GCGCAAACGC CCTATGATGC GCGGAATGGA CTCTCGTTCG GTGACCGTGT GGTCGATATG
GACTGGATTT GA
 
Protein sequence
MKPLPWPDWM DHLLAKSKRY GGETLAGHTW DVLCRLADLY RLRPHLTDDT RLWHCLYWSA 
FLHDFGKAAQ GFQRRLRGGP KWAHRHEVLS LAFVDAIAHD LNEAEQRWVV AAIVSHHRDA
TEIAELYPSG LRRDPLIDLC NELDAETVAH LLRWIAECAN RWRDLLGLTA DVGAIDPQPC
SVGAQRVRYW LRMYHDWVDT LDGEGSDARR PGILLRGLIT SADHMASAHL RRLPPPIRLS
WQDVAMRCNL SPDQVYPHQR QSAGQSAQSA LLMVPTGSGK TEAALSWALG DGRNPPARIF
YTLPYQASMN AMYDRLRQTF GDDVVGLQHG RATQALYARF REGDEWSATA ARRAQWEKNL
NLLHARPLKV LSPYQLLKAL FQLRGFEAML TDYAHAAFIF DEIHAYEPER LALITGLMRY
LREHFAARFF VMSATFPELI RSHLRTALGD TPLIRATPDI FDRFRRHQLC LRDGELTDPA
TITEIVEAVR NGKQVLVCAN TVARAQMVRD LLTHAGLTDE HLILIHSRFT YGDRSRLEQA
IGARCRSNSA NRSPLALVAT QVVEVSLDID LDVLYSDPAP LEALLQRFGR VNRKAAKGIC
PVYVFRQPDD GQGVYGRDRD PERSGRIVRV TLAELEQHDG AIIDEATINE WLDRIYADPL
LNQQWRDAYQ RMAHQVDLII QGIRPFQSDE QREDEFERMF DGVDVVPQCF ERDYVHLLVE
ERFIEANDYL VSISKQRFAI LRNQGKLRPA EETDQRRVWV AQTPYDARNG LSFGDRVVDM
DWI
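
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked against the protein sequence and the GC content field. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq_text.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above, as a short demonstration.
fragment = clean("ATGAAACCCT TACCGTGGCC TGATTGGATG")
print(translate(fragment))  # MKPLPWPDWM, the first ten residues of the protein
```

Pasting the full gene sequence into `clean` and comparing `translate(...)` with the cleaned protein sequence gives a quick consistency check, and `gc_fraction(...)` can be compared with the reported GC content.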