Gene Rcas_3301 details

Gene Information

Locus tag: Rcas_3301
Symbol:
ID: 5540799
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4291110
End bp: 4293449
Gene Length: 2340 bp
Protein Length: 779 aa
Translation table: 11
GC content: 59%
IMG OID: 640895419
Product: CRISPR-associated helicase Cas3
Protein accession: YP_001433370
Protein GI: 156743241
COG category: [R] General function prediction only
COG ID: [COG1203] Predicted helicases
TIGRFAM ID: [TIGR01587] CRISPR-associated helicase Cas3; [TIGR01596] CRISPR-associated endonuclease Cas3-HD
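
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive (an assumption that matches the listed values but is not stated in the record) and that the CDS ends with a stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4291110, 4293449)
print(length)                  # 2340
print(protein_length(length))  # 779
```

Both results agree with the Gene Length (2340 bp) and Protein Length (779 aa) fields of the record.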


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.733843
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCCCT CGCCATGGCC TGATTGGATG GACCATATCC TGGCGAAGAG TCAGCAGTAC 
GGCGGTGAGA CGCTGGCAGC ACACACGTGG GATGTGCTGG TCAAATTGTC CGATCTGTAC
CGACTGCGCC CACGCCTCGA CAATGGTTCG CAACTCTGGC ACTGCCTCTA CTGGGCGGCG
TTTCTGCACG ATTTTGGCAA AGCAGCGCGC GGATTCCAGC AGCGGCTGCG TGGTGGACAA
GCCTGGTCCC ATCGCCACGA GGTGCTGTCG CTGGCGTTTG TGGATACCAT CGCCGACGGC
TTTACCCCGG AGGAGCAACG CTGGTTGGTC GCGGCAATCG TCTCCCATCA CCGCGATGAG
CCGGAGATTG CGGAGACCTA TCCACCGGGT CTGCGCCGCG ATCCGCTGGT TGATCTGTGC
AACGAACTCG AACCACAAGT GATCGATCAT CTTCAACGCT GGCTGACGGA ATGCGCCAAC
CCCTGGCGTG AGGCGTTGGG GTTAGCTGAG GTGGTGGGGA AGATTACACC GCAATCGGTG
ACGGTCAACC CCAAACGGGT ACGCTACTGG TTGCAGGTCT ATCACGATTG GGTTGCGGCG
AACGATCCTG CAGCGCGGGT ACCCGGTATC TTGCTGCGTG GCTTGATCAC CACCGCCGAT
CATATGGCTT CAGCGCATCT GCATCGTGTG CCACCACCGA TCACCGAACC GTGGTCGGCA
TTGGCCAAAC GTCTCTTGTC GCAGGGACAG CAGGTCTACG ACCACCAACA GCAAAGTGGC
GAGATGAAAG GACAATCGGC GCTGCTCATG GCCCCAACCG GCAGTGGCAA AACCGAAGCC
GCACTCTATT GGGCACTCGG TGATGGCGCC GCACCTCCGG CGCGCATCTT TTACACTCTG
CCCTATCAGG CGAGCATGAA TGCAATGTAC GACCGATTGC GGCAGAACTT TGGCGATGAG
CTGGTCGGAT TGCAGCATGG CCGGGCAGCG CAGGCGCTCT ACGCCCGCTT CCGCGAGGGT
GAGGAATGGT CGGCAGCGGC GCGGCGCGTG CAGTGGGAGA AAAACCTGAA CATCCTGCAT
GCTCGCCCGC TCAAGGTGCT CAGTCCATAC CAACTCCTCA AAGCCCTCTT CCAACTGCGT
GGGTTTGAAG CCATGCTCAC CGATTATGCG CAGGCGGCAT TTGTGTTTGA TGAGATTCAC
GCTTACGAAC CGGAACGTCT AGCCCTGATT ACCGGTCTGA TGCGGTATCT ACGCGAGCAG
TTCGCTGCAC GTTTCTTTGT GATGTCGGCT ACATTTCCCC AGCTTATTCG CAAACAGTTA
ACGGTGGCAC TTGGTGACGT TCCGGTGATC CAGGCTTGCC CGGAGATCTT TACCCACTTT
CGTCGTCACC AACTATTCCT ATGCGACGGC GAATTGATCG ACCCGACTAC CATCGCCGCG
ATTGTTGCTG AAGTGCAGGC GGGCCGGCAA GTGTTGGTCT GCGCGAATAC GGTGGCGCGA
GCGCAAGCGC TGCGCGATCA GCTCGCGCAA GCCGGTCTCA CCGATGATCA ACTACTCTTG
ATCCATAGCC GCTTCACCTA TGGCGACCGC AGCCGCATCG AGCAAGACAT CCGCGCGCGC
TGCGGTAGTA ATGTCACGCC ACGACCGCCG CTGGCGCTCG TGGCGACGCA GGTGGTTGAG
GTGAGTCTCG ACATCGATCT CGACACGATC TACACCGATC CGGCGCCGCT TGAAGCGCTT
CTGCAACGCT TCGGGCGGGT CAACCGCAAA GGCGCAAAAG GCATCTGCCC GGTCTACGTT
TTTCGCCAGC CCACCGACGG GCAGGGCGTC TACGGGCGCG ACCGCGACCC GCAACAGGCG
GGCCACATCG TGCGCGTTAC GCTTGCCGAA CTGGAACAGC ATAACCGTGA AATCATCGAC
GAAGCGGCAA TCAACCAGTG GCTCGACAAC ATCTATGCCG ATCCCGTGCT CAGCCAGCAA
TGGACGGAAG CCTACCAACG GATGGCGCAG CAGGTCGAGC TGATCATCAA CGGGTTACGC
CCATTCCAGA GCGACGAACA GCGTGAAGAT GATTTCGAGC AAATGTTTGA TGGGGTTGAG
GTCGTGCCAC AGTGTTTTGA ACAGGCATAC GTTGATTGTC TGGTTCAGGA ACGGTTTATT
GAAGCGAATG ACTACCTGGT CAGCATCAGC AAACAGCGTT TTGCCATCCT GCGCAATCAG
GGCAAAATCA GACCGGCAGA AGAAGTCGGG CAGCGACGAG TGTGGGTAGC GCTCCTTCCC
TACGACTCGC GCAATGGTTT GTCATTTGGC GACACGACGT ATGATCCAGA CTGGAGTTGA
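
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch, using only the first line of the sequence as sample input; the helper names `join_blocks` and `gc_fraction` are hypothetical and not part of any database tool.

```python
def join_blocks(text: str) -> str:
    """Strip all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as displayed in the record.
first_line = "ATGAAGCCCT CGCCATGGCC TGATTGGATG GACCATATCC TGGCGAAGAG TCAGCAGTAC"
seq = join_blocks(first_line)
print(len(seq))   # 60
print(seq[:3])    # ATG
print(round(gc_fraction(seq), 2))
```

Applying `join_blocks` to the whole sequence and then `gc_fraction` gives a value that can be compared with the GC content field of the record.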
 
Protein sequence
MKPSPWPDWM DHILAKSQQY GGETLAAHTW DVLVKLSDLY RLRPRLDNGS QLWHCLYWAA 
FLHDFGKAAR GFQQRLRGGQ AWSHRHEVLS LAFVDTIADG FTPEEQRWLV AAIVSHHRDE
PEIAETYPPG LRRDPLVDLC NELEPQVIDH LQRWLTECAN PWREALGLAE VVGKITPQSV
TVNPKRVRYW LQVYHDWVAA NDPAARVPGI LLRGLITTAD HMASAHLHRV PPPITEPWSA
LAKRLLSQGQ QVYDHQQQSG EMKGQSALLM APTGSGKTEA ALYWALGDGA APPARIFYTL
PYQASMNAMY DRLRQNFGDE LVGLQHGRAA QALYARFREG EEWSAAARRV QWEKNLNILH
ARPLKVLSPY QLLKALFQLR GFEAMLTDYA QAAFVFDEIH AYEPERLALI TGLMRYLREQ
FAARFFVMSA TFPQLIRKQL TVALGDVPVI QACPEIFTHF RRHQLFLCDG ELIDPTTIAA
IVAEVQAGRQ VLVCANTVAR AQALRDQLAQ AGLTDDQLLL IHSRFTYGDR SRIEQDIRAR
CGSNVTPRPP LALVATQVVE VSLDIDLDTI YTDPAPLEAL LQRFGRVNRK GAKGICPVYV
FRQPTDGQGV YGRDRDPQQA GHIVRVTLAE LEQHNREIID EAAINQWLDN IYADPVLSQQ
WTEAYQRMAQ QVELIINGLR PFQSDEQRED DFEQMFDGVE VVPQCFEQAY VDCLVQERFI
EANDYLVSIS KQRFAILRNQ GKIRPAEEVG QRRVWVALLP YDSRNGLSFG DTTYDPDWS