Gene RoseRS_4335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4335 
Symbol 
ID5211319 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5448155 
End bp5450869 
Gene Length2715 bp 
Protein Length904 aa 
Translation table11 
GC content50% 
IMG OID640597919 
ProductCRISPR-associated helicase Cas3 
Protein accessionYP_001278623 
Protein GI148658418 
COG category[R] General function prediction only 
COG ID[COG1203] Predicted helicases 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG
[TIGR01587] CRISPR-associated helicase Cas3
[TIGR01596] CRISPR-associated endonuclease Cas3-HD 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0995159 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00193959 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGCTACGGT TCAGCCCGCA TGTGGCGCGA CGGGTGCAGG AGACGCGCTG GCACCGCTCC 
GAGCGCACCG AGCCGCAACC CGACGGATCG CTGGTGTGGC GAGCACAGAT CGCTGAGCCG
AAAGAGATGC TGCCCTGGAT CCGCGGCTGG GGCGCCGATG TGGAAGTGCT GGGACCGGAG
GGGCTGCGCG GGACGCTGGA AGCGGAGGTG CGAAAGATGG CGAGGGTGTA TGGCGTGAGT
GAGATGAAGT TGCAGGATGC TCTGATTGCG CATCGACGCG AACGCGATGG TAAAGAACAA
TCTTTAGTTG ATCATCTGAA AGGAACCGCA GAACTGGCGA AGCGTTTTGC CGGAAAAATA
GGATTGCCAG AACTTGGAGA AGTCATGGGA CTGGCACACG ACTTCGGAAA AGCCAGCAAG
GAGTTTCAGG ATTACCTGAA GTCTGCTACA GGGTTGAAAA ATCCAGATGA AGATGACTAT
ATAGATTATG AGTCGCGGAA GGGAAAAATA GATCACTCTA CAGCAGGGGC ACAGCTTGTA
TATGAGAAAT GCAGCCATTT AGGCAAGGAA GGCGAGTTCC TTGCGCAATT TCTGGCTCTG
GCAATAGCTT CGCACCACTC CGGACTGATC GATTGTCTGA CACCCACAGG TGAAGATAAC
TTCAGTCGAC GTATTACTAA AGATGATAGT GTAACACATG TATCTGAAGC CAGGGGTAAG
TTACCTGAGA TCGAGGATGG TCTCAATAAA ACTCTTACGC TGGATATATT AAAACAATTC
ATGCAAAAAC TTCAAAGTCT GAGAGAACAG AACGACTCAA AGGAGACTAT TGCTTTCAAA
CTCGGTATTC TGGCTCGTTT CCTGCTGAGT TGTCTTCTGG ATGCAGATCG CCTGAACACT
GCTGATTTCG AGTTTCCTGA AAACAGTGTA ATCCGTAGTT ATGGCAACTA TGTATCCTGG
GATATGCTGA TTGAGCGACT GGAGAAAACA TTTAATGAAT TTGCACAGGC AGTCGCACAA
ACGAAGGAAG GTAGCCAGGC GCGGGAAGTA TATCAACTGC GATCACAGGT GGCGCGGGCA
TGCCGTGAAG CAGCTACGAA ACCAAAGGGC ATTTACCAAC TGACTGTACC GACTGGCGGT
GGCAAGACTC TGGCAAGTCT ACGCTTTGCG CTACACCATG CAAAGCATCA CGGGATGGAC
AGGGTTTTTT ACGTGGCGCC ATACATTACG ATTATTGACC AGAATGCTGA CACCCTGCGA
ACAATTCTTG AGCCTGAAAG TGAACGCGGT CGTGTCGTAC TCGAACACCA CTCCAATTTT
GTTCCAGAAG AAGATACGCA TCGTCGTCAT AGCCTGCTGG CGGAAAACTG GGATGCGCCG
ATTGTATGCA CCACGCAGGT CCAATTCTTA GAGGCCTTGT TCGGTGCAGG CACGCGCGAT
ACTCGTCGCA TGCATCAACT GGCAAAAGCG GTGATTATCT TAGATGAGGT GCAGACAGTT
CCAATTAGAA TCGTCCATAT GCTGAACGTG GCTCTGCGTT TTTTAGCGCA TAGTTGTGGC
TCAACGGTTG TCCTTTGTAC GGCTACACAG CCGCCGCTGG ATCGACTACC GGACAATCCA
TACCGTTCGC TTATTGTTAA GCAGGAACAA CGAATCATCC TGAACGAAGT GGAACTGTTC
AAGAGCATGC AGCGCGTCGA GGTTCACTAT GCACGTAAAG ATGGAGGAAT GACGAATGAT
GAGATCGTCG ATCTTGCCGG ACGGGCACTG GATTCGGAGG GCAGTGTTCT GATCGTGGTC
AATACGCGAG CGATGGCGCG AACATTGTAT GAACAAATCA AAGTCAGGCG TCTGGCGCCA
ACATATCATC TGAGCACCAA TATGTGCCCT GCACACCGAA TGGATGTGCT GAATACCATC
AAGGAAAAAT TAGCGGCAAA AGAACCTGTT ATCTGCGTGA GTACGCAACT TATCGAGGCA
GGTGTAGATA TTGATTTCGG CGCAGTTATT CGCTCGCTCG CAGGGTTGGA CTCGGTCGCG
CAGTCAGCCG GACGCTGCAA CCGCCATGGA TTACGCGCGA CACCAGGAAA AGTTTGGATT
GTCAATCCAC AGGAAGAGGA TATCGACCAG CTTCTCGACA TTAAGATCGG TCGAGATCAG
GCGCAAAGAG TTCTTAGTGA GTTCAAATCC GGGAACGACT TTATCGGATT AGAAGCAATC
ACTACCTATT ACAGGTATTA CTATTCTTCC AGGAAAGATG AAATGGACTA TCTCGTTCGT
TCGAGCTCAG CGGTTGGGAG GGATGATACA CTGTTCAGAT TGCTGTCAAC AAACGAGCTT
TCGGTGGCAT CATACCGTTC GAGTCGTGGG GTGAATCCAC CGCTCCTCTT GAATCAGTCA
TTCCAATCCG CCGCTAGAGA GTTTTGCGTG ATTGATTCAC CCACAATCGG GGTCATTGTG
CCGTATGGTG ATGAGGGTAA AACGATTATT AATGCATTGT GTAGCGCGTC AGAACTTGAT
AATCGGGTGA AATTCCTCAG ACAGGCACAG AGATTCTCAG TATGTTTATT TCGGCGTCAA
TTTGAAGAAT TAAATCGAGT TGGTGCCATC CTTGAGACGA ATTCAGGGAT CCACTATCTG
GATGAGCGAT ATTACAGTAA GGAGTTTGGT TGGAGTCATG ATCCGGTTAA CGATATGGAT
ACACTCATTG TATAA
 
Protein sequence
MLRFSPHVAR RVQETRWHRS ERTEPQPDGS LVWRAQIAEP KEMLPWIRGW GADVEVLGPE 
GLRGTLEAEV RKMARVYGVS EMKLQDALIA HRRERDGKEQ SLVDHLKGTA ELAKRFAGKI
GLPELGEVMG LAHDFGKASK EFQDYLKSAT GLKNPDEDDY IDYESRKGKI DHSTAGAQLV
YEKCSHLGKE GEFLAQFLAL AIASHHSGLI DCLTPTGEDN FSRRITKDDS VTHVSEARGK
LPEIEDGLNK TLTLDILKQF MQKLQSLREQ NDSKETIAFK LGILARFLLS CLLDADRLNT
ADFEFPENSV IRSYGNYVSW DMLIERLEKT FNEFAQAVAQ TKEGSQAREV YQLRSQVARA
CREAATKPKG IYQLTVPTGG GKTLASLRFA LHHAKHHGMD RVFYVAPYIT IIDQNADTLR
TILEPESERG RVVLEHHSNF VPEEDTHRRH SLLAENWDAP IVCTTQVQFL EALFGAGTRD
TRRMHQLAKA VIILDEVQTV PIRIVHMLNV ALRFLAHSCG STVVLCTATQ PPLDRLPDNP
YRSLIVKQEQ RIILNEVELF KSMQRVEVHY ARKDGGMTND EIVDLAGRAL DSEGSVLIVV
NTRAMARTLY EQIKVRRLAP TYHLSTNMCP AHRMDVLNTI KEKLAAKEPV ICVSTQLIEA
GVDIDFGAVI RSLAGLDSVA QSAGRCNRHG LRATPGKVWI VNPQEEDIDQ LLDIKIGRDQ
AQRVLSEFKS GNDFIGLEAI TTYYRYYYSS RKDEMDYLVR SSSAVGRDDT LFRLLSTNEL
SVASYRSSRG VNPPLLLNQS FQSAAREFCV IDSPTIGVIV PYGDEGKTII NALCSASELD
NRVKFLRQAQ RFSVCLFRRQ FEELNRVGAI LETNSGIHYL DERYYSKEFG WSHDPVNDMD
TLIV