Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_4335 |
Symbol | |
ID | 5211319 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 5448155 |
End bp | 5450869 |
Gene Length | 2715 bp |
Protein Length | 904 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640597919 |
Product | CRISPR-associated helicase Cas3 |
Protein accession | YP_001278623 |
Protein GI | 148658418 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG [TIGR01587] CRISPR-associated helicase Cas3 [TIGR01596] CRISPR-associated endonuclease Cas3-HD |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0995159 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00193959 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGCTACGGT TCAGCCCGCA TGTGGCGCGA CGGGTGCAGG AGACGCGCTG GCACCGCTCC GAGCGCACCG AGCCGCAACC CGACGGATCG CTGGTGTGGC GAGCACAGAT CGCTGAGCCG AAAGAGATGC TGCCCTGGAT CCGCGGCTGG GGCGCCGATG TGGAAGTGCT GGGACCGGAG GGGCTGCGCG GGACGCTGGA AGCGGAGGTG CGAAAGATGG CGAGGGTGTA TGGCGTGAGT GAGATGAAGT TGCAGGATGC TCTGATTGCG CATCGACGCG AACGCGATGG TAAAGAACAA TCTTTAGTTG ATCATCTGAA AGGAACCGCA GAACTGGCGA AGCGTTTTGC CGGAAAAATA GGATTGCCAG AACTTGGAGA AGTCATGGGA CTGGCACACG ACTTCGGAAA AGCCAGCAAG GAGTTTCAGG ATTACCTGAA GTCTGCTACA GGGTTGAAAA ATCCAGATGA AGATGACTAT ATAGATTATG AGTCGCGGAA GGGAAAAATA GATCACTCTA CAGCAGGGGC ACAGCTTGTA TATGAGAAAT GCAGCCATTT AGGCAAGGAA GGCGAGTTCC TTGCGCAATT TCTGGCTCTG GCAATAGCTT CGCACCACTC CGGACTGATC GATTGTCTGA CACCCACAGG TGAAGATAAC TTCAGTCGAC GTATTACTAA AGATGATAGT GTAACACATG TATCTGAAGC CAGGGGTAAG TTACCTGAGA TCGAGGATGG TCTCAATAAA ACTCTTACGC TGGATATATT AAAACAATTC ATGCAAAAAC TTCAAAGTCT GAGAGAACAG AACGACTCAA AGGAGACTAT TGCTTTCAAA CTCGGTATTC TGGCTCGTTT CCTGCTGAGT TGTCTTCTGG ATGCAGATCG CCTGAACACT GCTGATTTCG AGTTTCCTGA AAACAGTGTA ATCCGTAGTT ATGGCAACTA TGTATCCTGG GATATGCTGA TTGAGCGACT GGAGAAAACA TTTAATGAAT TTGCACAGGC AGTCGCACAA ACGAAGGAAG GTAGCCAGGC GCGGGAAGTA TATCAACTGC GATCACAGGT GGCGCGGGCA TGCCGTGAAG CAGCTACGAA ACCAAAGGGC ATTTACCAAC TGACTGTACC GACTGGCGGT GGCAAGACTC TGGCAAGTCT ACGCTTTGCG CTACACCATG CAAAGCATCA CGGGATGGAC AGGGTTTTTT ACGTGGCGCC ATACATTACG ATTATTGACC AGAATGCTGA CACCCTGCGA ACAATTCTTG AGCCTGAAAG TGAACGCGGT CGTGTCGTAC TCGAACACCA CTCCAATTTT GTTCCAGAAG AAGATACGCA TCGTCGTCAT AGCCTGCTGG CGGAAAACTG GGATGCGCCG ATTGTATGCA CCACGCAGGT CCAATTCTTA GAGGCCTTGT TCGGTGCAGG CACGCGCGAT ACTCGTCGCA TGCATCAACT GGCAAAAGCG GTGATTATCT TAGATGAGGT GCAGACAGTT CCAATTAGAA TCGTCCATAT GCTGAACGTG GCTCTGCGTT TTTTAGCGCA TAGTTGTGGC TCAACGGTTG TCCTTTGTAC GGCTACACAG CCGCCGCTGG ATCGACTACC GGACAATCCA TACCGTTCGC TTATTGTTAA GCAGGAACAA CGAATCATCC TGAACGAAGT GGAACTGTTC AAGAGCATGC AGCGCGTCGA GGTTCACTAT GCACGTAAAG ATGGAGGAAT GACGAATGAT GAGATCGTCG ATCTTGCCGG ACGGGCACTG GATTCGGAGG GCAGTGTTCT GATCGTGGTC AATACGCGAG CGATGGCGCG AACATTGTAT GAACAAATCA AAGTCAGGCG TCTGGCGCCA ACATATCATC TGAGCACCAA TATGTGCCCT GCACACCGAA TGGATGTGCT GAATACCATC AAGGAAAAAT TAGCGGCAAA AGAACCTGTT ATCTGCGTGA GTACGCAACT TATCGAGGCA GGTGTAGATA TTGATTTCGG CGCAGTTATT CGCTCGCTCG CAGGGTTGGA CTCGGTCGCG CAGTCAGCCG GACGCTGCAA CCGCCATGGA TTACGCGCGA CACCAGGAAA AGTTTGGATT GTCAATCCAC AGGAAGAGGA TATCGACCAG CTTCTCGACA TTAAGATCGG TCGAGATCAG GCGCAAAGAG TTCTTAGTGA GTTCAAATCC GGGAACGACT TTATCGGATT AGAAGCAATC ACTACCTATT ACAGGTATTA CTATTCTTCC AGGAAAGATG AAATGGACTA TCTCGTTCGT TCGAGCTCAG CGGTTGGGAG GGATGATACA CTGTTCAGAT TGCTGTCAAC AAACGAGCTT TCGGTGGCAT CATACCGTTC GAGTCGTGGG GTGAATCCAC CGCTCCTCTT GAATCAGTCA TTCCAATCCG CCGCTAGAGA GTTTTGCGTG ATTGATTCAC CCACAATCGG GGTCATTGTG CCGTATGGTG ATGAGGGTAA AACGATTATT AATGCATTGT GTAGCGCGTC AGAACTTGAT AATCGGGTGA AATTCCTCAG ACAGGCACAG AGATTCTCAG TATGTTTATT TCGGCGTCAA TTTGAAGAAT TAAATCGAGT TGGTGCCATC CTTGAGACGA ATTCAGGGAT CCACTATCTG GATGAGCGAT ATTACAGTAA GGAGTTTGGT TGGAGTCATG ATCCGGTTAA CGATATGGAT ACACTCATTG TATAA
|
Protein sequence | MLRFSPHVAR RVQETRWHRS ERTEPQPDGS LVWRAQIAEP KEMLPWIRGW GADVEVLGPE GLRGTLEAEV RKMARVYGVS EMKLQDALIA HRRERDGKEQ SLVDHLKGTA ELAKRFAGKI GLPELGEVMG LAHDFGKASK EFQDYLKSAT GLKNPDEDDY IDYESRKGKI DHSTAGAQLV YEKCSHLGKE GEFLAQFLAL AIASHHSGLI DCLTPTGEDN FSRRITKDDS VTHVSEARGK LPEIEDGLNK TLTLDILKQF MQKLQSLREQ NDSKETIAFK LGILARFLLS CLLDADRLNT ADFEFPENSV IRSYGNYVSW DMLIERLEKT FNEFAQAVAQ TKEGSQAREV YQLRSQVARA CREAATKPKG IYQLTVPTGG GKTLASLRFA LHHAKHHGMD RVFYVAPYIT IIDQNADTLR TILEPESERG RVVLEHHSNF VPEEDTHRRH SLLAENWDAP IVCTTQVQFL EALFGAGTRD TRRMHQLAKA VIILDEVQTV PIRIVHMLNV ALRFLAHSCG STVVLCTATQ PPLDRLPDNP YRSLIVKQEQ RIILNEVELF KSMQRVEVHY ARKDGGMTND EIVDLAGRAL DSEGSVLIVV NTRAMARTLY EQIKVRRLAP TYHLSTNMCP AHRMDVLNTI KEKLAAKEPV ICVSTQLIEA GVDIDFGAVI RSLAGLDSVA QSAGRCNRHG LRATPGKVWI VNPQEEDIDQ LLDIKIGRDQ AQRVLSEFKS GNDFIGLEAI TTYYRYYYSS RKDEMDYLVR SSSAVGRDDT LFRLLSTNEL SVASYRSSRG VNPPLLLNQS FQSAAREFCV IDSPTIGVIV PYGDEGKTII NALCSASELD NRVKFLRQAQ RFSVCLFRRQ FEELNRVGAI LETNSGIHYL DERYYSKEFG WSHDPVNDMD TLIV
|
| |