Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2572 |
Symbol | |
ID | 5706144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2926471 |
End bp | 2929431 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641272035 |
Product | CRISPR-associated helicase Cas3 |
Protein accession | YP_001537405 |
Protein GI | 159038152 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00323142 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCCGATT GGTGCGACCT GAGCGAGGAT GCCCGTCTCG TCTGGGGTAA GACGAACCGT GACCGGGGCC TGGTCTGGCT GCCGCTGTTC CGGCATCTCG CCGACAGCGC CGACGTGGCC GGGCTCTTGT GGGATGTGTG GCTGCCGGTC ACGGTCCGGC AGCGCATCGC CGGGTCTTTT CCGGGCGGCT TCGACGACGG ACGTGTCCTG GTTCGGTGGC TCGCTGGTGT TCACGACATC GGAAAGGCCA CGCCCGCTTT CACCTGGCAG GTGGGGCCGT TGCGGGAGGC CATGCGGGAG CACGGATTCA CCTTCGACCG CCAGGTCGAG GCGGACCGTG GCCGCGCGCC GCACGGGACG GCCGGGCAGA TCGTGCTCAC GGACTGGCTG ACCGGCCGGC ACGGCTGGGA CCGGGCCTCG GCCGGGGGGT ACGCGGTGGT CGTGGGCGGT CATCACGGGG TACCGCCGAC CGACGCTGAT CTGACGGAGA TTCGTGCCCG GCCGTACCTG CTCGGCGCGG GTGGGATCTG GCCTGCAGTG CAGGATGAAC TGCTCGGTTG GCTGACCCGT CGTGTCGCTG CCACGGACCG GTTGGCCGTA TGGCGGGAGG TGATCCTGCC GCAGCCGGTG CAGGTGTTGT TGACCGCGAT CGTCATCGTC GCCGACTGGA TCGCCAGCAA CGAACAGTTC TTCCCGTACG GCTTCCGGCA GGAAGATGCC TCCGATCGGC TTCAACAGGC GTGGGAGGAG TTGGATCTGC CGACCCCGTG GCGGGCCGTC GACACCACCG GCGTCGACGC GGCGAGGTTC CTCGCCCAAC GGTTCAAGAT GCCGCCCGGC GCTCAGCCGT ACCCGGTCCA GGCGGCGGTG CTGGAGCAGG CCCGTGCGAT GCCGCTGCCG GGGCTGCTGA TCGTCGAGGC ACCGATGGGA GAGGGCAAAA CCGAGGCCGC CCTGGCGGCC GTGGAAGTCC TCGCCGGGCG AAGCGGGGCG GGCGGGTGCT TCGTGGCGCT GCCCACCAGG GCGACAAGTG ACGCGATGTT CAGCCGGGTG CTCGGCTGGC TGAGCCGGCT GCCGGACGCC GACGTCGGTC GGGGTGCCCG AGAGGCGGCG CTGGCCCACG GCAAGGCGAT CCTCAACGAC GAGTATTCCC GCCTCTATCG CGGAATGTTG CCCAGCGCGA TCGGGGTGGA CGAGGGTGGC ACGGCTGTCG CCGCGCACGG CTGGCTGGCA GGGCGGAAGC GGAAGATGCT GTCCAGTTTC GTGGTCGGCA CCATCGACCA GTTGCTCTTT GCCGCGTTGC AGGCTCGTCA TGTGGCACTG CGGCACCTCG GACTGGCCGG GAAGGTGGTC GTCGTCGACG AGGCCCACGC CTACGACGTC TACATGAGCC AGTATCTCGA CCGTGCTCTC GAGTGGCTGG GCGCGCACGG CGTTCCGGTG GTGGTTCTCT CCGCGACCCT CCCGGCGCAT CGGCGCGCCG ACATGATGCA GGCGTACGAC GTCGGGCGGT TTGGCCCCCG TCGAGGCGCC CGTCGCCGGC GCCGGTTCCG TGCGGGCAGC GACGAGGCTA CCGACGATGA CCAGGTGCTC CGTGACGAGC GGCGGTATCC GCTACTCAGC GTCTCAGGTG TGGACCGGGT GCCGACCACG GTGGGATGTG CGGCGTCCGG TCGAAGCTTC GACGTACGGC TGGAACGCAG CGACGACGAC CTGGCCGCCC TCGCCGACCG GCTCCGGACC GACCTCACCG ACGGAGGCTG CGTCCTGGTC GTCCGTAACA CCGTCGCCCG GGTGCTCGAG ACGGCCGACG AATTGCGACG GCTACTCGGC CCAAAAATTC CGGTCAGCGT GGCGCATTCC CGTTTCATGG CCGCTGATCG GGCGGCCAAG GACACCTGGC TACGGGACAC CTTCGGCCCG CCCGAGCATC TCACCGAGTG GGGGCGGGCG CGGCCGGCCT GCCACGTCGT CGTCGCCAGC CAGGTCGCCG AACAGTCCCT GGACATCGAC TTCGACCTGC TGGTGACCGA CCTGGCACCG GTGGACCTGA TCCTGCAACG ACTCGGCCGC CTGCATCGGC ACCGGCGTGC CGTTCGCCCC GGCAGGCTGG CACAGCCGCG GTGCCTGGTG ACCGGGGTCG ACTGGGCGAC TGTCCCACCC GCCCCGGTAC CTGGCTCGGT CACGGTGTAT CAGCACCACG CCCTGCTACG GGCCGCGGCC GTCCTGCTGC CGTATCTCGA CGGCGACCGG CCCCTGTACC TGCCCCAGGA CATCGCACCC CTGGTGCAAA CCGCGTACGG CAGAGGGTCG GTGGGTCCGC CGGGGTGGCA GTCCACGTTG GCCGAGGCGG CGGTCCAGGC CGACGCCCGC GCCGAGGTGG CACGCAAACG CGCGGATACC TTCCGGGTCG GGCCGGTCCA GGCGCCGGGG GTGAGCCTGG TGGGATGGCT CGACGCCAGC GTCGGCGACA GCGAGGCCAC GAACGGCAAC GAGTGGCGAG GGCGGGCCCA CGTCCGCGAC GACGGGGCTG AGGCGTTGGA GGTGCTGATC CTTGTTCGCC GGGGCGAGAA GCTGGTGACC CCACCGTGGC TCGACCAGGG CGGCGACGTC GAGGTCCCGA CCGACTTCGA GCCCTCGCGG TCGGTGGCCC GCACGATGCT GCGATGTGCC CTGCCGCTGC CGCAAGCGCT CACCGCGCAC GGGGGCGTAG ACCGGATCAT TGCCGAGTTG GAGCGGCGGA ACACGTTCCC GGCCTGGGAG AAAGACCGAC TACTTGGTGG GGAGCTGATC CTCGACCTCG ACGAGCAGGG GCGAAGTCGA CTCACGGATT TCGTCCTGAC CTACGACCGG GACACCGGGC TACGGGCTAC GGGCTACGGG CTACGGGCTA CGGGCTACGG GCTACGGGCT ACGGGCCAGC AAGATCGACC ATTACGCTGC CCACGGGCAA CTATCCGATA G
|
Protein sequence | MADWCDLSED ARLVWGKTNR DRGLVWLPLF RHLADSADVA GLLWDVWLPV TVRQRIAGSF PGGFDDGRVL VRWLAGVHDI GKATPAFTWQ VGPLREAMRE HGFTFDRQVE ADRGRAPHGT AGQIVLTDWL TGRHGWDRAS AGGYAVVVGG HHGVPPTDAD LTEIRARPYL LGAGGIWPAV QDELLGWLTR RVAATDRLAV WREVILPQPV QVLLTAIVIV ADWIASNEQF FPYGFRQEDA SDRLQQAWEE LDLPTPWRAV DTTGVDAARF LAQRFKMPPG AQPYPVQAAV LEQARAMPLP GLLIVEAPMG EGKTEAALAA VEVLAGRSGA GGCFVALPTR ATSDAMFSRV LGWLSRLPDA DVGRGAREAA LAHGKAILND EYSRLYRGML PSAIGVDEGG TAVAAHGWLA GRKRKMLSSF VVGTIDQLLF AALQARHVAL RHLGLAGKVV VVDEAHAYDV YMSQYLDRAL EWLGAHGVPV VVLSATLPAH RRADMMQAYD VGRFGPRRGA RRRRRFRAGS DEATDDDQVL RDERRYPLLS VSGVDRVPTT VGCAASGRSF DVRLERSDDD LAALADRLRT DLTDGGCVLV VRNTVARVLE TADELRRLLG PKIPVSVAHS RFMAADRAAK DTWLRDTFGP PEHLTEWGRA RPACHVVVAS QVAEQSLDID FDLLVTDLAP VDLILQRLGR LHRHRRAVRP GRLAQPRCLV TGVDWATVPP APVPGSVTVY QHHALLRAAA VLLPYLDGDR PLYLPQDIAP LVQTAYGRGS VGPPGWQSTL AEAAVQADAR AEVARKRADT FRVGPVQAPG VSLVGWLDAS VGDSEATNGN EWRGRAHVRD DGAEALEVLI LVRRGEKLVT PPWLDQGGDV EVPTDFEPSR SVARTMLRCA LPLPQALTAH GGVDRIIAEL ERRNTFPAWE KDRLLGGELI LDLDEQGRSR LTDFVLTYDR DTGLRATGYG LRATGYGLRA TGQQDRPLRC PRATIR
|
| |