Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dred_1625 |
Symbol | |
ID | 4957420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum reducens MI-1 |
Kingdom | Bacteria |
Replicon accession | NC_009253 |
Strand | + |
Start bp | 1768844 |
End bp | 1770478 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640180801 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001112978 |
Protein GI | 134299482 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1468] RecB family exonuclease [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR00372] CRISPR-associated protein Cas4 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.575155 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGAAC ATAATATATC CAATGAACAG CATTACTTTC CAATTTCTTC TGTGGCAGAA ATACTTTACT GCCCGAGAAA CTTTTACTAC CGGGTGGTTG AAGGGGCAGA AGATTCTAAC CACCATCTAT TGGAGGGCAA GTTGCAGGAG GAACGGCGGG ACGAAAGACA GCGACTGGTT CGGGAAGGTT ACCGTCAAGA TAGGTCCATT CATGTTTCTT CAGAAAAACT TAACCTTTAT GGTATTGTAG ATATAGTGGA GCAGGGTAAA GAGATTTACC CGGTGGAATA TAAGAAGGGC TTTCGAAAGG AAAGTTTGAA TGATGATGTC CAGGTTTGTG CCCAGGCTAT GGCTCTGGAA GAAAAACTAG GTCAAGATAT TAACCGGGGA TATATTTATT ACGCTGGGTC CAAAGCTAGG CGTGAAGTAA TATTTGATGA AGACTTGCGA CTGATGGTAG AGAATGCGGT TGGCCTGGCC AGAAATATTG CTTTGTCAGG GGAGATACCG CCACCACTGG CTGATAACCG TTGTGAAGGC TGTGCCCTGG TAAATCGTTG TCTTCCCTTT GAAGTGAAAG GGATCAAAGA AAACAAGGCA AAGGCAGTTC GCCCCCAACC AGGAATTAAC CTAGGCAGGG TTCTATATGT GGACGAACAG GGTGCATCCC TTTATAAAAA GGGAGAACGG GTGCTGGTAA CAAAAGATCA AATAAAATTT AAAGATATAC CACTATGCAA CCTGGATCAA GTGGTCCTTG TGGGGAATGT CAATTTATCT TCCCAGTTAA TTAAACTTTT TTTGGGAAGA GGTACAGAGG TCCATTTTAT ATCCACAAAG GGAAAATACT ATGGCTGTCT TCAGGCTGCC CTGTCAAAAA ACTCTGTTTT GCGTATTGCC CAGCATCGGG CCTACCAGAA GCAAGAGGAG CGCCTGCTCT ATGCCAGTGA ATTTGTTCGT GGAAAGCTAT CCAATATGCG AACCAATTTA TTAAGATATA ACCGATCGTT AAATAACCAT AGTATTGATG AAGCTGTATC AAGAATAAAA AATATTATCA AAAGGTTGGA GAAGGCCAAA GATCTTAATG AGTTGATGGG TTTGGAGGGG GCTGGTTCCC GAGATTATTT TAGTGTGTTT GGCCTGCTTA TTAAGGATAG AGTACCCTTT GATTTTAATA AGCGCAGCAG GCGCCCTCCT GAAGACCCGG CCAATGCGCT CCTAAGTTTT AGCTACTCTC TGCTGTTAAA AGATGTGATC ACAGCCGTTC AGGTGGTAGG TTTTGACCCA TTTATTGGGT TTCTTCATAG GTCTGATTTT GGTCGACCTG CCCTGGCCCT GGATATAATA GAAGAGTTTC GGCCAGTAGT GGCAGACTCA GTTGTGCTAA CGGCTTTAAA CAAAGGTGTT ATTGCAGAAG GGGATTTTGA GTACAGGATG GGTGGATGTT TTTTAAGTGA AACCGGGCGT AAAAAAATGT ATCGGCTTTA CGAAGAGCGC AGAAAGGAAA TGATTACCCA TCCGGTTTTT GGCTACCGTA TTTCCTACCT GCGTACCATA GAATTACAAG CACGATTTTT GGCAAAAGTC CTTACTAAGG AAATCGATGG GTATAAACCT TTTCTTGTTC GGTAG
|
Protein sequence | MAEHNISNEQ HYFPISSVAE ILYCPRNFYY RVVEGAEDSN HHLLEGKLQE ERRDERQRLV REGYRQDRSI HVSSEKLNLY GIVDIVEQGK EIYPVEYKKG FRKESLNDDV QVCAQAMALE EKLGQDINRG YIYYAGSKAR REVIFDEDLR LMVENAVGLA RNIALSGEIP PPLADNRCEG CALVNRCLPF EVKGIKENKA KAVRPQPGIN LGRVLYVDEQ GASLYKKGER VLVTKDQIKF KDIPLCNLDQ VVLVGNVNLS SQLIKLFLGR GTEVHFISTK GKYYGCLQAA LSKNSVLRIA QHRAYQKQEE RLLYASEFVR GKLSNMRTNL LRYNRSLNNH SIDEAVSRIK NIIKRLEKAK DLNELMGLEG AGSRDYFSVF GLLIKDRVPF DFNKRSRRPP EDPANALLSF SYSLLLKDVI TAVQVVGFDP FIGFLHRSDF GRPALALDII EEFRPVVADS VVLTALNKGV IAEGDFEYRM GGCFLSETGR KKMYRLYEER RKEMITHPVF GYRISYLRTI ELQARFLAKV LTKEIDGYKP FLVR
|
| |