Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Daud_1289 |
Symbol | |
ID | 6025647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Desulforudis audaxviator MP104C |
Kingdom | Bacteria |
Replicon accession | NC_010424 |
Strand | - |
Start bp | 1361496 |
End bp | 1362416 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641594106 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001717432 |
Protein GI | 169831450 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATCTTG TCGTTTCGGA CTACGGATGT GCGCTGGGGA AAAAGAGCGA GCGGCTGTTG GTGCGGCAAA AGGGAGAAGT GTTGAGTGAG ACCCCGTTTT ACGACATAAA ACAGATCACC ATTTCCGGCC GCGGGGTTTC TCTTTCCACG GATGTGATTC AGGAATGCCT GGAGCATGGC ATCCAGATCA ACTTCATATC CTTCTCCGGC AAACCGTACG CCAAACTGGC GGCGCCGAAC CTGACCGGCA CGGTGCTCAC CCGGCGGGAA CAGTTGAAGG CCTACGACGA CCGGCGGGGC GTAATCCTGG CGAAGGCATT TGTCGAGGGG AAGCTGAAGA ACCAGGTCAA CGTGCTCAAG TATTTCGCCA AGTACCGGCG CAGCGCCGAC ACGGAAAAGT ATGCGGAGAT CTACCGGAAG ATCGAGGAGA TCGATCGGAT TCGCAGCCAG CTGGCAACTC TGGATGCCGC GCGGATTGAC GATCTGCGGG GGCAGTTGTT CGCGATCGAG GGACGGGGGG CGCATCACTA CTGGGACGCC CTGGGGCTGA TCATCGGGGA CCGCATCGAG TTCCCCGGCC GGGAACGCCG TGGGGCGACC GACCCCGTCA ATTCGGCGCT CAATTACGGC TATGGCATTC TGTACTCCCA GGTGGAAGGG GCCGTCCTTC TGGCCGGTCT TGACTCCTTC GGCGGCTTCC TTCACACCGA CCGCCCCGGC AAGCCATCCA TGGTGCTGGA TCTGGTGGAG GAGTTCCGGG CGGTGACCGT GGACCGGGTG GTCGTTGCGA TGGTGACCAA AGGGCCCGGT ATCGAAATGG ATGGGGACAA GCTGACCGAT GAAACAGAAA GGAGTTGGGC CGGCGGGTCT TGGAACGCCT GGAAGGCGAA GAGAGCTTCG AGGGCAAGAA ACACAAGCTG A
|
Protein sequence | MHLVVSDYGC ALGKKSERLL VRQKGEVLSE TPFYDIKQIT ISGRGVSLST DVIQECLEHG IQINFISFSG KPYAKLAAPN LTGTVLTRRE QLKAYDDRRG VILAKAFVEG KLKNQVNVLK YFAKYRRSAD TEKYAEIYRK IEEIDRIRSQ LATLDAARID DLRGQLFAIE GRGAHHYWDA LGLIIGDRIE FPGRERRGAT DPVNSALNYG YGILYSQVEG AVLLAGLDSF GGFLHTDRPG KPSMVLDLVE EFRAVTVDRV VVAMVTKGPG IEMDGDKLTD ETERSWAGGS WNAWKAKRAS RARNTS
|
| |