Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_0401 |
Symbol | |
ID | 5711317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 395098 |
End bp | 396009 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641266306 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_001531751 |
Protein GI | 159042957 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03639] CRISPR-associated endonuclease Cas1, NMENI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.455858 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCAGA TCGTCGATAT CGCCACGGAT GGGCGGCATC TCTCGCGTGA CCGGGGCTTC CTGAAGGTCA GCGAGGGCGC CCGGGAAATC GGCCGTATCC CCCTGGACCA GATCGCGGGC GTCATCGTGC ACGCCCATGG CACCACCTGG ACCACCTCCC TTCTGACCGA GCTGGCGGAT CGCGGGGCGC CGGTCGTGCT CTGCGGGGCC AACCACGCGC CGCGCTCGGT CCTCATGCCG CTCGACGGGC ACCATGCCCA GGGCGCGCGC CTGCGGGCAC AGTGGCAGGC CAGGGCCCCG CTCGTGAAAC AGGCCTGGAA GCAGACGGTA ATCGCGAAGA TCGCCATGCA GGCCGCCGCC TTGGAGGCCA TGGGCGAACC CCATGCCCCC GTCGGCATGC TCGCGCGCAA GGTGACCAGC GGCGATGCCA CCAATGTCGA GGCGCAGGCC GCGCGTCTCT ACTGGCCCCG AATGATGGGC ACCGAGTTCC GCCGCGACCG CACCGCCCCC GACCTCAACG CGCTCCTGAA CTATGGCTAC ACGGTGCTGC GCGCCGCCAC CGCCCGTGCG GTCGTGGCCG CAGGGCTGCA CCCGACCATC GGCCTGCACC ATTCGAACCG CGGCAACGCC TTTGCCCTGG CCGACGACCT GATGGAGCCG TTCCGCCCGC TCGTCGATTG TTGCGTGCGC GGCCTAGCCG CCCGTAACGG ACCCCAGGTC GATCCCGCGG CCAAGCAGTC CCTAGCGCGG CTCATCGCGC TGGACCTGCC CCTCGGCGAC AGCCTCACCC CGGTTTCCGT CGCGCTCGGC AAGCTTGCCA TCTCTCTCGG TCAGAGCTTC GAGTCCGGAA CGCTGGATCT CGCCCTGCCT GCACCGCCCG ATGCGCTGAC CCTTGCAGGC CTCGGCGCAT GA
|
Protein sequence | MDQIVDIATD GRHLSRDRGF LKVSEGAREI GRIPLDQIAG VIVHAHGTTW TTSLLTELAD RGAPVVLCGA NHAPRSVLMP LDGHHAQGAR LRAQWQARAP LVKQAWKQTV IAKIAMQAAA LEAMGEPHAP VGMLARKVTS GDATNVEAQA ARLYWPRMMG TEFRRDRTAP DLNALLNYGY TVLRAATARA VVAAGLHPTI GLHHSNRGNA FALADDLMEP FRPLVDCCVR GLAARNGPQV DPAAKQSLAR LIALDLPLGD SLTPVSVALG KLAISLGQSF ESGTLDLALP APPDALTLAG LGA
|
| |