Gene Information |
Locus tag | DvMF_1976 |
Symbol | |
ID | 7173895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | + |
Start bp | 2443182 |
End bp | 2444213 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643540493 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_002436387 |
Protein GI | 218887066 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype |
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 86 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACGCC TGCTGAATAC GCTCTACGTA ACCACCCAAG GCGCCTACCT TGCCAAGGAC GGCGAGGCCG TGGCCGTGCG GGTGGAACAG GAAACCCGCC TGCGGGTGCC TTTGCACGGG CTTGGCGGGG TGGTGTGCTT CGGGCTGGTA TCGGCAAGCC CGCCCCTGCT TGCGGCCTGC GCCGAACAGG ACATCGGCGT CAGCTTTCTT TCCGAGCATG GGCGGTTTCT GGCCTCGGTG CGCGGGCCTG TTTCGGGCAA CGTGCTGTTG CGGCGCGAAC AGTACCGCCG TGCCGACCAG CCCGATGCCC GCGCCCTGCT TTGCGGATGC TTTGTTCAGG GAAAGATCAT CAACGCCCGG ACGGTACTGC GCCGTTTTCT GCGCGACCAT GGCGACAGCG CGGGCGCTCC GGCCATCGCA TCGGCAGCGT CATCCATGGC CGACATGCTC GACAGGGTGG TGCGCGCCAC CACGGAAGAA CAGATCCGGG GCATCGAGGG CGAGGCGGCA TCGCTGTACT TCGGCGTGTT CGACGGGCTC ATCCTGACGC GTACCAGGGA TTTCACCTTC ACGACGCGCA GCCGCCGCCC CCCGCTGGAC CCGGTCAACT GCCTGTTGTC CTTCGTGTAC ACCCTGCTTG CCCACGACGT GCGTTCGGCA CTGGAAACCG TGGGGCTGGA CCCGCAAGTG GGCTTTCTGC ACCGCGACAG GCCGGGCAGG CCAAGCCTTG CCCTGGATGT CATGGAAGAG TTTCGGCACT GGCTTGCGGA CAGGCTGGTG CTCTCGCTCG TCAACAGGGG GCAACTGGAC CCCAAAGACT TCAAGCGCAG CGGTTCCGGC GGCGTGGTGC TGGCCGACGA TGCACGCAAG GATGTGCTTG TGGCGTGGCA GAAACGCAAA CAAGAAGAAG TGCTGCACCC CTTTCTGGAT GAACGGATGC CCATAGGACT GTTGCCGCAT ACGCAAGCCA TGCTGCTGGC GCGCCATCTG CGCGGCGAAC TGGATGCCTA TCCCCCGTTC TTGTGGAAAT AG
|
Protein sequence | MRRLLNTLYV TTQGAYLAKD GEAVAVRVEQ ETRLRVPLHG LGGVVCFGLV SASPPLLAAC AEQDIGVSFL SEHGRFLASV RGPVSGNVLL RREQYRRADQ PDARALLCGC FVQGKIINAR TVLRRFLRDH GDSAGAPAIA SAASSMADML DRVVRATTEE QIRGIEGEAA SLYFGVFDGL ILTRTRDFTF TTRSRRPPLD PVNCLLSFVY TLLAHDVRSA LETVGLDPQV GFLHRDRPGR PSLALDVMEE FRHWLADRLV LSLVNRGQLD PKDFKRSGSG GVVLADDARK DVLVAWQKRK QEEVLHPFLD ERMPIGLLPH TQAMLLARHL RGELDAYPPF LWK
|
| |
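
The coordinate, length, and composition fields in this record are related by simple arithmetic, which can be checked with a short Python sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; both assumptions agree with the values listed above but are not stated by the record itself. The function names are hypothetical and introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in space-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    length = gene_length(2443182, 2444213)   # Start bp, End bp from the record
    print(length, protein_length(length))    # prints "1032 343"
```

Passing the Gene sequence value to `gc_percent` gives a figure that can be compared with the listed GC content, and `len(clean_sequence(...))` applied to the Protein sequence value can be compared with the listed Protein Length.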