Gene DvMF_1976 details

Gene Information

Locus tag: DvMF_1976
Symbol:
ID: 7173895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris str. 'Miyazaki F'
Kingdom: Bacteria
Replicon accession: NC_011769
Strand:
Start bp: 2443182
End bp: 2444213
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 65%
IMG OID: 643540493
Product: CRISPR-associated protein Cas1
Protein accession: YP_002436387
Protein GI: 218887066
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype
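
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_len = span_length(2443182, 2444213)
print(gene_len, protein_length(gene_len))  # 1032 343
```

Under those assumptions the computed values agree with the listed gene length (1032 bp) and protein length (343 aa).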


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 86
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACGCC TGCTGAATAC GCTCTACGTA ACCACCCAAG GCGCCTACCT TGCCAAGGAC 
GGCGAGGCCG TGGCCGTGCG GGTGGAACAG GAAACCCGCC TGCGGGTGCC TTTGCACGGG
CTTGGCGGGG TGGTGTGCTT CGGGCTGGTA TCGGCAAGCC CGCCCCTGCT TGCGGCCTGC
GCCGAACAGG ACATCGGCGT CAGCTTTCTT TCCGAGCATG GGCGGTTTCT GGCCTCGGTG
CGCGGGCCTG TTTCGGGCAA CGTGCTGTTG CGGCGCGAAC AGTACCGCCG TGCCGACCAG
CCCGATGCCC GCGCCCTGCT TTGCGGATGC TTTGTTCAGG GAAAGATCAT CAACGCCCGG
ACGGTACTGC GCCGTTTTCT GCGCGACCAT GGCGACAGCG CGGGCGCTCC GGCCATCGCA
TCGGCAGCGT CATCCATGGC CGACATGCTC GACAGGGTGG TGCGCGCCAC CACGGAAGAA
CAGATCCGGG GCATCGAGGG CGAGGCGGCA TCGCTGTACT TCGGCGTGTT CGACGGGCTC
ATCCTGACGC GTACCAGGGA TTTCACCTTC ACGACGCGCA GCCGCCGCCC CCCGCTGGAC
CCGGTCAACT GCCTGTTGTC CTTCGTGTAC ACCCTGCTTG CCCACGACGT GCGTTCGGCA
CTGGAAACCG TGGGGCTGGA CCCGCAAGTG GGCTTTCTGC ACCGCGACAG GCCGGGCAGG
CCAAGCCTTG CCCTGGATGT CATGGAAGAG TTTCGGCACT GGCTTGCGGA CAGGCTGGTG
CTCTCGCTCG TCAACAGGGG GCAACTGGAC CCCAAAGACT TCAAGCGCAG CGGTTCCGGC
GGCGTGGTGC TGGCCGACGA TGCACGCAAG GATGTGCTTG TGGCGTGGCA GAAACGCAAA
CAAGAAGAAG TGCTGCACCC CTTTCTGGAT GAACGGATGC CCATAGGACT GTTGCCGCAT
ACGCAAGCCA TGCTGCTGGC GCGCCATCTG CGCGGCGAAC TGGATGCCTA TCCCCCGTTC
TTGTGGAAAT AG
 
Protein sequence
MRRLLNTLYV TTQGAYLAKD GEAVAVRVEQ ETRLRVPLHG LGGVVCFGLV SASPPLLAAC 
AEQDIGVSFL SEHGRFLASV RGPVSGNVLL RREQYRRADQ PDARALLCGC FVQGKIINAR
TVLRRFLRDH GDSAGAPAIA SAASSMADML DRVVRATTEE QIRGIEGEAA SLYFGVFDGL
ILTRTRDFTF TTRSRRPPLD PVNCLLSFVY TLLAHDVRSA LETVGLDPQV GFLHRDRPGR
PSLALDVMEE FRHWLADRLV LSLVNRGQLD PKDFKRSGSG GVVLADDARK DVLVAWQKRK
QEEVLHPFLD ERMPIGLLPH TQAMLLARHL RGELDAYPPF LWK
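
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own method: `translate` and `CODON_TABLE` are hypothetical names, and the table uses the standard amino-acid assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons, which does not matter here because this gene begins with ATG).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGCGACGCC TGCTGAATAC GCTCTACGTA"))  # MRRLLNTLYV
print(translate("TTGTGGAAAT AG"))                     # LWK
```

The two fragments translate to the first ten and last three residues of the protein sequence listed above.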