Gene DvMF_1971 details

Gene Information

Locus tag: DvMF_1971
Symbol:
ID: 7173890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris str. 'Miyazaki F'
Kingdom: Bacteria
Replicon accession: NC_011769
Strand:
Start bp: 2436553
End bp: 2438775
Gene Length: 2223 bp
Protein Length: 740 aa
Translation table: 11
GC content: 68%
IMG OID: 643540488
Product: CRISPR-associated helicase Cas3
Protein accession: YP_002436382
Protein GI: 218887061
COG category: [R] General function prediction only
COG ID: [COG1203] Predicted helicases
TIGRFAM ID: [TIGR01587] CRISPR-associated helicase Cas3
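
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are inclusive and that the final codon of the CDS is a stop codon. The function name `expected_lengths` is illustrative and not part of the database.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes inclusive coordinates and a terminal stop codon that is
    not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Coordinates copied from the record for DvMF_1971.
gene_bp, protein_aa = expected_lengths(2436553, 2438775)
print(gene_bp, protein_aa)
```

For this record the result agrees with the listed Gene Length (2223 bp) and Protein Length (740 aa).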


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 92
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATATG CGCACACCCT CGCAAATCGA CCGGAATCGG ACTGGGAGCC GCTCTCCCGG 
CATCTTGAAG AGGTGGCAGA CCTCGCGGCG CACTTCGCTT CCGCCTTCGG AGCCGGTGAA
TGGGGCCTGG CGGCAGGCCT GCTGCACGAC GTCGGCAAGC AGTCGACGGC CTTTCAGGCC
TACCTGCGAG CTTCGACCGC AGGCAAGGGG CCCGGTCGCG GGCCGGACCA TTCCACCGCC
GGGGCGCAGT GGGCGCACGG GCATTACAAT GCGAAGCTCG GCAAACTGCT GGCCTATGCC
CTGGCCGGGC ACCACGCTGG GTTGCCCGAC GGCATAGAGT CGCTGGCCCC CCGCCTGAAG
CGTGCGGTGG AGCCATGGCA CAGTCCCGGC GACGACATCG CCGCCCGCGT ACCCGAGATC
ACGGGCTTGC CGCTGGCCGG GCGCATGCCA TCGCTCGGGT TCCAGCTCAT GCTCTTCGTG
CGCATGGTGT TCTCGTGCCT TGTGGATGCG GACTCCCTGT GCACCGAAGC CTTCACCACG
CCGGACAAGG CCGCATGGCG GCGCGGCTAC CTTCCGCTTT CGGAACTGAA AGTTCGGCTG
GACCGTCATC TGGACCACCT TGCCACCCAC GCCCCGGCCA CCCCGGTGAA CAGTCTGCGC
GCGGGCATCC TTGCCGCCTG CCGCAATGCA GCCCCAAACG TCCCCGGCCT GTTCTCGCTG
ACCGTGCCCA CCGGAGGCGG CAAGACGCTT TCCTCACTCG CCTTCGCGCT GGACCATGCG
CAGGCCCACG GCCTGCGGCG CGTAATCTAC GCCATCCCCT ACACCAGCAT CATCGAACAG
ACCGCCAGGG TCTTCCGCGA GGCATTGAAC GATACTGATG ACCAGGCCGT GCTGGAGCAT
CACTCCAACT TCGTGCCCCT GCGCGCCGAC GGCACGCCGG TCACTCCCAG ACGAGAGGGA
CAAGACGACG ACGCGGGCGA AGGCCGCCGC TCTGTACTGG CCGCGGAAAA CTGGGATGCA
CCCGTGGTGG TGACCACCAA CGTGCAGTTC CTCGAATCCC TGTTTGCCGC CCGGCGGTCC
CCCTGTCGCA AACTGCACAA CATCGCCCGC AGTGTGGTGA TTCTGGACGA GGCGCAGATG
CTTCCCCCGG AACACCTGCT GCCCTGCCTG GAAGCTCTGC GCGCCCTTGT GCTGGACTAC
GGGTGCAGCG TGGTGCTGTG CACGGCCACG CAGCCCGCCC TTGGCAAGCG CGAAGGCTTT
GACCGGGGCC TGGAACAGGT ACGGGAAATC ACCCCGAATC CGGAACAGCT TGCCACCGCG
CTGCGTCGGG TGGAGGTGAC CGATGCGGGC ACTCTGGACG ATGCGCAACT GGCCGCCAGG
CTGGCCGGGC AGCCACAGGT GCTGTGCGTG GTCAACACCC GGCCCCACGC TCGCGCCCTG
TACGAACTGC TGGCACCGCA GGGGGATGCG GTACACCTTT CCGCCGCCAT GTGCCCGGCG
CACCGCACGG AAGTGCTGCG CGGCGTCCGT CAGCGCCTGC TTCAGGGTCA ACCCTGCCGG
GTGGTGGCCA CCCAACTGGT GGAGGCCGGG GTGGATATCG ACTTTCCCGT GGTCTACCGC
GCCATGGCGG GGGTGGATTC GCTTGCGCAG GCCGCCGGGC GCTGCAACCG CGAGGGCAAT
CTGGAGCGGG GGCAGGTGTA TCTGTTCACG CCGCAGGACA GCCCGCCGCC GTTCGTCCGG
CAAGCCGCAC AGGCCGCGCG CACGGCGCTG CGCCGCAACC CGGACCCGCT GGCCCTCGAC
ACCGTGGAGG CCTATTTCCG CGAACTCTAC TGGCAGAAAG GGGACAGGCT GGACAGCGCC
AACCTGTTGC CCCTGATGCG GGACAGCGCG CCGCGCCTGG ACTTTCCCTT TCCGGAAGTG
GCGCACCTCT TTCGGCTGAT CCCCGACGAC ACCATTCCGC TCCTCATTCC TTATGACGAC
GACGCGCGCG CCCTGATTGC GGAGTTGCCG TACACACCGG CCCCGGCCCG CCTGCTGCGC
CGCGCCCAGC GCTACACGGT GGGGGTGTAC CCCAGGGTGC TGGCCGCGCT GGTGCAGGCC
GGAGCAGCGC ACCTGGCTAC AGAAGAATGC GCGGTGCTGA TAAACGAAGA CCTGTACGAT
GATCGGCTGG GACTGTGCGC GGACAACCCT ACGTACCGGA ATCCGGAAAG CCTGTTGGGG
TAA
 
Protein sequence
MKYAHTLANR PESDWEPLSR HLEEVADLAA HFASAFGAGE WGLAAGLLHD VGKQSTAFQA 
YLRASTAGKG PGRGPDHSTA GAQWAHGHYN AKLGKLLAYA LAGHHAGLPD GIESLAPRLK
RAVEPWHSPG DDIAARVPEI TGLPLAGRMP SLGFQLMLFV RMVFSCLVDA DSLCTEAFTT
PDKAAWRRGY LPLSELKVRL DRHLDHLATH APATPVNSLR AGILAACRNA APNVPGLFSL
TVPTGGGKTL SSLAFALDHA QAHGLRRVIY AIPYTSIIEQ TARVFREALN DTDDQAVLEH
HSNFVPLRAD GTPVTPRREG QDDDAGEGRR SVLAAENWDA PVVVTTNVQF LESLFAARRS
PCRKLHNIAR SVVILDEAQM LPPEHLLPCL EALRALVLDY GCSVVLCTAT QPALGKREGF
DRGLEQVREI TPNPEQLATA LRRVEVTDAG TLDDAQLAAR LAGQPQVLCV VNTRPHARAL
YELLAPQGDA VHLSAAMCPA HRTEVLRGVR QRLLQGQPCR VVATQLVEAG VDIDFPVVYR
AMAGVDSLAQ AAGRCNREGN LERGQVYLFT PQDSPPPFVR QAAQAARTAL RRNPDPLALD
TVEAYFRELY WQKGDRLDSA NLLPLMRDSA PRLDFPFPEV AHLFRLIPDD TIPLLIPYDD
DARALIAELP YTPAPARLLR RAQRYTVGVY PRVLAALVQA GAAHLATEEC AVLINEDLYD
DRLGLCADNP TYRNPESLLG
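
As a consistency check between the two sequences, the gene sequence can be translated and compared with the protein sequence. The sketch below is a minimal pure-Python translator, not the database's own tooling. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which start codons are permitted. Alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in the
    listing above can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and last fifteen bases of the gene sequence above.
print(translate("ATGAAATATG CGCACACCCT CGCAAATCGA"))
print(translate("AGCCTGTTGGGGTAA"))
```

The first call yields the opening residues of the protein sequence (MKYAHTLANR), and the second yields its final four residues (SLLG) before the TAA stop codon.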