Gene Dred_0539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDred_0539 
Symbol 
ID4957594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum reducens MI-1 
KingdomBacteria 
Replicon accessionNC_009253 
Strand
Start bp583830 
End bp586181 
Gene Length2352 bp 
Protein Length783 aa 
Translation table11 
GC content46% 
IMG OID640179718 
ProductCRISPR-associated helicase Cas3 
Protein accessionYP_001111908 
Protein GI134298412 
COG category[R] General function prediction only 
COG ID[COG1203] Predicted helicases 
TIGRFAM ID[TIGR01587] CRISPR-associated helicase Cas3 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGAAC AACTAAAATC ACACCCACAC CTTTTTTTAC ACCAGCACAT TGAGCAGGTT 
AACCAGGCCA CAAAGGCAAT AAGAGATTGG CATACTACTG ACACAATAAC CAATGATATA
AAGTTATTGC TGGGCCATCT GGCCAAATAC CATGATTTGG GCAAGGGTAC CCCTGCCTTT
CAGGAATATA TCGAAAATCC AGAGGGTTAT CAGGGTGATC CCCAGGAGAA GGCCCATTCA
ACTCTTTCTT TACTGCTGAC ACTGGCCATT GCAAGGCAGC AATCCTGGGA GGAGTTACAC
ACCCTGGTCA TTGCCGCCGC TGTTGCCGGG CATCACAGCA GACTGCCAAC CATCCCGGAG
AAGAAAATAG GGGGCGTATG TTGCCCCCAA TGGGATATCG ACGGCTTTGC GGGGGGAGAA
AAGGCGTCAC TCTTGAAACG GCAATTAGCC AGCATCGATT TCCCTGCACT AGAGCAGGAA
ACAAAGGTAG AATTTGGGTC TTATGGTTTG GCCCAGGCAT TGCAAAGCGA CCCAGCCAAA
TCTTTGAGGG AAATGAAGAG ATTTTTAATC ACAAGAATAT ATGGCATATT TGCCTCCCTA
AGTTTGGAAG AAGCCCTAAG ATTAAGAATG AAGGCGCAAC TGCTCTATTC GGTTTTACTG
GAAGCGGATA AGGCTTTGCT GGCAGTATCC AGCCCAGAGG TGTATTTAAA CAGAGAGGTT
CGGCATTGGC AGTCCCGGTG GGTTGAGGAT AAAATTGGCA AGCCCCCGGA GACCTCAATT
AATCAATTGA GACAAAGGGC AAGGCAAGGG GTAATTGCCG CACTGGAGGC CAAAAACACC
AACCTTTACA GTCTCACCGC TCCAACAGGC TGTGGCAAAA CCATGCTGGC AGCCACTTGG
GCCCTTAAAT TAAGAGAACG GGTGACCGAA GGACAGGCTC CGCCCAAGAT TATCATTGTA
CTACCATTTT TATCAGTCAT CGACCAAACG GCCAGGGAAT ATGCCAGGCT ATTATCCCAC
AGTGGGCAAG AAACAGATGG TCGCTGGCTT ATCCAGAGCC ATTCCCTGGC CGATCGCCAC
TACGCCAGGG GGTTAGAGGA TGAGGACGGT CGTTTTTTTA TTGATACCTG GCGCAGTGAA
ATAATCATTA CAACCTATGA CCAGTTTTTA ATGAGTTTGT TAGATCCCCG GGCTAAGTAC
CAAATGAGGT TTCATAATCT GTGTGATGCC ATGATAATCA TGGATGAAGT ACAAGCATTG
CCTTGTAAAT TATGGCAGAC ACTGGAAAAG GTATTTCAGG CCCTAGCCAG TGAGGGCAAT
AGCAGGTTGC TACTGATGTC AGCCACGCTA CCGCCCTTTA TGAAAGAGGC ACTTCCTTTA
TTGCCGGACT ACCAAGGATA TTTTACCCTG TTTAATCGTT ACACCCTGCA ACTACGGCTA
CAGGAATCAC AGACGTTGGA TAATTTCTGC GAAGAAATGT CAGACAGATT GATTGGTTGG
CTGGAATGTA GCAATCGCAT TCTGATCACC CTAAATACCC GCCACAGTGC TCGCAGGGTG
CGGGATTTTC TTAGCCAAAG TTGGCCTGCT GAGTATGGAG ATGTGCCGCT GTTCTTTATC
AGTGCCGATG TGACACCAAA GGACAGGCTG GAAATAGTAA AACAAATCAA ACAGGGCAAA
CCCTGTGTTG TTGTATCAAC CCAGTGTATT GAGGCCGGTG TGGATATTGA TATGGATCGA
GTTATCCGGG ATTTTGGCCC CCTGGACAGC ATTATTCAAA TTGCCGGAAG ATGCAACCGT
GAAGGGCTAA GGGCCCAGGG TGTCGTTGAG GTGGTGGATT TAATCAATGA ACAGGACAAA
AGATATTCAG AAATGATATA TGATACTACC CACCTGCAAA TTACCAGAAA AATTTTAGCA
GATAAGCAGG AAATTCAGGA GAAGGAAATA ATAACTTTAT CCACTCAGTA TTTTAAAGAT
TTAACCGAGC AAAAGGATAC TGGGTACAAT CATTTGATAC GCTTTGCCAA ATGGCAGGAG
GATACGCCGG TAAAGGAGCT GCTTAGGGGT AAAGAACGAT TGCAGATAGA TTTTTTGGTG
TTGGAACAAG ATGCAGAGTT GAGGGATGAA ATGCAGATAG TGGGCCGTAT AAAGGACAGG
TGGGAAAGAA GGGAAGCGTG GAGAAAACTG TCCGGCAGAA TTGCCCTTGT TTCAGTTAGT
ATTTTTGCCC AACCAGGTTT CCACCCAGAA CAAATAGCAG ATGAATTCAT GGGGAGTTGG
TGGGTGGTTC GGGAGGGTTA TTATAACAGT AAACAGGGAT TATTGATTGA GGGAGAAACA
ATGATTTTAT AG
 
Protein sequence
MTEQLKSHPH LFLHQHIEQV NQATKAIRDW HTTDTITNDI KLLLGHLAKY HDLGKGTPAF 
QEYIENPEGY QGDPQEKAHS TLSLLLTLAI ARQQSWEELH TLVIAAAVAG HHSRLPTIPE
KKIGGVCCPQ WDIDGFAGGE KASLLKRQLA SIDFPALEQE TKVEFGSYGL AQALQSDPAK
SLREMKRFLI TRIYGIFASL SLEEALRLRM KAQLLYSVLL EADKALLAVS SPEVYLNREV
RHWQSRWVED KIGKPPETSI NQLRQRARQG VIAALEAKNT NLYSLTAPTG CGKTMLAATW
ALKLRERVTE GQAPPKIIIV LPFLSVIDQT AREYARLLSH SGQETDGRWL IQSHSLADRH
YARGLEDEDG RFFIDTWRSE IIITTYDQFL MSLLDPRAKY QMRFHNLCDA MIIMDEVQAL
PCKLWQTLEK VFQALASEGN SRLLLMSATL PPFMKEALPL LPDYQGYFTL FNRYTLQLRL
QESQTLDNFC EEMSDRLIGW LECSNRILIT LNTRHSARRV RDFLSQSWPA EYGDVPLFFI
SADVTPKDRL EIVKQIKQGK PCVVVSTQCI EAGVDIDMDR VIRDFGPLDS IIQIAGRCNR
EGLRAQGVVE VVDLINEQDK RYSEMIYDTT HLQITRKILA DKQEIQEKEI ITLSTQYFKD
LTEQKDTGYN HLIRFAKWQE DTPVKELLRG KERLQIDFLV LEQDAELRDE MQIVGRIKDR
WERREAWRKL SGRIALVSVS IFAQPGFHPE QIADEFMGSW WVVREGYYNS KQGLLIEGET
MIL