Gene Dgeo_2629 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_2629 
Symbol 
ID4073860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008010 
Strand
Start bp413824 
End bp416739 
Gene Length2916 bp 
Protein Length971 aa 
Translation table11 
GC content70% 
IMG OID641228846 
ProductCRISPR-associated helicase Cas3 family protein protein 
Protein accessionYP_594137 
Protein GI94972097 
COG category[R] General function prediction only 
COG ID[COG1203] Predicted helicases 
TIGRFAM ID[TIGR01587] CRISPR-associated helicase Cas3 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.174712 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAATGACG CGCGGAACGA GTACCTTTCC CACATGCCCC CCCTCTCCCC CATCACGGCG 
GCGGCGCGGA CCCTGTGGGC CAAGAGCGCG AAGAAGAACG CGGACGGCAC CCAGGGCGCC
TGGCTGCCCG TCCTGAACCA CCTGCTCGAC GTGGCGGCAT GCGCGGCGGA GATTCTGGCG
CTGGAGCCGC CGCAGACGCG GATGCTGTTT GAGGCCGACC TGGGCCTGGA GGGCGAGCAG
GCGCTGGCCT GGACGCTGGC GCTGGTGGCC CTACACGACC TGGGCAAGGC GAGTCCGGCT
TTTCAGGTGC TGTGGGCCGA GGGCAAGGAC GGGGTGGACC CGGCGCTGCG TTTTCCCCCC
GACCTGCGCA AGCCTTATGC GCCGCACGGG GTCGTGACCC AGGCGGTGCT GCCGGACTTT
CTGACGGGGC TGGGCTGGCC GAGACCATTG GCACGAGCGG TGGCGGATGC GGTGGGCTGT
CACCACGGCT TGCGAGCAGA GGACCGCGAC CTCAACCTCC CGCGGGCACA GACCGGGGAC
GCCCGCTGGG GGCAGGTGCG CCGGGAACTG TGCCGCCTCG TCGCCCTCGG CACCGGTGCC
CGCCACGACG CCGTGCCCAC CCTCTCCTCC TTCTCTCCCG CCGCCTTTAT GCGCCTCGCG
GGACTGACCA GCTTCGCGGA CTGGCTGGGC AGTTCCTTTC CGCTTCCGAC CCAGACGGAC
TTCTCGGCCT ACGAAGACCC GGCGGCCTAC TTCGGGCGGA CACGGGAACG GGCGCGGCAG
ACGTTGGCGG AGATTCGCTG GCCCGCATTC GCGCCGCTGC GGGAGGAGTT ACCTGCTCTT
CCAGAAGTGT TCCGTTACGT AGTGAAAGAC GGGGCTTTCC AACCCCGCCC CCTTCAGACC
AAATTGGCGC AGGCTTTAGA AGGTGTCAGC GGCCCGGCGC TCGTGCTGGT GGAGGCCCCG
ATGGGGGAAG GCAAGACCGA GGCCGCCTTT TACGCGCATC TTCAGCTTCA GCGGGCGGCG
GGGCACCGGG GGATGTACGT GGCGCTGCCC ACCCAAGCGA CCGGGAACGC GATGTATGAG
CGCTTCGCCC AGTTTCTGGC CGCGCAGGGC CGCCCAACGC CGCCCGACCT GCAACTCGCG
CACGGGGGCA CGCTGCTGAA TGAGAAGTTC CAGGCCACCA TCGAGCGCAC CCGCAACGCG
CCCGGCGACC CCGCCGAGGC GGGCGGCTAC AGCGTCCGCG CCGAGGAATG GTTTACGAAC
CGCAAGCGGG CGCTCCTCTC CGAGTACGGC GTGGGTACGG TGGACCAGGC GCTGCTGGGG
GTGCTGGGGG TGCCTCACCA GTTCGTGCGG CTGTGGGGGC TGGGCAACCG GGTGGTCGTG
CTCGACGAGG TGCACGCCTA CGACACCTAC ACTTCCGAGC TGATCGCCGC CCTCGTCGCG
TGGCTGCGGG CGTTGGGGTC CAGCGTGGTC CTCATGAGCG CGACCCTGCC GGAGGCGAGT
CGCCGCGCCC TGCTGCGGGC CTGGGGGATG GAGGACGCGC CTACGGCGGA CTACCCGCGC
CTCACGGTGG CTCCGGTGGG GGGCAAAGTG CAGACCCTCA CCATCCCCGA CCAGGACGAG
GTCGGCAACA CCAGCCGCCC CCGCCAGCAC GTCACCCTGC GGCCCCTGGG AAGCGAGGTG
GGAGAGGTGG CCCGGCACGC AGTGGACCTT GCGGCGGACG GCGGGTGCGT GGCCGTCATC
GTGAACACGG TGGCGCGGGC ACAAGCGGTG CAGACGGAGG TGCTGGCGGA ACTGGAGCGG
CGGGGCATCA CGGCCAGAAC CTGTACGCGG GGCGGCGGTG GGGATCCCCG CGCCGTCGGG
GTGCTGCTCT ACCATGCCCG CTATCCAGCA GATGAGCGCG GGGAGCGAGA AAAGCGGGTG
CTGCGGTATC TCGGCAAGGA GGGGCAACGC CCCACACGCT TCATCCTGAT TGCCACCCAG
GTGGCCGAGC AGAGCCTCGA CTTTGACGCG GACGTCATGC TGACCGACCT CGCCCCGGTG
GACCTGGTGC TGCAACGAGC CGGGCGGCTG CACCGCCACG CGGAGAATGC GGGCAAGCGG
CGCGGCCACG AGGAGGCGGT GTTGTTCGTC TCCGGGCTGG ACGAGTGGCC GGACGAGAGC
CTGAAACGCG AGTTCTGGGG CCGGGTCTAC GCGCCCGCGC TGCTCTACCG CTCGTGGCTG
ACGCTGCGGC GGCGACTGGC GGGGGGGCTG ACCCTCCCCG ACGACCTCGA CGAGCTGGTG
CAGGAGGTCT ACGCCCCTGA GTTCGCCGCG CCGGAGCTGA CGCCGGAACA GCGCGAGCAG
CTCGCGGCAG CCGAGGCCGT GCTCGACAAC CAGCGCGGCA ACGAGGCCAC GACCGGCAGC
TTCGCTCACA TTGGCTGCCC CGCCGACTTC TGGAGCACGC GCCTGCACCA CCACCCCGAC
GCTGACCCCG ACAGCGAGAG CGTGAGCGAC GATCCCACTG CCGTGGGCGC CGAGGAGGAC
GATTTTCCCC GCACCCGCCT GGGCGAGGAG AGCGTGCGGA TAGTGCCGGT GGAGCGCCGC
GAAGATGGTG TCTGGCGAGT CTGCCGCTCG CCCTTTTGGG CAAGGGACGC CGAGGATGTG
CTCCCCCGAT TCGGCACGCT GGACAAGCGC GACACCGAGC ACGCTCGCAA GATTTACCGC
CGCTCGCTGG GCGTCTCGCG CCCAGAACTC GTGCGGGCCG CCGAGCAAGG CACTCTCTGG
GAAGGCTCGC TGGGCACCGA GCACCGGGGT TGGCGGGCGC ATCCGCTCCT GCGCGACGCC
GTGCCCCTGG TGTTCACGAG CGGGCTGGCT GAGGTGGAAG GCCTCCGGGT ACGGCTGGAC
CCGGAACAGG GCCTGGTCTA TCTGGGCACA GCATAG
 
Protein sequence
MNDARNEYLS HMPPLSPITA AARTLWAKSA KKNADGTQGA WLPVLNHLLD VAACAAEILA 
LEPPQTRMLF EADLGLEGEQ ALAWTLALVA LHDLGKASPA FQVLWAEGKD GVDPALRFPP
DLRKPYAPHG VVTQAVLPDF LTGLGWPRPL ARAVADAVGC HHGLRAEDRD LNLPRAQTGD
ARWGQVRREL CRLVALGTGA RHDAVPTLSS FSPAAFMRLA GLTSFADWLG SSFPLPTQTD
FSAYEDPAAY FGRTRERARQ TLAEIRWPAF APLREELPAL PEVFRYVVKD GAFQPRPLQT
KLAQALEGVS GPALVLVEAP MGEGKTEAAF YAHLQLQRAA GHRGMYVALP TQATGNAMYE
RFAQFLAAQG RPTPPDLQLA HGGTLLNEKF QATIERTRNA PGDPAEAGGY SVRAEEWFTN
RKRALLSEYG VGTVDQALLG VLGVPHQFVR LWGLGNRVVV LDEVHAYDTY TSELIAALVA
WLRALGSSVV LMSATLPEAS RRALLRAWGM EDAPTADYPR LTVAPVGGKV QTLTIPDQDE
VGNTSRPRQH VTLRPLGSEV GEVARHAVDL AADGGCVAVI VNTVARAQAV QTEVLAELER
RGITARTCTR GGGGDPRAVG VLLYHARYPA DERGEREKRV LRYLGKEGQR PTRFILIATQ
VAEQSLDFDA DVMLTDLAPV DLVLQRAGRL HRHAENAGKR RGHEEAVLFV SGLDEWPDES
LKREFWGRVY APALLYRSWL TLRRRLAGGL TLPDDLDELV QEVYAPEFAA PELTPEQREQ
LAAAEAVLDN QRGNEATTGS FAHIGCPADF WSTRLHHHPD ADPDSESVSD DPTAVGAEED
DFPRTRLGEE SVRIVPVERR EDGVWRVCRS PFWARDAEDV LPRFGTLDKR DTEHARKIYR
RSLGVSRPEL VRAAEQGTLW EGSLGTEHRG WRAHPLLRDA VPLVFTSGLA EVEGLRVRLD
PEQGLVYLGT A