Gene Information |
Locus tag | Dgeo_2629 |
Symbol | |
ID | 4073860 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Deinococcus geothermalis DSM 11300 |
Kingdom | Bacteria |
Replicon accession | NC_008010 |
Strand | + |
Start bp | 413824 |
End bp | 416739 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641228846 |
Product | CRISPR-associated helicase Cas3 family protein |
Protein accession | YP_594137 |
Protein GI | 94972097 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3 |
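The coordinate and length fields above can be cross-checked against each other. The sketch below is only a consistency check on the values copied from this record; it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 413824
end_bp = 416739
reported_gene_length_bp = 2916
reported_protein_length_aa = 971

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
codon_count = span_bp // 3
expected_protein_length_aa = codon_count - 1

print(span_bp)                      # 2916
print(span_bp % 3)                  # 0, the CDS is a whole number of codons
print(expected_protein_length_aa)   # 971
```

Under these assumptions the computed span matches the reported Gene Length and the codon count minus one matches the reported Protein Length.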
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.174712 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | TTGAATGACG CGCGGAACGA GTACCTTTCC CACATGCCCC CCCTCTCCCC CATCACGGCG GCGGCGCGGA CCCTGTGGGC CAAGAGCGCG AAGAAGAACG CGGACGGCAC CCAGGGCGCC TGGCTGCCCG TCCTGAACCA CCTGCTCGAC GTGGCGGCAT GCGCGGCGGA GATTCTGGCG CTGGAGCCGC CGCAGACGCG GATGCTGTTT GAGGCCGACC TGGGCCTGGA GGGCGAGCAG GCGCTGGCCT GGACGCTGGC GCTGGTGGCC CTACACGACC TGGGCAAGGC GAGTCCGGCT TTTCAGGTGC TGTGGGCCGA GGGCAAGGAC GGGGTGGACC CGGCGCTGCG TTTTCCCCCC GACCTGCGCA AGCCTTATGC GCCGCACGGG GTCGTGACCC AGGCGGTGCT GCCGGACTTT CTGACGGGGC TGGGCTGGCC GAGACCATTG GCACGAGCGG TGGCGGATGC GGTGGGCTGT CACCACGGCT TGCGAGCAGA GGACCGCGAC CTCAACCTCC CGCGGGCACA GACCGGGGAC GCCCGCTGGG GGCAGGTGCG CCGGGAACTG TGCCGCCTCG TCGCCCTCGG CACCGGTGCC CGCCACGACG CCGTGCCCAC CCTCTCCTCC TTCTCTCCCG CCGCCTTTAT GCGCCTCGCG GGACTGACCA GCTTCGCGGA CTGGCTGGGC AGTTCCTTTC CGCTTCCGAC CCAGACGGAC TTCTCGGCCT ACGAAGACCC GGCGGCCTAC TTCGGGCGGA CACGGGAACG GGCGCGGCAG ACGTTGGCGG AGATTCGCTG GCCCGCATTC GCGCCGCTGC GGGAGGAGTT ACCTGCTCTT CCAGAAGTGT TCCGTTACGT AGTGAAAGAC GGGGCTTTCC AACCCCGCCC CCTTCAGACC AAATTGGCGC AGGCTTTAGA AGGTGTCAGC GGCCCGGCGC TCGTGCTGGT GGAGGCCCCG ATGGGGGAAG GCAAGACCGA GGCCGCCTTT TACGCGCATC TTCAGCTTCA GCGGGCGGCG GGGCACCGGG GGATGTACGT GGCGCTGCCC ACCCAAGCGA CCGGGAACGC GATGTATGAG CGCTTCGCCC AGTTTCTGGC CGCGCAGGGC CGCCCAACGC CGCCCGACCT GCAACTCGCG CACGGGGGCA CGCTGCTGAA TGAGAAGTTC CAGGCCACCA TCGAGCGCAC CCGCAACGCG CCCGGCGACC CCGCCGAGGC GGGCGGCTAC AGCGTCCGCG CCGAGGAATG GTTTACGAAC CGCAAGCGGG CGCTCCTCTC CGAGTACGGC GTGGGTACGG TGGACCAGGC GCTGCTGGGG GTGCTGGGGG TGCCTCACCA GTTCGTGCGG CTGTGGGGGC TGGGCAACCG GGTGGTCGTG CTCGACGAGG TGCACGCCTA CGACACCTAC ACTTCCGAGC TGATCGCCGC CCTCGTCGCG TGGCTGCGGG CGTTGGGGTC CAGCGTGGTC CTCATGAGCG CGACCCTGCC GGAGGCGAGT CGCCGCGCCC TGCTGCGGGC CTGGGGGATG GAGGACGCGC CTACGGCGGA CTACCCGCGC CTCACGGTGG CTCCGGTGGG GGGCAAAGTG CAGACCCTCA CCATCCCCGA CCAGGACGAG GTCGGCAACA CCAGCCGCCC CCGCCAGCAC GTCACCCTGC GGCCCCTGGG AAGCGAGGTG GGAGAGGTGG CCCGGCACGC AGTGGACCTT GCGGCGGACG GCGGGTGCGT GGCCGTCATC GTGAACACGG TGGCGCGGGC ACAAGCGGTG CAGACGGAGG TGCTGGCGGA ACTGGAGCGG 
CGGGGCATCA CGGCCAGAAC CTGTACGCGG GGCGGCGGTG GGGATCCCCG CGCCGTCGGG GTGCTGCTCT ACCATGCCCG CTATCCAGCA GATGAGCGCG GGGAGCGAGA AAAGCGGGTG CTGCGGTATC TCGGCAAGGA GGGGCAACGC CCCACACGCT TCATCCTGAT TGCCACCCAG GTGGCCGAGC AGAGCCTCGA CTTTGACGCG GACGTCATGC TGACCGACCT CGCCCCGGTG GACCTGGTGC TGCAACGAGC CGGGCGGCTG CACCGCCACG CGGAGAATGC GGGCAAGCGG CGCGGCCACG AGGAGGCGGT GTTGTTCGTC TCCGGGCTGG ACGAGTGGCC GGACGAGAGC CTGAAACGCG AGTTCTGGGG CCGGGTCTAC GCGCCCGCGC TGCTCTACCG CTCGTGGCTG ACGCTGCGGC GGCGACTGGC GGGGGGGCTG ACCCTCCCCG ACGACCTCGA CGAGCTGGTG CAGGAGGTCT ACGCCCCTGA GTTCGCCGCG CCGGAGCTGA CGCCGGAACA GCGCGAGCAG CTCGCGGCAG CCGAGGCCGT GCTCGACAAC CAGCGCGGCA ACGAGGCCAC GACCGGCAGC TTCGCTCACA TTGGCTGCCC CGCCGACTTC TGGAGCACGC GCCTGCACCA CCACCCCGAC GCTGACCCCG ACAGCGAGAG CGTGAGCGAC GATCCCACTG CCGTGGGCGC CGAGGAGGAC GATTTTCCCC GCACCCGCCT GGGCGAGGAG AGCGTGCGGA TAGTGCCGGT GGAGCGCCGC GAAGATGGTG TCTGGCGAGT CTGCCGCTCG CCCTTTTGGG CAAGGGACGC CGAGGATGTG CTCCCCCGAT TCGGCACGCT GGACAAGCGC GACACCGAGC ACGCTCGCAA GATTTACCGC CGCTCGCTGG GCGTCTCGCG CCCAGAACTC GTGCGGGCCG CCGAGCAAGG CACTCTCTGG GAAGGCTCGC TGGGCACCGA GCACCGGGGT TGGCGGGCGC ATCCGCTCCT GCGCGACGCC GTGCCCCTGG TGTTCACGAG CGGGCTGGCT GAGGTGGAAG GCCTCCGGGT ACGGCTGGAC CCGGAACAGG GCCTGGTCTA TCTGGGCACA GCATAG
Protein sequence | MNDARNEYLS HMPPLSPITA AARTLWAKSA KKNADGTQGA WLPVLNHLLD VAACAAEILA LEPPQTRMLF EADLGLEGEQ ALAWTLALVA LHDLGKASPA FQVLWAEGKD GVDPALRFPP DLRKPYAPHG VVTQAVLPDF LTGLGWPRPL ARAVADAVGC HHGLRAEDRD LNLPRAQTGD ARWGQVRREL CRLVALGTGA RHDAVPTLSS FSPAAFMRLA GLTSFADWLG SSFPLPTQTD FSAYEDPAAY FGRTRERARQ TLAEIRWPAF APLREELPAL PEVFRYVVKD GAFQPRPLQT KLAQALEGVS GPALVLVEAP MGEGKTEAAF YAHLQLQRAA GHRGMYVALP TQATGNAMYE RFAQFLAAQG RPTPPDLQLA HGGTLLNEKF QATIERTRNA PGDPAEAGGY SVRAEEWFTN RKRALLSEYG VGTVDQALLG VLGVPHQFVR LWGLGNRVVV LDEVHAYDTY TSELIAALVA WLRALGSSVV LMSATLPEAS RRALLRAWGM EDAPTADYPR LTVAPVGGKV QTLTIPDQDE VGNTSRPRQH VTLRPLGSEV GEVARHAVDL AADGGCVAVI VNTVARAQAV QTEVLAELER RGITARTCTR GGGGDPRAVG VLLYHARYPA DERGEREKRV LRYLGKEGQR PTRFILIATQ VAEQSLDFDA DVMLTDLAPV DLVLQRAGRL HRHAENAGKR RGHEEAVLFV SGLDEWPDES LKREFWGRVY APALLYRSWL TLRRRLAGGL TLPDDLDELV QEVYAPEFAA PELTPEQREQ LAAAEAVLDN QRGNEATTGS FAHIGCPADF WSTRLHHHPD ADPDSESVSD DPTAVGAEED DFPRTRLGEE SVRIVPVERR EDGVWRVCRS PFWARDAEDV LPRFGTLDKR DTEHARKIYR RSLGVSRPEL VRAAEQGTLW EGSLGTEHRG WRAHPLLRDA VPLVFTSGLA EVEGLRVRLD PEQGLVYLGT A
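The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, where TTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 bases of the gene sequence only. The `translate` helper and the `START_CODONS` set are hypothetical names introduced here, and the set lists only three common table 11 start codons rather than the complete list.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); table 11 uses the
# same codon-to-amino-acid assignments for elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a subset of the table 11 initiation codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading a recognised first codon as M and stopping at a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is translated as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "TTGAATGACG CGCGGAACGA GTACCTTTCC"
print(translate(first_blocks))  # MNDARNEYLS
```

The output matches the first block of the protein sequence. Without the start-codon rule the first residue would be read as L, which is the ordinary meaning of TTG during elongation.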