Gene Dgeo_0234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_0234 
Symbol 
ID4059142 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp219242 
End bp220270 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content65% 
IMG OID641229234 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_603706 
Protein GI94984342 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGACAAC TTCTCAACAC CCTCTACATC CAGACCCAGG GCACCTACCT GCACCTTGAC 
ACCGACAACA TCCGGGTCGA GGTGGAGCGA ACAAAGAAGG CGATGCTGCC CCTGCACCAC
ATCGAGGGCG TGGTGGTGTT CGGCAACGTG CTGCTCTCGC CCTTTCTGAT TCACCGCCTC
GCCCGCGAGC ACAAGCCGGT CACCTGGCTG AGCGAACACG GGCGCTTCAT GGCCCGCACC
GAAACGCCGA TGAGCGGAAA CGTCCTCCTG CGAACGGCCC AGCACGCCTG CGCAGGGAAT
GCCGCAAGAA CGCTGGCCAT CGCCCGCCTG ATTGCCGCTG GGAAGCTCCA GAATCAGAAA
GTCACCCTGC TGCGCGCGGC TCGCGAAGCA GAAGCCGACG ACGCCGCGCT GCTGCGCCAA
GCCGCCCGTG ACATCAACGT CCAGATCGCC TGCCTCCCCC TGACCGAGAC GGTGGACGAG
GTCCGCGGCA CCGAGGGCAC CGCCGCTCGC CTCTACTGGG AGGTCTTCCC GCTCATGCTG
CGGCAAAACC GCGATTTCTT CTGGCTCTCG GAGCGCCACC GCCGCCCGGC CCGCGACCCC
ATCAATGCCC TGCTGAACTT CGTGTACACC GTGCTGGCCA ATGACTGTGC CTCAGCGTGT
CAGGCGGTGG GTCTTGACCC GCAGCTCGGC TTCCTGCACG CCCTGCGTCC GGGCAGAAGC
AGCCTGGCCC TCGACCTCAT GGAAGAACTG CGCCCCGTCA TCGCCGACCG TGCCATCCTC
ACCCTGATTA ACCGCCAGCA GCTCACCCCT CGCGACTTTG TGCTGCATGA GGGCGGCACT
GTCAGCATCA CCGAAGAGGG ACGCAAAACC ATTCTGGCGC ATCTGGCTGA ACGCCGCCGG
GAGGAAGTCA TGCACCCCCT CACCGCCCGC AAAACTCCGC TGGGGCTGCT GTCACACGTT
CAGGCTCGTC TGCTTGCCCA GCACCTCCGC GGTGACCGCC CCCATTACCC CCCCTACCTG
CACCGATGA
 
Protein sequence
MRQLLNTLYI QTQGTYLHLD TDNIRVEVER TKKAMLPLHH IEGVVVFGNV LLSPFLIHRL 
AREHKPVTWL SEHGRFMART ETPMSGNVLL RTAQHACAGN AARTLAIARL IAAGKLQNQK
VTLLRAAREA EADDAALLRQ AARDINVQIA CLPLTETVDE VRGTEGTAAR LYWEVFPLML
RQNRDFFWLS ERHRRPARDP INALLNFVYT VLANDCASAC QAVGLDPQLG FLHALRPGRS
SLALDLMEEL RPVIADRAIL TLINRQQLTP RDFVLHEGGT VSITEEGRKT ILAHLAERRR
EEVMHPLTAR KTPLGLLSHV QARLLAQHLR GDRPHYPPYL HR