Gene Dgeo_0965 details


Gene Information

Locus tag: Dgeo_0965
Symbol:
ID: 4058662
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008025
Strand:
Start bp: 1032364
End bp: 1033572
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 70%
IMG OID: 641229983
Product: CRISPR-associated Cmr1 family protein
Protein accession: YP_604434
Protein GI: 94985070
COG category: [L] Replication, recombination and repair
COG ID: [COG1367] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily)
TIGRFAM ID: [TIGR01894] CRISPR-associated RAMP protein, Cmr1 family
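
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


# Values taken from the Gene Information fields above.
length = gene_length_bp(1032364, 1033572)
print(length, protein_length_aa(length))  # prints "1209 402"
```

Both results agree with the listed Gene Length (1209 bp) and Protein Length (402 aa).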


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.364827
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCAGAA CCCCACCCCC CCTCCCCACC GACCTGCCCG ACCTCTCTCC TCCCGAACAG 
CTCACCGTCC AGTTGCGGAC CATCACTCCC ATGTTCGGTG GGAGTGCCGA GACCCGCGAG
GTGGACGAGC GTCACCCCGT GCGTGCGGCC AGCGTGCGCG GCCACCTGCG GTTTTGGTGG
CGGGCCACCG CTGGGGCCGG GTACGCCACA GCAAAGGAGC TGCACAAAGC CGAGTCGGGG
CTGTGGGGCA ACACCGAGAA GCCGGGGCGG GTGCGGGTGG AGGTGGAGGT GACGGAGCGG
GGCACGCGCG TTTACCCCTC CGAACTGAAC AAGGGGGGCG ACAGCCCCGC CAAGACCGGG
CCTCGCGAAG CGTACTTCGT GCATCCTTTT CAGGAGATTC GCTCCGAAAA CAAGCCGGAA
ACCTTCGGCC TCAGGGACGT CGCCTTCACC GTAACCCTCA CCCTGCATAG GCTGAGCGCC
GCCGAGCGTG AGCAGGTCAT CACGGCGCTG CGGGCCTGGA TTGCCTTTGG GGGCGTGGGT
GCCCGCACCC GCCGGGGTTG CGGTGCCCTC ACCGTCGCCG GGGACGCGGC TCAGTGGCTG
CCCGCCCAGC CCGCCGACTT GTGGGCGTGG TTTGGCCGCG AGGAACGCGC GGTGCCCACG
CCGCAACACA GTGTTCTGGC TGGAGCCAAA GCGCTGCTCG GCCCGGTGGG TGCCGACCCG
GTGAAGGGGG TATGGCGCGA CCTAGGCCGC TTCTGGGCAC GGTTTCGCAA GGGGCACTAC
ACCGAGCGCC GCCCTGCTTA CAGCCCCATG AGCGGCGGCG CGTGGCGCGA CCACCGCACC
CTTCAGGCGG GGCTGCGGCG CGACGAACCC GCTCGGCTCG CCAAGCCCTT TCTCGGCCTG
CCCATCGTGT ACCAGAAGTT TCCCAAGACG GACGCCTTCG CGGGCACCAT CGAGGGCGCG
CAGGAGGGCA AAAAGCGCAT GGCGTCCCCG GTCATCCTCA AGCCCTGCGC CTTTCGCGAC
GGGGTGCGGG GGCTGGTGCT CGTCCTGAAC GCGCCCCCCC CGCGCCAGGT CAAGGTGTCG
GGACAGCCGC ATCCCCTGGA GATTCCGCCC CATGACCCCG TGCTGGCAGC GCTGGGGGTG
CGCGGCCCGC TGGCTGCCGT TCGCGCCGCT GCCAAGGTGG ACGGCTACTC CGAGGAGGTC
TCCCTGTGA
 
Protein sequence
MPRTPPPLPT DLPDLSPPEQ LTVQLRTITP MFGGSAETRE VDERHPVRAA SVRGHLRFWW 
RATAGAGYAT AKELHKAESG LWGNTEKPGR VRVEVEVTER GTRVYPSELN KGGDSPAKTG
PREAYFVHPF QEIRSENKPE TFGLRDVAFT VTLTLHRLSA AEREQVITAL RAWIAFGGVG
ARTRRGCGAL TVAGDAAQWL PAQPADLWAW FGREERAVPT PQHSVLAGAK ALLGPVGADP
VKGVWRDLGR FWARFRKGHY TERRPAYSPM SGGAWRDHRT LQAGLRRDEP ARLAKPFLGL
PIVYQKFPKT DAFAGTIEGA QEGKKRMASP VILKPCAFRD GVRGLVLVLN APPPRQVKVS
GQPHPLEIPP HDPVLAALGV RGPLAAVRAA AKVDGYSEEV SL
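
As a rough way to cross-check the two sequences, the gene sequence can be translated and its GC percentage computed. This is a minimal sketch in plain Python, not the pipeline that produced this record. It assumes the gene sequence is given on the coding strand, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon ordering; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above, in the same blocked layout.
print(translate("ATGCCCAGAA CCCCACCCCC CCTCCCCACC"))  # prints "MPRTPPPLPT"
```

Passing the full gene sequence to `translate` should reproduce the protein sequence with its spacing removed, and `gc_content` on the same input can be compared against the GC content field.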