Gene Hmuk_2834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2834 
Symbol 
ID8412385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2715133 
End bp2716188 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content63% 
IMG OID645021179 
ProductCRISPR-associated protein, Csh2 family 
Protein accessionYP_003178646 
Protein GI257388873 
COG category[L] Replication, recombination and repair 
COG ID[COG3649] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR01595] CRISPR-associated protein, CT1132 family
[TIGR02590] CRISPR-associated protein, Csh2 family 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAAC ACTATCCCAC CGTTTCGAAC AGGTCCGAAA TCGTCTTCGC GTACGACGCG 
GTCGACGCGA ACCCCAACGG CAATCCGCTG AGCGGCGCGA ACCGACCGCG TATCGACCCC
CACACCGATC AGGCGATCGT CACCGACGTT CGCCTCAAGC GCTACCTCCG CGATCAGCTA
CAGGACGACG GTCACGGCGT CTACATTCGC AACGTCAAGG AAGACGACGG CGATCAGGCG
ACCCGCGAGG ACCTCCTCGA AGACCGTCTC AAGGACATCG ACCTCGACGA CGTGGACGAA
GCCGACATCG AAAACGCCGT CTTCGGTCAG TTCCTCGAAA ACAGCGCCGA CGTTCGCTAC
TTCGGCGCGA CGATGAGCAT CGATATGGAC GATGAGAAGG TCGACCACCT CCCGGATCAC
TTCACCGGTC CCGTCCAGTT CTCGCCGGGC AAGTCGCTCC ACCGAGTCAT GGAAAACGAG
GAGTACAACA GCCTCACAAG CGTCATCGCG ACCGGCGACG ACAAGGCACA GGGCGGGTTC
GATCTCGACG ACCACCGGAT CCAGTACGCG TTCATCGGGT TCCACGGACT CGTCGACGAG
CACGGGGCCG AAGGCACGCT CCTGACGGAT GGGGACGTGC GGCGACTGGA CACGCTGTGC
TGGCGCGCGC TGAAGAACCA GACGATCAGC CGGAGCAAGG TCGGACAGGA GCCCCGGCTC
TACCTCAGAG TCGAGTACGC CGACGAGAGC TTCCATCTCG GCGGGCTCGA TCAGGACATC
GATCTCGACA GTTCGGAATC CGCTCCCGTC GAGGAAATTC GCAACGTCCG AGACATCTGT
GTCGACGTGT CGGCGCTGCT CGAACGGCTC GACGCGGCGT CCGACCGGAT CGACACCGTC
CACGTCGTCG CCAGCGACGT TCTCGAACTC TCCGTCGACG GTGAGACGGG CGGTCCGGAG
TTCCTCTACG ACGCCCTCGA ATCGAGGGTC GGTAGCGAAT CCGTCCGCGA GATCGACGTG
TACGAGGACG CGAAGGCGAC GATGCCGGAG GAGTGA
 
Protein sequence
MSEHYPTVSN RSEIVFAYDA VDANPNGNPL SGANRPRIDP HTDQAIVTDV RLKRYLRDQL 
QDDGHGVYIR NVKEDDGDQA TREDLLEDRL KDIDLDDVDE ADIENAVFGQ FLENSADVRY
FGATMSIDMD DEKVDHLPDH FTGPVQFSPG KSLHRVMENE EYNSLTSVIA TGDDKAQGGF
DLDDHRIQYA FIGFHGLVDE HGAEGTLLTD GDVRRLDTLC WRALKNQTIS RSKVGQEPRL
YLRVEYADES FHLGGLDQDI DLDSSESAPV EEIRNVRDIC VDVSALLERL DAASDRIDTV
HVVASDVLEL SVDGETGGPE FLYDALESRV GSESVREIDV YEDAKATMPE E