Gene Hmuk_2838 details


Gene Information

Locus tag: Hmuk_2838
Symbol:
ID: 8412389
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2720110
End bp: 2721102
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 61%
IMG OID: 645021183
Product: CRISPR-associated protein Cas1
Protein accession: YP_003178650
Protein GI: 257388877
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype
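
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes one terminal stop codon; the function names gene_length and protein_length are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming an unspliced CDS whose
    final codon is a stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2720110, 2721102)
print(length_bp)                  # 993
print(protein_length(length_bp))  # 330
```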


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.150798
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGACA ACTACCACGT CTTTTCCGAC GGACGCATCG AACGCCACGA CGACACGGTA 
CGGGTCATCA CCGACGACGG CGAGAAAAAA TACCTCCCGG TCGAGAACGC CGAGGCGATC
TTCCTCCACG GTCAGATCGA GTACAACACC CGCTTCGTCT CCTTTCTCAA TCAGGAAGGC
GTCGCCGTAC ACGTCTTCGG CTGGCACGAT CACTACGCCG GGTCGATCAT GCCCAAGCGG
GGCCAAACGT CCGGACAGAC ACTCGTCGAC CAGGTCCGGG CCTACGACGA TCCGGCCCAC
CGGCTCGAAC TGGCTCAGGC GTTCGTCGAC GGCAGCATCC ACAACATGCG TGCGAACGTC
ACGTACTACG ACGGCCGAGG ACACGACTTC GAGGACGTGC TGGCAGAGCT GACCGAAGCC
CGGTCGTCAC TCGACAGGAT GGAGACGATC GACGAGACGA TGGGCGTCGA AGCACGCGCC
CGAAAGGCGT ACTACTCGAC CTTCGACGAG ATCCTGCCCG ACGAGTTCGT CTTCGGCGGC
CGCCAGTACG ATCCGCCGAA CAACGAAGTC AACAGCCTCA TCTCTTTCGG CAATTCGCTC
GTCTACGCCA ACGTCGTCTC GGCCATCCGA GCGACGGCAC TCGATCCCAC GGTCAGCTTC
CTCCACGAGC CCGGCGAGCG TCGGTACTCG CTGGCCCTGG ACATCGCCGA CCTGTTCAAA
CCGTTGCTCG CGGATCGAGT CATCTTCAGA CTCGTCAACC GCGGCCAGCT GACCAGCGAC
GATTTCGAGG CCGAGATGAA CGCCTGCCTG CTGAACGAGC ACGGCCGGAA GACCTACTCG
AAGGCCTACG AAGAGACGCT CGACGAGACG ATCGAGCACC CGGATCTGGG AAAGAAGGTG
AGCTATCAGT ATCTCCTCCG AGTCGAGGTG TACAAGCTCA AAAAACATCT CCTGACCGGC
GAGGAGTACG TCCCGTTCCA ACGGTGGTGG TGA
 
Protein sequence
MNDNYHVFSD GRIERHDDTV RVITDDGEKK YLPVENAEAI FLHGQIEYNT RFVSFLNQEG 
VAVHVFGWHD HYAGSIMPKR GQTSGQTLVD QVRAYDDPAH RLELAQAFVD GSIHNMRANV
TYYDGRGHDF EDVLAELTEA RSSLDRMETI DETMGVEARA RKAYYSTFDE ILPDEFVFGG
RQYDPPNNEV NSLISFGNSL VYANVVSAIR ATALDPTVSF LHEPGERRYS LALDIADLFK
PLLADRVIFR LVNRGQLTSD DFEAEMNACL LNEHGRKTYS KAYEETLDET IEHPDLGKKV
SYQYLLRVEV YKLKKHLLTG EEYVPFQRWW
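
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names clean, translate and gc_fraction are illustrative. Only the first and last lines of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Standard codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence in this record.
first_line = "ATGAACGACA ACTACCACGT CTTTTCCGAC GGACGCATCG AACGCCACGA CGACACGGTA"
last_line = "GAGGAGTACG TCCCGTTCCA ACGGTGGTGG TGA"

print(translate(first_line))  # MNDNYHVFSDGRIERHDDTV
print(translate(last_line))   # EEYVPFQRWW
```

Both fragments match the corresponding ends of the protein sequence above. Passing the full gene sequence to translate and gc_fraction would allow the same comparison against the Protein Length and GC content fields.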