Gene Hmuk_2833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2833 
Symbol 
ID8412384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2713035 
End bp2715140 
Gene Length2106 bp 
Protein Length701 aa 
Translation table11 
GC content64% 
IMG OID645021178 
ProductCRISPR-associated protein, Csh1 family 
Protein accessionYP_003178645 
Protein GI257388872 
COG category 
COG ID 
TIGRFAM ID[TIGR02591] CRISPR-associated protein, Csh1 family 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGATC CCGACGAGTT CTACGATGAG TATCCGCCCG AGAAGCTCGA AGACGAGCTC 
CCCGAGCGAC CGATCTCGTC GCTGCGGGAC ATCCAGCACC TCTACGGGCG ACTCTACACG
CTGGCGACGG CCGGCGGCGG CGACTACGCG GCCTATCTGA CGCCCGATCA GGCCAACGAT
CTGATCGGGA CAGAGGAGAG CCTGATCGCC GTCTGCGTCG ATCTGTCCGG AGAGCGGCCG
ACGCTCGACG CGGAGGAACC CGTCCGGGTG ACCCAGTACA CCGACGACCT GGTACAGCGG
GTCGCTCACT GCAAGTACAA CGCGGCGCGG GGGATCGACC ACAGCGTCAC GCACCGCTCC
GGACGCAACA GCGATCCCGA GAAACTGGCC CGGTACGCCT GCGAGCGGCT GACGAAGTGG
GCGACCGACG ACGTGGTCCA GAGCATCGCA GACGATCACC CCGACGGCGA CGTGATCCGC
GGACTGGCGA CCGTCGGCGA GGACGAGAAC GCCCTCGACA GGATTCGAGA GGCCGTCCAG
ACCGAGCTCG GCGGTTCGAC GACAGCGCTC CTCACCGTGC AGGTGCGCCG GGAGGAGGGC
GGCGAGTACG AGTGGCCGGG ACAGATTTCC GTGTTCGACG AAGCGATGCG TGCGCGGAAG
CTCTCGAAGC TCGTCTCGAA GGGGCAGGCG ACGAACTCTG CCGGCGAGGC GACGGATCTC
GTCAGCGGCG AACGGACGCG AACCGTCGGC ACCGCCGAGG ATCCGCTCAA CTACTTCCTG
GGCAAGCAAA TGGAGAAGTT CCCGGGGCTG GATCCCGACG AGGCCTGGCG GAGCCACCCC
ATCTCCGAGG ACGGCGCGGT CACGCTGATG AACGCCGAGG AGTTCGTCGA CGCCTGTTCC
TACCGGACGT TCAACGCCGA CGTGTATTAC CTCCCGTACT TCCTCGGACG ACCGACGCCC
GAAGAGACGT ACAAGCTGTA CGCGGCACTC CACAGAGTCA CACGGGACGA CGACATGAAT
CCGGTTCAGG CGCTCTACGA CGAGTTCGGC GACCACGAGG CGGCGCCCGA GGGGCGACTG
CGCTTCTACG TCGCGGCGGT GATGAAACAC CAGATGTCCC GATTCGACGT GTACGGCGAG
ACGCTGAACG GCCGGATCCA CTACCCGACC GAGGTCGGGC AGCGACACGA GCAGGTGACC
GGCAGCTGGG TCTTCGACGA GGACGACGCC AGGAACGGAG GTCGGACGCC GCCGATGCCC
TCCCACGAAG AGTGGGCGCT CACTAATTAC CAGCGGTTTC GAGAGATGGT CGCGTCGGGT
GCCTATCTCT ACGGCACCTT CCCGTTCACC GACGAGGACA CCGACGCGAC GGTCGACGAC
GAGCGAATCG ACGTTCACGT CTCGATCCTT GCCGGCGATC CCGTCCCGCG GAACCAACTG
GTCAGCGCCT ACGTCGATCG ATTGCTCGAC CGTGACGGTG ACGAAGTCCC CGAACTCCTG
ATCGCCTCCC AGTTCGCACA GCTGTGCGCG CTGGAAAACG CCGGCCTCGT GACGACGGAC
GCCGGCGAGA CCGAACCGAC GCTGATCGAC GGCCCCGATT ACGACACTAT GGAACCAGAA
CACGCGCGAA CCGACGGCGG CACTGCTGCC CTCGATCGCA CCGAGAAACT CGAATCGTTC
ATCGAACAGA CGGACGGACT GGACGACGCC GAACGGAAGG GAGCATTCCT GCTCGGCACG
CTCGTCGGCC AGGTCGGCAC CTACCAGCAG GGGTATCACG ATCGATCGAC GACCGTCATC
GACCAGTACC CGATCAAGTC GATGACCAAG ACGAAGATCA AGCGCATCAC CCAAGAGGTG
ATCGACAAAG ACGTAGTCTA CTCCCGGGAG ATGGCCAAGA AAGGGTCGGA GATCAATTCG
ACGATGTACA AGGAAGTTAC GAACGCCATC GTCGAGACCA TGGCCGAAAG CGATCCGTCG
GACTGGGAGA TCAGCACCGA CGATCTGCGC TTCTACTACG CCCTGGGACT GACCTACGGA
ATGAACGACC GATCGACTAA CGAGAACGAC GCTGCCGACG ACGAAACCTC AGAAACCAAT
GAGTGA
 
Protein sequence
MLDPDEFYDE YPPEKLEDEL PERPISSLRD IQHLYGRLYT LATAGGGDYA AYLTPDQAND 
LIGTEESLIA VCVDLSGERP TLDAEEPVRV TQYTDDLVQR VAHCKYNAAR GIDHSVTHRS
GRNSDPEKLA RYACERLTKW ATDDVVQSIA DDHPDGDVIR GLATVGEDEN ALDRIREAVQ
TELGGSTTAL LTVQVRREEG GEYEWPGQIS VFDEAMRARK LSKLVSKGQA TNSAGEATDL
VSGERTRTVG TAEDPLNYFL GKQMEKFPGL DPDEAWRSHP ISEDGAVTLM NAEEFVDACS
YRTFNADVYY LPYFLGRPTP EETYKLYAAL HRVTRDDDMN PVQALYDEFG DHEAAPEGRL
RFYVAAVMKH QMSRFDVYGE TLNGRIHYPT EVGQRHEQVT GSWVFDEDDA RNGGRTPPMP
SHEEWALTNY QRFREMVASG AYLYGTFPFT DEDTDATVDD ERIDVHVSIL AGDPVPRNQL
VSAYVDRLLD RDGDEVPELL IASQFAQLCA LENAGLVTTD AGETEPTLID GPDYDTMEPE
HARTDGGTAA LDRTEKLESF IEQTDGLDDA ERKGAFLLGT LVGQVGTYQQ GYHDRSTTVI
DQYPIKSMTK TKIKRITQEV IDKDVVYSRE MAKKGSEINS TMYKEVTNAI VETMAESDPS
DWEISTDDLR FYYALGLTYG MNDRSTNEND AADDETSETN E