Gene Hmuk_1242 details

Gene Information

Locus tag: Hmuk_1242
Symbol:
ID: 8410762
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1176251
End bp: 1177495
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 65%
IMG OID: 645019574
Product: cysteine desulfurase, SufS subfamily
Protein accession: YP_003177071
Protein GI: 257387298
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
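
The coordinate and length fields in this record agree with one another, and the arithmetic can be sketched as follows. This is an illustrative check, not part of the database record: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the CDS is assumed to end in a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1176251, 1177495)
print(length, expected_protein_length(length))  # prints "1245 414"
```

The result matches the Gene Length (1245 bp) and Protein Length (414 aa) fields.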


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.473312
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCACA GTGAGACAGA GGCCCTCGAC GTCGAGTCCA TCCGGGACGA GTTCCCGATC 
CTCGACCGGG AGTTCGACGG AACGCCGCTC GTGTACCTCG ACAACGCGGC GACGAGCCAG
ACGCCGGACC AGGTCATCGA CTCGATCAGT CACTACTACC GACACTACAA CGCCAACGTC
CACCGCGGCC TCCACCAGCT GAGTCAGGAG GCCTCGATCG CCTACGAGGA GGCCCACGAC
CGGGTCGCGG AGTTCATCGG GGCGAGCGGC GGGCGCAAAG AGGTCGTGTT CACGAAGAAC
ACGACGGAGT CGATGAACAC GGTCGCCTAC GCGTGGGGGC TCGCCGAGCT GGGCCCCGGC
GACGAGGTCG TCCTCACGGA GATGGAACAC CACGCCGCGC TCGTGACCTG GCAACAGATC
GCCAAGAAGA CCGGCGCGAC GGTGAAGTTC GTGGAGGTCG ACGAGGACGG ACGCCTCGAC
ATGGACCACG CTCGCGAGCT GATCACCGAC GACACCGAGA TGGTCAGCGT CGTCCACGTC
TCGAACACGC TGGGGACCGT CAACCCCGTC TCCGAGCTGG CCGACATCGC CCACGATCAC
GACTCGCTGA TCTTCGTCGA CGGGGCGCAG TCGGTGCCAC ACATGCCCGT CGACGTGGAG
GCCATCGACG CGGACTTCTT CGCCTTTTCG GGCCACAAGA TGTGTGGTCC GACCGGCGTC
GGCGTCCTCT ACGGCAAAGA GCACCTGCTC GACGCGATGC AGCCGTACCT CTACGGGGGA
ATGATGATCG AGAAGGTGAC CTTCGAGGAC TCGACGTGGC ACGAACTCCC CTGGAAGTTC
GAGGCCGGCA CGCCGGTCAT CAGCGAGGGG ATCGCACTCG CCGAGGCCTG TGACTACCTC
GATTCGATCG GGATGGAGCG CGTCCACCGC CACGGCACGG AACTGGCGGC GTACGCCTAC
GACCGCCTCC AGGAGTTCGA CGACATCGAG ATTCTGGGGC CGGGCGGCGA CGAACGGGCC
GCGCTGGTCT CGTTCGACAT GGACAACGTC CACGCCCACG ACGTTTCGGA GATTCTCAAC
GCCAACGGCG TCGCCGTCCG CGCGGGCGAT CACTGTACGC AGCCGCTACA CGACAAGCTC
GGCGTCTCCG CGTCGACGCG GGCTTCGTTC TACATCTACA ACACTCGCGA GGAAGTGGAC
GCGCTGGTCG ACGCACTCGA TCAGGCCCGC GAGCTGTTCG CGTAG
 
Protein sequence
MSHSETEALD VESIRDEFPI LDREFDGTPL VYLDNAATSQ TPDQVIDSIS HYYRHYNANV 
HRGLHQLSQE ASIAYEEAHD RVAEFIGASG GRKEVVFTKN TTESMNTVAY AWGLAELGPG
DEVVLTEMEH HAALVTWQQI AKKTGATVKF VEVDEDGRLD MDHARELITD DTEMVSVVHV
SNTLGTVNPV SELADIAHDH DSLIFVDGAQ SVPHMPVDVE AIDADFFAFS GHKMCGPTGV
GVLYGKEHLL DAMQPYLYGG MMIEKVTFED STWHELPWKF EAGTPVISEG IALAEACDYL
DSIGMERVHR HGTELAAYAY DRLQEFDDIE ILGPGGDERA ALVSFDMDNV HAHDVSEILN
ANGVAVRAGD HCTQPLHDKL GVSASTRASF YIYNTREEVD ALVDALDQAR ELFA
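
The gene and protein sequences are printed in blocks of ten characters, so they need the whitespace stripped before any analysis. The sketch below is a minimal, standard-library-only illustration of doing that, computing GC content, and translating the CDS. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is built from the standard code, whose codon assignments are the same as translation table 11; alternative start codons are not handled, which does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; "*" is stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence from this record.
first_row = "ATGAGTCACA GTGAGACAGA GGCCCTCGAC GTCGAGTCCA TCCGGGACGA GTTCCCGATC"
print(translate(first_row))  # prints "MSHSETEALDVESIRDEFPI"
```

The translated residues match the first two blocks of the protein sequence. Running `gc_content` and `translate` on the full gene sequence would allow the same comparison against the GC content and Protein Length fields.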