Gene Hmuk_2234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2234 
Symbol 
ID8411774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2151497 
End bp2153539 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content64% 
IMG OID645020577 
Productstage V sporulation protein R-like protein 
Protein accessionYP_003178054 
Protein GI257388281 
COG category[S] Function unknown 
COG ID[COG2719] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.43304 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.276408 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACCG ACGACCACAT CCGCAAACGG CGCGTCGCCG CCGAACTCGA AACGTCAGTC 
GACGAGGCGG CGAATCTGGC GAAGAAACTC GGTCTGACCC CCTACCCGGT GAACTACTGG
ATCGTCGACT ACGACGAGAT GAACGAGCTG ATCGCCTACG GGGGATTCCA GCAGCGGTAC
CCCCACTGGC GGTGGGGCAT GGCCTACGAC CGCCAGCAAA AGCAGGGGCA GTTCCTCGGC
GGGAAGGCCT TCGAGATCGT CAACAACGAC AACCCGGCCC ACGCCTTCCT CCAGGAGTCC
AACGAGCTGG CCGACCAGAA GGCCGTCATC ACCCACGTCG AGGCCCACGC GGACTTCTTC
GCGAACAACG ACTGGTTCGG ACTGTTCGCG GGCGGAGCGG CCCCGGGGGC CGCTGACAGC
GAAGACGGCC AGCGCCGTCT GAGCAATCCC GAGGCCGCGG CGATGCTGGG ACGCCACGCC
GAGACGATCG AGGAGTACAT GCAAGACCCC GAGATCGACC GCGCCGAGGT CGAGAAGTGG
ATCGACCACG TCCTCTGTCT GGAGGACAAC ATCGACCAGC ACCAGCCCTA CGCGCCCGTC
GACGGCGAGG ACCGCCCCGA CGAGGCGGAC CTCGAAGAGA TAGCGGACCA GCTCGGCGAT
CTCGAACTCT CGGAAGAGGT CCGCAGACAG GTCTTCACTG AAGAGTGGCT CGACGCCCAG
AGCGAGGACG ACGAGCCGAT CACGTTCCCC GAAGAGCCAC AGAAAGACGT GCTCGCCTTC
CTCCAGACAC ACGGCATGCA GTACGACGCC GACGGCGAGA AGGCCGTCGA GATGGCTGAC
TGGCAGTCGG ACGTACTGGA GATGCTCCGC CGGGAGGCGT ACTACTTCGC TCCCCAGAAA
ATGACAAAGG TGATGAACGA AGGATGGGCA AGCTATTGGG AATCCGTCAT GATGGCCGGA
GAGCAGTTCG CGAGCGCCGA CGAGTTCGTC CTCTACGCCG ACCACATGTC GAAGGTGCTC
GGGTCGGGCG GGCTCAACCC CTACAAGCTC GGGCTCGAAC TGTGGACGTA TCTCGAAAAC
AGCGAGAACC GCCGGGAAGT CGTCGAGCGA CTGCTTCGCG TCGAGGACGT GACCTGGCGG
AACTTCCACG ACGTGATCGA CTTCGAGCGG GTACAGGACC TGATCGCGCC CGATTCGGCG
GTGACCGACG TGCCCGCCAG CCTCGACGAT CTCGACCCCG ACGATCCGCG AGTCGACGCC
GACGCGCTCG CTCGCGCCCG CGACGGAGAG ATCGACGTCG AGACCTACCC GTGGAAGGTC
CTGACCGAAG CGGGGATGGC CGAACGCCAC TACTCGCTGG TCAAGCCACA GTACCGCGGA
TTCGTCTCGC GCATCAGCCA GTCGGAGCTG GAGCGCATCT CGCGGTACAT GTTCGACGAC
GCCCGGTACG AGAGCGTGGC CGACGCCCTC GCGGATGTGG ACTACTCGCG GGGCTGGGAC
CGGATGCGCG AGGTCCGCGA GAGCCACAAC GACGTGACCT TCCTCGACGA GTTCCTCACC
CAGGAGTTCG TCGACGAACA CGACTACTTC ACCTACGAGT ACACCCACTC CTCGGGCAAC
TTCCGTGCCA CTTCGACGGC CGCCGAAGAC GTGAAAAAGA AGCTGATGTT GCAGTTCACC
AACTTCGGCA AGCCGACGAT CACCGTCGCC GACGGCAACT ACCGCAACCG CAACGAACTC
CTCTTGACCC ACCAGTACAA CGGCGTCGTG CTGGACCTCG AACAGGCTAC AGAGACGCTG
CAACGCGTCT TCGAACTCTG GGGGCGGCCA GTCAATCTGC TGACCATCGA CAAGGAGTTC
GACGAACACG ACGTGGAGGT CGCGCGGCGA CGCGACCAGG AACCCGAGCC CGAGGCGGTC
GGCAAGCGCC TGCGATACGA CGGCGAGGAG GTCACGATAC AGGAGGTCGA CTGGTCCGAG
GTCGAACACT TAGACGCCGA CGACATCGAC TACGACACCA AGCCCGAGGA GTGGCTGTCG
TAG
 
Protein sequence
MSTDDHIRKR RVAAELETSV DEAANLAKKL GLTPYPVNYW IVDYDEMNEL IAYGGFQQRY 
PHWRWGMAYD RQQKQGQFLG GKAFEIVNND NPAHAFLQES NELADQKAVI THVEAHADFF
ANNDWFGLFA GGAAPGAADS EDGQRRLSNP EAAAMLGRHA ETIEEYMQDP EIDRAEVEKW
IDHVLCLEDN IDQHQPYAPV DGEDRPDEAD LEEIADQLGD LELSEEVRRQ VFTEEWLDAQ
SEDDEPITFP EEPQKDVLAF LQTHGMQYDA DGEKAVEMAD WQSDVLEMLR REAYYFAPQK
MTKVMNEGWA SYWESVMMAG EQFASADEFV LYADHMSKVL GSGGLNPYKL GLELWTYLEN
SENRREVVER LLRVEDVTWR NFHDVIDFER VQDLIAPDSA VTDVPASLDD LDPDDPRVDA
DALARARDGE IDVETYPWKV LTEAGMAERH YSLVKPQYRG FVSRISQSEL ERISRYMFDD
ARYESVADAL ADVDYSRGWD RMREVRESHN DVTFLDEFLT QEFVDEHDYF TYEYTHSSGN
FRATSTAAED VKKKLMLQFT NFGKPTITVA DGNYRNRNEL LLTHQYNGVV LDLEQATETL
QRVFELWGRP VNLLTIDKEF DEHDVEVARR RDQEPEPEAV GKRLRYDGEE VTIQEVDWSE
VEHLDADDID YDTKPEEWLS