Gene Hmuk_0259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0259 
Symbol 
ID8409757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp254568 
End bp257768 
Gene Length3201 bp 
Protein Length1066 aa 
Translation table11 
GC content67% 
IMG OID645018584 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_003176103 
Protein GI257386330 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00761831 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGTATGG ACCTGAAGGT AAAGTTGGTC GTGCTGATGT TGGCGATCTC GCTGTTGCCG 
CTTGGCAGTG TCGGAGCCCT GGCGACACAG AACATGGGGT CGCTCAATCA GGACGCCCAG
GATCGGAGCG CACAGGAGTT GCGCGGAGCG ATGACCGACG ACCTGAACAA CAGCGTACGG
GCGCGGCAGT CGTCGCTACA GAACCAGTTC GACCAGCGGG AGGTCGACAT GCGCTCGCTC
GCCCAGTCGG GCGCGATGGA CAACTATCAG GCCGCTCGAC AGGGGAAGAT GCAACTCGTC
CAGGAGGCCA GCCAGCGACA GGTCGGCTAC ACGGCCCTGG AGATGCGCAA CGCCATCGAG
AGTACGACCC AGACCGTCCT CGATACCGAG TACGACGGGC GCGAGTGGGA GGAGCTGTCC
CCCCGCGAGC AGCGCGCGGT CAAGCAGCGC GTCGAGCGGA TCATCGCCGG CACCGACGGC
GACGGGACGA CCGCCGACGG GACGATGGAC GAGACGTTCC AGCCCGGCTA CATCGGCGAG
ACCGGCTACG CCTACGTGAC CGACCTGGAC TCGAACGTCG TGGTCCACCA CCGTCTGGAA
GACGGGCACA ACTTCGCCGA CGACTCCGGC GGGAGCCTGA CCGTCTTCGA CGACATCAAA
GCGAACATCG AGTCGACGCC CGCGCTGCGC AACGGCGAGA CGTGGGGTGT CGCCGAGTAC
GAGTGGGAGG ACACGACACA GGACGGCAAC CCCACGGAGA CCCAGCTGGT CGCCTACACC
TACTACGAGC CCTTCGACTG GATCGTCGCT CCCAGCGTGT ACTACTACGA ACTCCAGGAG
CGAGCCGTCG CCGACAGCGA GGCGCGGCTG GCAGACTCCT TCGAGCAGTA CCTCGAAACG
CGGACTGTCA GCGTCAATCG GGAACAGCGC TACGCCTACG ACGAGATCAT CTTCGTCGAC
AAGCGGGGCC GAGAGGTCAT CAGTGCCACT CGCGAAGGCG AACGGGCAGT GACCGAGCGC
GACGGTTCGG GGTGGCACGC CGACAAGAGC TGGTTCACCA GTGCCGGTGC CGTCGCCAAC
GGCGAAGTCG CGTTCAACCG AATCGAGGAA GAGAACGGCA GTCAGCGGAT GTACGTCTCG
ACGCCGGTGT ACCAGGACGG ACGGTTCCGG GGCGTCGTCG TGGCCCAGTT CAACTACTCG
ATCGTCACGG AGATCACGAA CGGCGTGTCG GTCGGCGAGA CGGGCTATCT CTACGTCGTC
AACGACCGCG GCGAGATGGT GAGCCACCCC AACGAGACGA AGGTGGCACA GCGCGTCGAC
GTGGCCGGGG GTGAGTACGG CGAGGAACTC GGCAACATCA CCACCGAACA GATGCTGGCC
GCCCGGTCGG GCATGGCCAC ACACACGCGA ACGGTCGGGG GCGAAGAGAC GACCCGATAC
GTCGTGTACG CGCCCCTGCA GGTCGGAGAC AGGCAGTTCT CGCTCGTCGG CACCGTTCCC
GAGAGCGACA TCGAGGGGCC GATCGCGGCG CTGGGAGACG CGCTCCAGCA ACGCACCGAC
GACGCCCGCA ACCTCATCCT GTTGCTGTCG GGCATCGCGG CGCTGGTCGT CGTCGCCGCG
GGCTACGCCG TCGCGCGTTA CGTCGCCAGT CCGATCGAAC AGATGCGCGA CCGGGCCACG
GAGATGGCAC AGGGGCGGTT CGACGGCGAA CTCGACGTGG ACGACCGCGA CGACGAGATC
GGGGAGATGG TCGAGGCCTT CGAGGAGATG GAGGCGAACC TGACCCGTCA GATAACGGAG
ATCGAGGCCG TGTCGGCGTC GCTCAGCGAG GGCGAGATCG ACGACGACCT CCACACGGAC
CTGCCGGGCT CGTTCGGCGA CATCATGACG GACCTCCAGG ACGGCATGAC ACAGCTGCGG
ACGAGCTTCG CCCAGATCAG CCGGGCCAGT GAGAACGTCC GGACCGGCAA GCTCGACCAG
GAGATCGACA GCGACCTCCC CGGAGAGTAC GGCGCGGTGA TGGACGAACT CGACGCGGGA
CTCCAGCAGC TCTCCGAGAG CTTCGACCGG CTCCGGACGG CGAGTACGGC ACTCAGCGAG
CGCCGGCTCG ACGAGGACCT CGGTACCGAC CTGCCCGGCG CGTACGGCGA CGTGATGACC
AACATCGACG CGGGGCTCGA CGCCGTCGAG GAGAGCATCG CGCGGGTGCA GTCGATCGCA
CGGGAGGTGA GTTCGGCCAC CGACGACGCC GCGGCCAGCG CACAGGAGGT CGAGCGGGCC
AGCCAGGAGG TCGCCGAGTC CGTGCAAGAG ATCTCTCACG GCGCGGACCG CCAGTCCGAA
CAGCTCCAGG AGGCCGCCGC GGAGATGAAC GACATGTCTG CGACGGTCGA GGAGGTCGCC
TCCTCCGCGG TCGACGTGGC CAACACGTCA GAACGCGCCG CGGACCGCGC CGAGGCCGGG
AGCGAGCGGG CGAGCGAGGC CTCCCGCGAG ATCGAGGGGA TCGAAGCGGA GACCGAGCAG
GCGGTCGAAC AGGTCGCGGC GCTCGAATCC GAGATCGACG AGATCAGCGA GATCGTGAGT
ATGATCACCG ACATCGCCGA GCAGACGAAC ATGCTGGCGC TGAACGCCTC TATCGAGGCC
GCTAGAGCCG GTGAGGCCGG CGAGGGCTTC GCCGTCGTCG CCGACGAGAT CAAGGGGCTG
GCCGGCGAGG CCGCCGCGGC GACAGAGGAG ATCGACGGTC TCATCACCGA GCTACAGGCG
ACGACCGGGG AGACCGTCGA CGACATCGAG TCGATGCGCG ACCGCGTCGA GAACGGTGCC
GACACGATCG AGGACGCGAT CGCCATGTTC GACGAGATCG CCGAGAGCGC AACCGAGGCC
GAGCACGGCG TCCGAGAGAT CAGCGAGGCG ACCGACGACC AGGCCGCCTC GACCGAGGAG
GTCGTCGCGA TGGTCGACGA GGTCAGCAGC GTCAGCCAGC AGGCCGCGGC CGAGGCGACA
AGCGTCTCCT CGGCGGCGGA GGAACAGGCC GCATCGGTCA ACAACATCTC CCAGAACGTC
CGCTCCGTCG CCGACATGGC CGACGATCTC GAAGCGCTGG TCGAGGCCTT CGAGGTCCAG
CAGTCCGCGG CCGACGGCGA CGCGAGCGCA GCGGGAGCCA GCGCCGACGG CGGCTGGTCG
CCCGACGACG GCGCGGAGTG A
 
Protein sequence
MSMDLKVKLV VLMLAISLLP LGSVGALATQ NMGSLNQDAQ DRSAQELRGA MTDDLNNSVR 
ARQSSLQNQF DQREVDMRSL AQSGAMDNYQ AARQGKMQLV QEASQRQVGY TALEMRNAIE
STTQTVLDTE YDGREWEELS PREQRAVKQR VERIIAGTDG DGTTADGTMD ETFQPGYIGE
TGYAYVTDLD SNVVVHHRLE DGHNFADDSG GSLTVFDDIK ANIESTPALR NGETWGVAEY
EWEDTTQDGN PTETQLVAYT YYEPFDWIVA PSVYYYELQE RAVADSEARL ADSFEQYLET
RTVSVNREQR YAYDEIIFVD KRGREVISAT REGERAVTER DGSGWHADKS WFTSAGAVAN
GEVAFNRIEE ENGSQRMYVS TPVYQDGRFR GVVVAQFNYS IVTEITNGVS VGETGYLYVV
NDRGEMVSHP NETKVAQRVD VAGGEYGEEL GNITTEQMLA ARSGMATHTR TVGGEETTRY
VVYAPLQVGD RQFSLVGTVP ESDIEGPIAA LGDALQQRTD DARNLILLLS GIAALVVVAA
GYAVARYVAS PIEQMRDRAT EMAQGRFDGE LDVDDRDDEI GEMVEAFEEM EANLTRQITE
IEAVSASLSE GEIDDDLHTD LPGSFGDIMT DLQDGMTQLR TSFAQISRAS ENVRTGKLDQ
EIDSDLPGEY GAVMDELDAG LQQLSESFDR LRTASTALSE RRLDEDLGTD LPGAYGDVMT
NIDAGLDAVE ESIARVQSIA REVSSATDDA AASAQEVERA SQEVAESVQE ISHGADRQSE
QLQEAAAEMN DMSATVEEVA SSAVDVANTS ERAADRAEAG SERASEASRE IEGIEAETEQ
AVEQVAALES EIDEISEIVS MITDIAEQTN MLALNASIEA ARAGEAGEGF AVVADEIKGL
AGEAAAATEE IDGLITELQA TTGETVDDIE SMRDRVENGA DTIEDAIAMF DEIAESATEA
EHGVREISEA TDDQAASTEE VVAMVDEVSS VSQQAAAEAT SVSSAAEEQA ASVNNISQNV
RSVADMADDL EALVEAFEVQ QSAADGDASA AGASADGGWS PDDGAE