Gene Hmuk_3382 details


Gene Information

Locus tag: Hmuk_3382
Symbol:
ID: 8409460
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013201
Strand:
Start bp: 184490
End bp: 187348
Gene Length: 2859 bp
Protein Length: 952 aa
Translation table: 11
GC content: 69%
IMG OID: 645018304
Product: putative PAS/PAC sensor protein
Protein accession: YP_003175825
Protein GI: 257373051
COG category: [R] General function prediction only
COG ID: [COG3413] Predicted DNA binding protein
TIGRFAM ID: [TIGR00229] PAS domain S-box
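
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 184490
end_bp = 187348

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon
# is a stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints "2859 0 952"
```

Under these assumptions the result agrees with the listed Gene Length (2859 bp) and Protein Length (952 aa).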


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCTGG ACTCACTCGC GTCTGCACGC ATTCTGCTGG TCGGGTCCTC GGAGTGGGTC 
GACGACGCTG CGACCGCGTT CGACGAGCGG ACGACGCTCC TGCGGGCCGA CACCGCCGAC
GCGGCGCTCA CGCTGGTCGC CGAGCAGTCG CCGGACTGTC TGGTCACGGC GTACGACCTC
CCCGACTCGA CCGGGGTCGC CCTCCTGGGA GACGTTCGCG ACGTGGCCCC GACGCTCCCG
GTCGTCGTCG GTGCGGCCGA CGGCAGCGAA TCGGCAGCCA GCGACGCGAT CGCCGCTGGC
GTCGACGACT ACGTCGTGAT CGACGAAGCG GCCGACCGAC CGAGCGACGT GCTCCTCGAC
CGGACGAGGG GGGTACTGGA AGCGACGCCG GACCGCGACG GTCACGAGCG ACGGGCACGC
CAGTTCGAGG CGATGTTCGC GGACGCGCGG ACCGCGACGT GGGTGCTCGA CCCCGACGGC
TCGCTCGTGC GAGCGAACCA GACCGCTCGC GAGATCGGCG ACGACGCGGC GTCGGCACAG
ACGGGTGGCC CGTTCTGGAC GCTCCCGTGG CTGTCGGGCG ACGAACGCAC CAGGGCCGAC
GTGCGACGCC TCGTGGAGCG TGGCGTCGAC GGGGCGTTCG CCCACGCCGT CGTCACGCTG
GCGGCGATCG ACGGGCCGTC GCGAGTGTTC GACCTGTCGG TCCGGCCGGT CGACGACGGC
GGCAGCGTCG ATTCGATCGT CGTCGAGGCC GTCGACATCA CGGACCGGGT CGAACTCGAA
CGCGACCTTC GCCGATCCGA GGAACTCCAC CGAGTGACGC TGAACAACAT GACCGACACC
GTCCTCATGA CCGACGAGGA CGGGGAGTAC ACGTACGTCT GCCCGAACGT CCACTTCATC
TTCGGCTACA CGGCCGCGGA GATCCGCGAG CAAAAGCCCA TCGAGGAGCT GCTGGGGGAG
GACCTCTTCG ACCGCGAGGA GCTGGCTCGG CGTGGCGTCT GCAAGAACAT CGAGTGTACG
GCCACGGACA AGGCCGGACG CGAACACACG CTCCTGGTCA ACGTCCGGGA GGTGTCGATC
CAGGACGGGA CGCTCCTCTA CAGCTGTCGG GACATCACCA AACGCAAGCG CCGCGAGGAC
GCGCTGGCCG CGCTGCACGG GACTGCACGG CGGTTCCTCT ACGCGGAGAC CCACCAGGAG
ATCGCACAGC GCGTCGTCGA CGACGCCCCG GCAGTCCTCG GCGTCGACGC CGTCGCGGTG
TACCTGTTCG ACGACGCGGC CAACGAACTC CGGCCGGCGG CGTACTCGCC GTCGGCGACA
GAGCTACACG GGCCGCTCCC GACGGTCGCG GTCGACGACG AGACGCTGCC GGGACACAGC
TTCGTCGCGG ACGACACGCT GTTTTTCGAC GACGTTCACC ACGCGGACCG ACTGGACAAC
CGGGCGACGG AGTTTCGCGG GACGACCTAC ATCCCGCTGG GTGACCACGG CGTGTTCGTC
GCCGCCTCGC CCGAGGTCGG TGCCTTCGAC GACGTGACCC GAGAGCTGTC GGACCTGTTC
GCTGCGACCA CCGAGGCCGC ACTCGACCGC GTGGCCCGCG AGGCACAGCT CCGCGAGCGA
GATCGGCAAC TCCAGCGCCA GAACGAGCAG CTGTCGGCGC TGAACCGCAT CAACGACACG
ATCCGCGAGA TCGGGCGGAC GATCGTGCGA GCGGAGACCC GCGAGGAGAT CGACCGGGCG
GTCTGTGAGC GACTGACCGA CGGCGACCGG TTCAGCTTCG CGTGGATCGG CTCCGTCGAC
CCCGCACACG AGGTCCTCGA ACCGCGGGCC TGGGCCGGCG ACGAGCAGGG GTATCTCGAC
TGCCAGTCGT TCGCCGTCGG CGCGTCCGAC GCCGAACCGT CCGGCCAGGC CGCGGCGACG
GACACCGTGA CTCTGGAGAC GAACGTCGCG GCCGGACTGC GAGACGAGCC CTGGCGAACG
GAGGCGCTCC GGCGCGATTT CACGTCCGTC CTGAGCATTC CGCTCGTGTA CAACGACCTC
AGCCACGGCG TCCTCTCGGT CTACGCCGAC TCGAAGGACG CCTTCGACGA GACGGCGCGG
GCGGTGCTCG CCGAACTCGG CGAAACGATC GCGTCGGCGC TCAGCGCTCT CGAACGGAAG
AACGCGCTCC TCACGCCGTC TGTCACCCGC GTCGAGTTCG CGGTCGACGA TCCGACCTTC
CTCCTCTCGC GGCTCGCCGA CCAGGCGGAG TGTACGGTCA GCTATCAGGG CGGCATCCGA
CAGACCGACG CGGGTAGCGC CCTCTTTCTC ACCGTCGAGG GTGCGCCGGT CGACGCCGTC
GTCGACCACG CCACCGAGAT GGCGGCGGTC GACGAGGTGC AGGTGGTCAG CGCCGACGAG
AACGGCGGCG TCGTGCGGCT CGTCGTGGCG CGCTTTCTCG CACAGGAGCT GGCCGACCAC
GGCGCGCTCC TCCGGGAGGT CACCGCCAGT CAGGGGGGCA CCGAGCTCCG TGTCGACGTG
CCAGAGCGCA TCACCGTGCG GGAGATCACG GAACTGATCG GCCAGACGGT GTCCAACGTC
GAACTCCGCT CCAGACGGAG CGTCGAGCGA CCCACACAGC ACGACGTTCG CTCGACCGTC
CTCGACGAGA TGACCGAGCG GCAACTGGAA GTGGTCCAGA CAGCCTACTA CAGCGGGTTC
TTCGAGTCGC CCCGAGAGAC CAACGGCAAG GAACTCGCGG CGATGCTCGA CATCTCCCCA
CCCGCGTTCT ACCAGCACGT CAGAGCCGTC CAGCAGAAGC TGTTCACGGC GGTGTTCGAG
GCATACGGAC TCCTGGAGTC TGGCAGGTTC AATAGTTAA
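
The GC content field can be recomputed from the gene sequence as displayed here, in blocks of ten bases. The following is a minimal sketch, assuming whitespace is the only non-sequence content in the text; `gc_percent` is a hypothetical helper name, and the example feeds in only the first displayed line, so its result is not the whole-gene figure.

```python
def gc_percent(blocked_sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    # Remove the spaces and line breaks that separate the 10-base blocks.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Only the first displayed line of the gene sequence, for illustration.
first_line = "ATGGCTCTGG ACTCACTCGC GTCTGCACGC ATTCTGCTGG TCGGGTCCTC GGAGTGGGTC"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence text to the same function would give the value to compare against the GC content field.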
 
Protein sequence
MALDSLASAR ILLVGSSEWV DDAATAFDER TTLLRADTAD AALTLVAEQS PDCLVTAYDL 
PDSTGVALLG DVRDVAPTLP VVVGAADGSE SAASDAIAAG VDDYVVIDEA ADRPSDVLLD
RTRGVLEATP DRDGHERRAR QFEAMFADAR TATWVLDPDG SLVRANQTAR EIGDDAASAQ
TGGPFWTLPW LSGDERTRAD VRRLVERGVD GAFAHAVVTL AAIDGPSRVF DLSVRPVDDG
GSVDSIVVEA VDITDRVELE RDLRRSEELH RVTLNNMTDT VLMTDEDGEY TYVCPNVHFI
FGYTAAEIRE QKPIEELLGE DLFDREELAR RGVCKNIECT ATDKAGREHT LLVNVREVSI
QDGTLLYSCR DITKRKRRED ALAALHGTAR RFLYAETHQE IAQRVVDDAP AVLGVDAVAV
YLFDDAANEL RPAAYSPSAT ELHGPLPTVA VDDETLPGHS FVADDTLFFD DVHHADRLDN
RATEFRGTTY IPLGDHGVFV AASPEVGAFD DVTRELSDLF AATTEAALDR VAREAQLRER
DRQLQRQNEQ LSALNRINDT IREIGRTIVR AETREEIDRA VCERLTDGDR FSFAWIGSVD
PAHEVLEPRA WAGDEQGYLD CQSFAVGASD AEPSGQAAAT DTVTLETNVA AGLRDEPWRT
EALRRDFTSV LSIPLVYNDL SHGVLSVYAD SKDAFDETAR AVLAELGETI ASALSALERK
NALLTPSVTR VEFAVDDPTF LLSRLADQAE CTVSYQGGIR QTDAGSALFL TVEGAPVDAV
VDHATEMAAV DEVQVVSADE NGGVVRLVVA RFLAQELADH GALLREVTAS QGGTELRVDV
PERITVREIT ELIGQTVSNV ELRSRRSVER PTQHDVRSTV LDEMTERQLE VVQTAYYSGF
FESPRETNGK ELAAMLDISP PAFYQHVRAV QQKLFTAVFE AYGLLESGRF NS
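
Most sequence tools expect FASTA input rather than the blocked layout shown on this page. A minimal conversion sketch follows, assuming whitespace is the only formatting to strip; `to_fasta` and the header text are illustrative choices, not a format defined by this record.

```python
def to_fasta(header: str, blocked_sequence: str, width: int = 60) -> str:
    """Join whitespace-separated blocks and re-wrap them as a FASTA record."""
    if width <= 0:
        raise ValueError("width must be positive")
    # Drop the spaces and newlines between the 10-residue blocks.
    residues = "".join(blocked_sequence.split())
    # Slice the continuous sequence into fixed-width lines.
    lines = [residues[i:i + width] for i in range(0, len(residues), width)]
    return "\n".join([f">{header}"] + lines) + "\n"


# Only the last displayed line of the protein sequence, for illustration.
last_line = "FESPRETNGK ELAAMLDISP PAFYQHVRAV QQKLFTAVFE AYGLLESGRF NS"
print(to_fasta("YP_003175825 Hmuk_3382 (C-terminal fragment)", last_line), end="")
```

The same function works for the gene sequence; only the header needs to change.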