Gene Hmuk_0011 details


Gene Information

Locus tag: Hmuk_0011
Symbol:
ID: 8409507
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 8006
End bp: 10351
Gene Length: 2346 bp
Protein Length: 781 aa
Translation table: 11
GC content: 69%
IMG OID: 645018348
Product: putative PAS/PAC sensor protein
Protein accession: YP_003175869
Protein GI: 257386096
COG category: [R] General function prediction only
COG ID: [COG3413] Predicted DNA binding protein
TIGRFAM ID: [TIGR00229] PAS domain S-box
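
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(8006, 10351)
print(length)                  # 2346
print(protein_length(length))  # 781
```

Both values match the Gene Length and Protein Length fields of this record.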


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.206274
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.555511
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGGCG AAACCGGGTC CGGATCGGAG ACGAGCGACG GAAAGCGAGT GCTCTACGTC 
GACACGCCGA CCGACAGCGT CGCCGACCGG CTCGCCGCTC GCCTCGCCGA CTGTCGCCTA
CACCACGAGA CCGGCGTCGA GGCGGTCACG GCCGCCGAGT CCGAATCGTG GGACGGGCTC
GTCGTGACCG ACGGTATCGA GGCGGACCGA CGCAGCGCGC TCCTCGATGC CGTCGACTGT
CCGTCGGTGC TGTACGCGAG CGCGGACCCG GCGACGATCC CAGCGGAGAC GACACGCCGA
GTCGAGACGA TCGTCGAACG CGAGACCGCG GACGCGGCGT CTCTCCTCGC AGAGAAGATC
GAGACGCTCG TCGGACAGTT CCCCGACGCC TTCGAAGACG CGCGTGGGAG TGCTCTCTCG
ACGATCTGTC GGGAGGTGAC CGAAGGCAGC GCACTGTTCG CCGTCGACGA CGAGGGCCGC
GTGCGCTGGT CGAACCACAG CTTCGAAGCG CTCTTTCCCG TCGATCGGAT CGAGCGCTCG
ATCCCGGAGA CGGAGGACTT CTACGCGCGA CTGGACGCGA TCGTCTCGGC GGAGCCCGAC
GACGGTGGCC GCCGCGACAT CGGCCCTCGG GATGGGCCGG TCGAGAACCA CTCGGTGGTC
GTTGCCACCC CCACTGGCAC CCGCTACTAC GTCCACCAGC GACATCGCCT CGAGAGTGTG
GGAGACGGAA TCACGATCGA GCAGTTCGAG GACATCACGG ACCGCGTCCG TCGCGAGACG
CGGCGGCGAC TGCTCGCGTT GCTGGTCGAA CAGGCGCGCG ACGGCCTGTA CACGCTCGAT
CACAACGGCG TCGTCGACTT CTGTAACGAG TCGTTCGCGG CGATGCTGGG CTACGAACGC
GGAGAGCTGA TCGGGATGCA CGCGTCCGAG ATGCTCGCTC CCGGCGAACT GGTGGCCGGC
CAGCGCACCG TCCAGGCGCT GCTCGACGAC CCCGACACCG ACGGCGCGGA AGTCGACATG
ACCTTCCGGA CCCGCGACGG CGAGGATCGC GAGCTGTCGA TCCACTACAC GCTCCTGTCG
ACCGGCGGGG ACAGTTACGG CGGTCTGATG GGAGTCGCGC GCGACGTGAC CGAACGCCGC
GAGCGGACCC GCCGGATCGA GTCCCAGCGA GACGAACTCT CGACGCTGAG CCGGACGAAC
GTACTGGTAC AGGACATCAT CGGCGCGCTT GGCACCGCTG CCAGCACCAT CGAACTCCGG
CAGACCGTCT GTGATCGGCT GGTCGAGTCG GGTCGTTACG GACTGGCCTG GATCGGCGAG
CGCCACGGTG GCGACGACGT GATCGTCCCG CTGACCAGCG CCGGTGCGTG GACCGAGCAC
CTCGACGATG TCGAGCTCCG CGCCGACGAC GGGGCGGATG CCGCTCCGGT CGCTCGTGCC
TACCGCACCG GCGAGGTACA GGTCGTCTCG GACACGCGAT CGCTGGCGGC GTTCTCGCCC
CGGCCAGAGC GAGCACCGGC TGCTGGCTTC GAGGCGGTGA TCGTCACCCC GCTCTGTCAC
GGCGAGACGA CCCACGGCAT CCTCGCCGTG TACGCGACCG AACCGGACGC GTTCAGCGAG
CGCATCGCAC GTAGCTTCGC CGTCCTCGGG GAGACGATCG GGCTCGCTCT CACGGCGATC
CAGAACAAGC GGCTGCTCCA GCACGAGGCC CCGCTCAGGC TGGCGTTTCG CTCCGACAGC
GACGACGCCT GTCTCGTCCG GGTCGCGCGA AGCTGTGACT GTACGCTCGA AACGGCCGGC
GTCGTCGAGA CCAGCGACGG CGTGGTGCAG TACCTCCGGG TCGACGGAGC CGCCCCCAGT
ACGGTGGTCG ACACGGTCTG CGAGTCCGAG CTCGTCGACG ACGCGGCGGT GACCCGGACC
GACGGGGACT CTCTCGTCGA ACTCCACGGA CCGTCGTCGC CGGGGACGGA GCTCGCAGAG
GTGGGTGCCC GTCTCGTCGA GACCGAGATC GATCCCGGCG GGGCTCGCCT CGTTGTCGAG
ACGACCGGCG ACGCGGACCT CCGATCGGTC CAGACCGTCG TCGAACGGTG GTTCCCGGAC
GTGACGCTCG TCTCCAAGCG CAAGCGGACC CAGCCCGAAA GCGACGGTGA ACGGTCGCCG
CTGTCGCCGC TGACCGACCG CCAACGAGAG GTCCTCCGGG CCGCGTACCT CTCGGGCTAC
TACGACTGGC CTCGCGAGAC GACCGCCGAG GAGCTCGCGG ACTCGCTGGG CATCGCCTCG
CCGACACTCC ACCAGCACCT CCGCCGAGCA GAGCGGAACC TCCTCGCTGG CGTGCTCGAT
CGGTAG
 
Protein sequence
MTGETGSGSE TSDGKRVLYV DTPTDSVADR LAARLADCRL HHETGVEAVT AAESESWDGL 
VVTDGIEADR RSALLDAVDC PSVLYASADP ATIPAETTRR VETIVERETA DAASLLAEKI
ETLVGQFPDA FEDARGSALS TICREVTEGS ALFAVDDEGR VRWSNHSFEA LFPVDRIERS
IPETEDFYAR LDAIVSAEPD DGGRRDIGPR DGPVENHSVV VATPTGTRYY VHQRHRLESV
GDGITIEQFE DITDRVRRET RRRLLALLVE QARDGLYTLD HNGVVDFCNE SFAAMLGYER
GELIGMHASE MLAPGELVAG QRTVQALLDD PDTDGAEVDM TFRTRDGEDR ELSIHYTLLS
TGGDSYGGLM GVARDVTERR ERTRRIESQR DELSTLSRTN VLVQDIIGAL GTAASTIELR
QTVCDRLVES GRYGLAWIGE RHGGDDVIVP LTSAGAWTEH LDDVELRADD GADAAPVARA
YRTGEVQVVS DTRSLAAFSP RPERAPAAGF EAVIVTPLCH GETTHGILAV YATEPDAFSE
RIARSFAVLG ETIGLALTAI QNKRLLQHEA PLRLAFRSDS DDACLVRVAR SCDCTLETAG
VVETSDGVVQ YLRVDGAAPS TVVDTVCESE LVDDAAVTRT DGDSLVELHG PSSPGTELAE
VGARLVETEI DPGGARLVVE TTGDADLRSV QTVVERWFPD VTLVSKRKRT QPESDGERSP
LSPLTDRQRE VLRAAYLSGY YDWPRETTAE ELADSLGIAS PTLHQHLRRA ERNLLAGVLD
R
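
The sequences above are printed in blocks of 10 characters, 60 per line. As a minimal sketch, the Python below strips that layout, computes a GC fraction, and translates the first line of the gene sequence. It assumes the standard codon assignments, which translation table 11 shares with table 1 for internal codons (the two differ in the permitted start codons, which does not matter here because the gene starts with ATG). All helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third position fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks of the blocked layout."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGACCGGCG AAACCGGGTC CGGATCGGAG ACGAGCGACG GAAAGCGAGT GCTCTACGTC"
dna = clean(first_line)
print(len(dna))        # 60
print(translate(dna))  # MTGETGSGSETSDGKRVLYV
print(round(gc_fraction(dna), 2))
```

The translated fragment agrees with the first 20 residues of the protein sequence. Running `gc_fraction` on the full 2346 bp gene sequence would be the way to compare against the GC content field.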