Gene Hmuk_1581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1581 
Symbol 
ID8411103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1506527 
End bp1508161 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content72% 
IMG OID645019907 
Productprotein of unknown function DUF255 
Protein accessionYP_003177402 
Protein GI257387629 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1331] Highly conserved protein containing a thioredoxin domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.397722 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCAGT TCGCAGCGGA GACGAAAGTC GAGTGGCGCG AGTGGGGGCC GGCGGCCTTC 
GAGGCTGCGC GGGAGGCCGG CAAGCCGATC CTGCTCGCGC TGACGGTGCC CTGGAGCCCC
GAGTGTCGCG AGATGGACCG CAAGACGTAC GCGGAGCCAC GGATCGCGGC CAACGTCAAC
GACGGCTTCG TCCCGGTCCG GGTCGACGGC GACCGCCACC CCGAGGTGCG CGAGCGATAC
ATCATGGGTG GGTTTCCGTC GACGGTGTTT CTCACGCCCG AGGGGACAGT GCTGACCGGG
GCGACGTATC TCGGACCCGA CGGCTTCCGT GGGATCCTCG ACAGCGTCCG CGAGACGTGG
GAGACGGAGG GAGAGGCCGC TGGTTCCGTC CCGCGCTCGC TCCAGACCGA CGCGCCACCA
GCGGGCGAGG TGACCGCACG GATCGAGGAG GCGATGGTCG AGCAGTTGCT CGCGGCTTAC
GACGAGGAGT ACGGCGGCTG GGGCACCGAC GTGAAGTTCC CGTTGCCACG GACCGTCGAG
TTCGCGCTGG TGCGGGCTCG CGACCAGGCG ACACGGACGC TGGAGGCGAT TCAGACGCAC
CTTCGGGACA CCGACGACGG CGGGTTCTAT CGCTACGCGA ACGGCCGGAC GTGGTCGGAC
GCGCGGACAG AGCGACTCCT CGACGAGAAC GCCGCTCTGG TCAGAGCGTT CGCTCACGGC
TATCGCTACA CGGGGGAGGA GGCCTATCGC GAGACCGCAG AGCGCGCCAT CGAGTACCTG
ACGACGAAGC TGTGGGTCGA CACCAGCGGG GACACCAGCG GTGCGTTCGC CGGCAGTCAG
GCCGGCGACG ACACGTATCA CCGGCTGGAC GCCAGCGATC GGGCCTCGGC GGATCCACCC
CGGGTCGACG AGACGGTCTT CGCCGACCGG AACGGGATGG CCATCGACGC GCTGGCGACG
TACGCCGCCT ACACCGACGA CGAGCGCGCT CGCCGGTACG CCGAGCGCGC TCGCGAGACG
ATCGCCGAGA CGCTCGTCGA GAACGGCGCG GCCACGCACT ACCGGACCGA CGAGGCCGTG
GGGCCGACCG GGCTCCTCCT CGACCAGGCG CGGGTCCTGC AGGGACTGAC GACCAGCTGG
CAGGTGCTCG GCGAGGGCGG CCCCGCCAGA GCGATCGCCG ACTGGGCGAT CGAGCACCTC
CAGACCGAGA GCGGCGCGTT TCGCGACGGA CCGGCCGACG GGCCGGGTCT GTGCTCGCGT
TCGCAGTACC CGCTCGACGC GACCGTCGAG CTGGCCGACG CCTTGCTCGA CCTGGCCGCG
CTCGCCGACG ACGAGCGCTA CCGGGAGGCC GCTCACGGTG CCATCGCCGC CTTCGCCGGT
GCGTCCGACC GGATGGGCGT CGAAGTTGCA CACTACGCGG CCACGGCCGC CCGGCTCCGA
TCGCCCGCCG TCCTTCGCGT CGGGCCGCGG GCCGGGAGCG ATCTCCACCG GGCCGCACTC
CGGCTGGCCG ACCACGAGAC CGTCGTCGTC CCCGATGCTG GCGGCGACGA GGCGGTCCTG
TTCGAAGACG GCGAGCGGGT CGGCACCGCC GAGGAGCCGG CGGGGCTCGA AGCCGTCCTG
ACGGGCGACG CGTAA
 
Protein sequence
MDQFAAETKV EWREWGPAAF EAAREAGKPI LLALTVPWSP ECREMDRKTY AEPRIAANVN 
DGFVPVRVDG DRHPEVRERY IMGGFPSTVF LTPEGTVLTG ATYLGPDGFR GILDSVRETW
ETEGEAAGSV PRSLQTDAPP AGEVTARIEE AMVEQLLAAY DEEYGGWGTD VKFPLPRTVE
FALVRARDQA TRTLEAIQTH LRDTDDGGFY RYANGRTWSD ARTERLLDEN AALVRAFAHG
YRYTGEEAYR ETAERAIEYL TTKLWVDTSG DTSGAFAGSQ AGDDTYHRLD ASDRASADPP
RVDETVFADR NGMAIDALAT YAAYTDDERA RRYAERARET IAETLVENGA ATHYRTDEAV
GPTGLLLDQA RVLQGLTTSW QVLGEGGPAR AIADWAIEHL QTESGAFRDG PADGPGLCSR
SQYPLDATVE LADALLDLAA LADDERYREA AHGAIAAFAG ASDRMGVEVA HYAATAARLR
SPAVLRVGPR AGSDLHRAAL RLADHETVVV PDAGGDEAVL FEDGERVGTA EEPAGLEAVL
TGDA