Gene Hmuk_2314 details

Gene Information

Locus tag: Hmuk_2314
Symbol:
ID: 8411855
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2232034
End bp: 2234181
Gene Length: 2148 bp
Protein Length: 715 aa
Translation table: 11
GC content: 69%
IMG OID: 645020657
Product: protein of unknown function DUF255
Protein accession: YP_003178133
Protein GI: 257388360
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1331] Highly conserved protein containing a thioredoxin domain
TIGRFAM ID:
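
The coordinates and lengths in this record can be cross-checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive and that the gene length includes one untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record.
start_bp = 2232034
end_bp = 2234181

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2148 715
```

Under these assumptions the computed values agree with the listed gene length of 2148 bp and protein length of 715 aa.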


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.40107
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAGCG ACTCGGGTCC GACGGACCGG AACCGCCTCG ACGAGGCGGA GAGTCCCTAC 
CTCCGCCAGC ACGCGGACAA CCCCGTCAAC TGGCAGCCGT GGGACGAACA GGCACTCGAA
ACCGCCAGAG AACACGACGC GCCGATCTTC CTCTCGATCG GGTACTCGGC GTGTCACTGG
TGTCACGTCA TGGAAGACGA GAGCTTCTCC GACCCCGAGA CGGCGACGCT ACTCAACGAG
CACTTCGTCC CGATCAAGGT CGACCGCGAG GAGCGCCCGG ACCTCGACGC CATCTACATG
AGCATCTGTC AGCAGGTGAC CGGCCGCGGT GGCTGGCCGC TGTCGGCGTG GCTCACTCCC
GACGGGGAAC CATTCTACGT CGGGACCTAC TTCCCACCCG AGGAGCGCCG CGGGATGCCG
GCGTTCGGTC AGCTGCTCGA AGACATCGCG GGTTCGTGGT CCGATTCGGA GCAACGCGAG
GAGATGTACA ACCGCGCTCG GCAGTGGACC GACGCCATCG AGAGCGACGT CGGCGACGTG
GGGCAGCCGG GCGACGTGCC CGACGACGAG GCGCTGCAGG CGGCCGTCGA CGCCGCGATT
CGGGCCGCAG ACCGCGAACA CGGCGGCTGG GGGAACGGCC CGAAGTTCCC ACAGCCGGGG
CGACTCCACT ACCTCATGCG CGAGGTCGCC CGTTCGGACC GCGACGACGT GCGCTCGGTC
GTCACCGAGA CGCTCGATGC GATGGCCGAC GGCGGGCTCT TCGACCACGT CGGCGGCGGC
TTCCACCGCT ACTGCACGGA CCGGGAGTGG GTCGTTCCCC ACTTCGAGAA GATGCTCTAC
GACAACGCGA CGCTCCCGCG AGCCTACCTC GCGGGCTACC AGCTCACCGG CGACGAGCGG
TACGCGGAGG TCGCTCGGGA GACGTTCGCC TTCGTCGAGC GCGAACTCAC ACACGAGGAC
GGCGGCTTCT TCAGCACGCT CGACGCACAG AGCGTCCCAC CCGCCGGCCG TCGCGAGGAC
GCAGACGCCG AGCCCGAGGA GGGCGCGTAC TTCGTCTGGA TCCCCGATGA GGTCCGCGCG
GCCGTGGACA GCGAGACGGC GGCCGACCTG CTGTGTGACC GCTTCGGGAT CACCGAGTCG
GGCAACTTCG AGGGCAAGAC GGTGCTGACC GTCGACGCCT CCATCGAGGC CCTGAGCGAG
TCCAGCGGCC TCGAAGCGAG CGACGTCGAG CGCACGCTGG CAAGCGCACG CGAGCAGGTC
TTCGAGGCAC GCGAGGAGCG GCCACGCCCC GCACGCGACG AGAAGGTGCT CGCGGGCTGG
AACGGACTGA TGATCACGGC CATCGCGGAG GGCGCGATCG TCCTCGATGA TGTCGATCCG
GACCCGGCCG CCGACGCCCT CGCGTTCGTC CGCGAACACC TCTGGGACGA GAGCGAGCAA
CGGCTGGCGC GGCGCTACAA GGACGGCGAC GTGGCGATCG ACGGGTACCT CGAAGACTAC
GCCTTCCTCG CTCGCGGCGC GCTGACGCTG TTCGAGGCGA CCGGCGAAGT CGAACACCTC
GCCTTCGCTC TGGACCTCGC CCACGCCATC GAGCGAGAGT TCTGGGACGC GGACGACGGC
ACGCTGTACT TCACCCCGAC CAGCGGCGAG TCGCTGGTGG CCCGCCCACA GGAGCTCACC
GACCAGTCGA CGCCCTCCAG CACCGGCGTC GCGGTCCAGG CGCTGCTCTC GCTGTCGGCG
TTCGTCCCGC ACGATCGCTT CGAGACGATC GCGGCGGGCG TCCTGGAGAC ACACGCCAAC
AAGATCGAGG CGAATCCGAT GCAACACGCC TCGCTGGTCG TCGCGGCCGA CCGGTATCTG
CGGGGCGACC TCGAACTCAC GCTGGTCGCC GACGAGGTCC CGGCGGAGTG GCGAACGACG
CTGGCCGAGA CGTACCTCCC GGACCGACTG CTCGCGTGGC GGCCGCCGGG CGACGGCGAC
CTCGACGCGT GGCTCGACGT GCTGGGGCTG GACGATGTCC CACCGATCTG GGCAGATCGG
ACCGAGCGCG ACGGCGAGGC GACCGTCTAC GCCTGCCGCC AGTTCACGTG TTCGCCGCCA
CAACACCACC TGCGGAACGC GCTCGACTGG GCGACAGAGC AGGCGTGA
 
Protein sequence
MSSDSGPTDR NRLDEAESPY LRQHADNPVN WQPWDEQALE TAREHDAPIF LSIGYSACHW 
CHVMEDESFS DPETATLLNE HFVPIKVDRE ERPDLDAIYM SICQQVTGRG GWPLSAWLTP
DGEPFYVGTY FPPEERRGMP AFGQLLEDIA GSWSDSEQRE EMYNRARQWT DAIESDVGDV
GQPGDVPDDE ALQAAVDAAI RAADREHGGW GNGPKFPQPG RLHYLMREVA RSDRDDVRSV
VTETLDAMAD GGLFDHVGGG FHRYCTDREW VVPHFEKMLY DNATLPRAYL AGYQLTGDER
YAEVARETFA FVERELTHED GGFFSTLDAQ SVPPAGRRED ADAEPEEGAY FVWIPDEVRA
AVDSETAADL LCDRFGITES GNFEGKTVLT VDASIEALSE SSGLEASDVE RTLASAREQV
FEAREERPRP ARDEKVLAGW NGLMITAIAE GAIVLDDVDP DPAADALAFV REHLWDESEQ
RLARRYKDGD VAIDGYLEDY AFLARGALTL FEATGEVEHL AFALDLAHAI EREFWDADDG
TLYFTPTSGE SLVARPQELT DQSTPSSTGV AVQALLSLSA FVPHDRFETI AAGVLETHAN
KIEANPMQHA SLVVAADRYL RGDLELTLVA DEVPAEWRTT LAETYLPDRL LAWRPPGDGD
LDAWLDVLGL DDVPPIWADR TERDGEATVY ACRQFTCSPP QHHLRNALDW ATEQA
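
The sequences are printed in space-separated blocks of ten, so they need cleaning before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical and do not come from the source database.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, used only as a short example.
first_line = "ATGAGCAGCG ACTCGGGTCC GACGGACCGG AACCGCCTCG ACGAGGCGGA GAGTCCCTAC"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(round(gc_fraction(dna), 2))
```

Applying the same helpers to the full gene sequence would allow a comparison with the listed GC content of 69%. A check that the cleaned gene length equals three times the protein length plus one stop codon would confirm that the two sequences correspond.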