Gene Hmuk_2059 details


Gene Information

Locus tag: Hmuk_2059
Symbol:
ID: 8411594
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1968496
End bp: 1970025
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 69%
IMG OID: 645020397
Product: protein of unknown function DUF1508
Protein accession: YP_003177879
Protein GI: 257388106
COG category: [S] Function unknown
COG ID: [COG3422] Uncharacterized conserved protein
TIGRFAM ID:
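
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon. Both assumptions are consistent with the listed values, but the record does not state them. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1968496
end_bp = 1970025

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 1530 509
```

Under these assumptions the result agrees with the listed gene length (1530 bp) and protein length (509 aa).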


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACGTT CCAGTGCGGA GGACGGAGCC CTGCTCAAGT GGTATCGCAA CCGGATCGCC 
GAGCCGACGA CCGACGACGA AGTGTACGGC TACTGGCTGT TCGTCTTCGG TGTCGTGGTC
GGTCTGTTCG GCGTGTTACT CGCCGGCTCG ACTGCCACAC AGACCAGCGG GTCGTTCCAG
CGACTGGCCG GGATCGCCCT GGGCGGACTC GGACTGCTGT TCGTCCTGAT CGGGCCGATC
ATCCGGCTCC CGCTGGAACG CCGGGCGACG AGGGTCTCCT ACCTCGGGGC GGCCATCAGC
GTCCTCGCGA TCGTCTGGCT CGTCCTCTCG TTCGACGGCG GGCGCGGACA GCGCACGATC
GTCTTCGGGA CCTACGGCAT CGGGATGGCC GTCATCGCGC TGGGTGCGAT CCTCGTGCCG
ATGCTGAACG AGACCGACGG CGAGACCGCC GTCTCGGCCG CTGAGTCGGC GGCCCAACGG
GCGCAGGCGG ACGCCGACAG TGCCCGTACG GAGGCCGACG AACACCGGGC GGACGCCGAC
GACCTTCGCG ATCAGGTCGC CGCTCGGACC GACGAACTGG CAGTGCTCCA CGACAGCGCG
GCGACGTTCG AGCTGTACGA GGACAGGGGC GGCAAGTACC GCTGGCGGCT CCGCCACCGC
AACAGCAACA TCATCGCCGA CAGCGCCCAG GGGTACTCCC GCCGCGTCGA CGCCCAGAAC
GGGATCGAGA GCGTTCGCCG CAACGCACTG GGCGGCGCGC TCGTGGAACT GGAGACCACG
CTCGAAGACT CGACCGACGA GGCAGAGACG GGCGTCGTCG CCTATCCGCC GGGCGAGAGC
AAGGCGACGT TCGAGCTGTA CGAGGACAAC GTCGGGGGAT ACCGCTGGCG ACTCCAGCAC
GCCAACGGCA ACATCATCGC CGACAGCGGC GAGGGCTACA CCGAGCGCCG CGAGGCCGAG
GCGGCCATCG AGCGCGTGAG CGAGCGGGTC GGACCGGCAC AGTACCTCCG TGCCGATCCG
ACGGCCTTCG AGGTGTACCA GGACGCCGCC GAAGAGTGGC GCTGGCGGCT GATTCACAAG
AACGGCAAGA TCCTCGCCGA CAGCGGTGAG GGGTACGCTT CCCGTAGCGG GGCTCGTGAC
GCCGTCGATC GCGTCCGGTC TGGTCTGGAC GACGACGACT ACGAGTTCGA GGTCTACGAG
GACACGGCCG GCAAGTACCG CTGGCGGCTC GACGCGGACA ACGGCGAAAC CGTCGCCGAC
AGCGGGCAGG GGTTCAGCGA GGAGCGGGGC GCGCGCGAGA GCATCGATCG CGTCGAGGGA
CACGCGCTCG ACGCCGCGGC ACTGGACATC GGACTGGCGG CTTTCGAGGT CTTCGAGGAC
AGCGCCGGGA AGTATCGCTG GCGACTCCGC CACCGCAACG GGAACATCCT CGCCGACGGG
GGCCAGGGGT ACAGCTCGCG GACGAAGGCC TTCGACGGCA TCGACAGCGT CAAGCGCAAC
GCGCCCAACG CCGAGACCAC CGACGCCTGA
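
A GC content figure like the one listed in the Gene Information fields can be recomputed from a block-formatted sequence such as the one above. The following is a minimal Python sketch. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demo input is only the first three 10-base blocks of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Demo input: first three blocks of the gene sequence (30 bases).
first_blocks = "ATGGAACGTT CCAGTGCGGA GGACGGAGCC"
print(f"GC fraction of the demo input: {gc_fraction(first_blocks):.3f}")
```

To check the whole gene, pass all rows of the gene sequence as one string. Whitespace and line breaks are ignored.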
 
Protein sequence
MERSSAEDGA LLKWYRNRIA EPTTDDEVYG YWLFVFGVVV GLFGVLLAGS TATQTSGSFQ 
RLAGIALGGL GLLFVLIGPI IRLPLERRAT RVSYLGAAIS VLAIVWLVLS FDGGRGQRTI
VFGTYGIGMA VIALGAILVP MLNETDGETA VSAAESAAQR AQADADSART EADEHRADAD
DLRDQVAART DELAVLHDSA ATFELYEDRG GKYRWRLRHR NSNIIADSAQ GYSRRVDAQN
GIESVRRNAL GGALVELETT LEDSTDEAET GVVAYPPGES KATFELYEDN VGGYRWRLQH
ANGNIIADSG EGYTERREAE AAIERVSERV GPAQYLRADP TAFEVYQDAA EEWRWRLIHK
NGKILADSGE GYASRSGARD AVDRVRSGLD DDDYEFEVYE DTAGKYRWRL DADNGETVAD
SGQGFSEERG ARESIDRVEG HALDAAALDI GLAAFEVFED SAGKYRWRLR HRNGNILADG
GQGYSSRTKA FDGIDSVKRN APNAETTDA
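
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal Python sketch, not the database's own method. It uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs only in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical. Ambiguous bases such as N are not handled and would raise a `KeyError`.

```python
from itertools import product

# Standard codon assignments in NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last row of the gene sequence, as demo inputs.
print(translate("ATGGAACGTT CCAGTGCGGA GGACGGAGCC"))  # prints: MERSSAEDGA
print(translate("GCGCCCAACG CCGAGACCAC CGACGCCTGA"))  # prints: APNAETTDA
```

The two demo outputs match the first ten and last nine residues of the protein sequence listed above.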