Gene Hmuk_1967 details

Gene Information

Locus tag: Hmuk_1967
Symbol:
ID: 8411496
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1876834
End bp: 1877961
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 66%
IMG OID: 645020299
Product: multicopper oxidase type 3
Protein accession: YP_003177787
Protein GI: 257388014
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2132] Putative multicopper oxidases
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
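
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1876834
END_BP = 1877961
GENE_LENGTH_BP = 1128
PROTEIN_LENGTH_AA = 375


def span_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced, complete CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions, 1877961 - 1876834 + 1 gives 1128 bp, and 1128 / 3 - 1 gives 375 aa, matching the listed values.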


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCGCC GAGAGTTCGT CGCCGCAACG GGCCTGAGCG GTTCGCTGGC ACTCGCCGGC 
TGTCAGGCCC CCACGTCGTC CTCGAAGGTC TCGACGACGA GCGAGACGGC GACGGAGACC
TCGATGGCGA GTCAGTCCGA GTCGCTGCCG ACGACGTCGC CGCCGGAGAT CGTCAACGTC
GACGAGCAGG GCGGCGAGGT CACGATTTCG ACCGTCGCCT CGAAGCACAT GGTCCACCCC
GAGGAGACGA TGGGCGGCCC GGTCGAACTG CCGAAGGTGT GGGCGTTCCA GGCCGACGAC
GGCGAGCCCA GCGTCCCCGG TCCGATCCTC CGGACGACCG AAGGCGAGGA CATGGAGGTG
ACGCTCGACA ACACCGACGC CAACATGCCC CACACGCTGC ACTTCCACGG CGTCCGCAAG
ACCTGGGAGA ACGACGGCGT GCCGACGACG ACGGGCATCA CGGTCAATCC CGGCGAGAAG
CACACCTACG AGATTCCGGC CAACGTCCCC GGGACGCACC TCTATCACTG CCACTACCAG
ACCCAGCGTC ACATCGACAT GGGGATGTAC GGCATCTTCC GCGTCGACCC GAAGGGGTAC
GAGGAGGCCG ACCAGGAGCT GTTCATGACG CTGAAAGACT GGGACTCCAC CCTGAACAAG
CAGATGGCCG GCATGGACGC CACCTACAGC CCGCGGGACC GCGACCCCGA CGTGTTCACG
ATCAACGGCA AGAGCGCGCC CCGGACGCTC CATCCCGAAG ACGGCTCGCC GGTCATCGTC
TCGCAGGGCG ACACGGTCCG CATCCATCTC ACCAACAACG GGTACATGAG CCACCCGATG
CACACGCACA ACCACCGCTT CCGCGTGGTC GAGAAAGACG GCTCGAAGGT CCCCGAGGCA
GCGCAGTACG AGCAGGACAT CGTCAACGTC GCTCCCGCGG AGCGAAAGGC CATCGAGTTC
GACGCCGACG CCGACCCCGG CATCTACCTC ATGCACTGCC ACAAGGTCAG CCACGCGATG
AACGGCAACA CCTACCCCGG CGGGATGGTC GGCGGCATCG TCTACGAGGA GGCGATGGAC
TCGGATATCT TCGCCGAACT GATGGACTAC GCCGGCTACG AAGGCTAA
 
Protein sequence
MSRREFVAAT GLSGSLALAG CQAPTSSSKV STTSETATET SMASQSESLP TTSPPEIVNV 
DEQGGEVTIS TVASKHMVHP EETMGGPVEL PKVWAFQADD GEPSVPGPIL RTTEGEDMEV
TLDNTDANMP HTLHFHGVRK TWENDGVPTT TGITVNPGEK HTYEIPANVP GTHLYHCHYQ
TQRHIDMGMY GIFRVDPKGY EEADQELFMT LKDWDSTLNK QMAGMDATYS PRDRDPDVFT
INGKSAPRTL HPEDGSPVIV SQGDTVRIHL TNNGYMSHPM HTHNHRFRVV EKDGSKVPEA
AQYEQDIVNV APAERKAIEF DADADPGIYL MHCHKVSHAM NGNTYPGGMV GGIVYEEAMD
SDIFAELMDY AGYEG
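
The relationship between the gene sequence and the protein sequence above can be reproduced with a minimal sketch. It assumes the codon assignments of NCBI translation table 11, which match the standard genetic code for non-start codons, and it strips the space-separated 10-base blocks used in this record. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and not part of any database tooling.

```python
# Codon table built in the conventional T, C, A, G ordering.
# Table 11 shares these assignments with the standard code; it differs
# only in which codons may act as translation starts.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in _BASES for b in _BASES for c in _BASES), _AMINO)
)


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGAGTCGCC GAGAGTTCGT CGCCGCAACG"))  # MSRREFVAAT
```

Applied to the first three blocks of the gene sequence, the sketch returns the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_content` gives the value to compare against the reported GC content.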