Gene Hlac_1049 details


Gene Information

Locus tag: Hlac_1049
Symbol:
ID: 7400121
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1041329
End bp: 1042900
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 60%
IMG OID: 643708117
Product: multicopper oxidase type 2
Protein accession: YP_002565716
Protein GI: 222479479
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2132] Putative multicopper oxidases
TIGRFAM ID:
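
The coordinate and length fields above are mutually consistent, which the following minimal sketch illustrates. It assumes the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon; the function name `cds_lengths` is hypothetical and not part of the source database.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive coordinates
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1   # the last codon is the stop codon
    return gene_len, protein_len


# Values taken from the record for Hlac_1049.
gene_len, protein_len = cds_lengths(1041329, 1042900)
print(gene_len, protein_len)  # 1572 523
```

Under these assumptions the result matches the listed Gene Length (1572 bp) and Protein Length (523 aa).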


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.541627
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGTCCA CCGGCAGTCG AAAGGGTGCA CCCTACTACC ACATCGAGAT GGCACCGGGG 
ACGCACGAGC ACCACCCCGA CATACCAGAC ACGCCGATCT GGGGGTACAA GGGCCCGAAC
GACGACGAAG GGAAGTATCC GGGGAAGACG ATCGAAGCCA CTCAGAACCA GCGTTTGAAA
GTCGAGTTCT CAAACGATCC ACTCCCCGAA ACGCATCTCC TCACCGATAG TGTGGACACG
GCTGTTCACG GGACGAAACC CGAAGACTAC GAGGACCGAT ATCCAGACTG GGTAAGCCAG
TTCGAAGCGT TCGGCGGAAC CTTCGCATTC CCGGAAGTCA GGACGGTGAC TCATGCCCAC
GGCGTGCACG TCGAGTCCGC GAGCGATGGC CTGCCCGAAC AGTGGCAATC ACCGGGAGGG
ATAGAAGGTC CGCAGTTCCA AAAGGCCGTC TACGATTACC CTAACCGGCA GTCGCCCGCG
ACGCTATGGT ACCACGACCA CGCCCTCGGG ATTACGCGGC TCAACGTCTA TGCCGGCCTC
GCCGGCTTCT ACCTGATTCG GGGCCGGTCA GACCGTCGCC TCGGACTCCC GAGCGGCGAT
CAGGAGATCC CTCTCCTCTT TCAGGACCGC ATGTTCCACG AGAACGGTCG ATACAAATAT
CCGGCCGAGT TCGCGCCCGA GTTCGCCGGT GACGTTTCGG TTGTCAATGG CAAGGCTTGG
CCCACATTCG TGGTCCAACC GCGACAGTAC CGGTTTAGGC TCCTCAACGG GTCGAACGGA
CGGTTCTTCG ATATCAGTCT GGAGAACGAA AACGACGGCG AGGTGCCGAC CATCTACCAG
ATCGGAACCG ACCTCGGCTT CCTTCAAGAC GTCGTGCCGA TCGGATCGGG ACAGGATACC
ACCTCCCTGC TTCTCGGGCC GGCCGAACGG GCGGACGTCA TCGTCGACTT CTCAGAGTAC
GCCGGCGATA CGCTCACCGT AAAAAACGAT GCGGGCTTCC CGTTCGTGAG TCCAGATGCA
GATAACAACG ATGGTGGCGG ACTCCCTGAG TTGGCGCAGT TCAGGGTGGC TGACACGGAT
CCGGAGACAC CGGTCGTGGA TCCGACGACA CTCAAATTGC CCGGGCCAGA GACCTTCCGC
GAAGAGGCGA CGAAGACAAC ACGGCAGATG AGTTTGGAGA CGACGACACT CAACGGGCTG
GATACCCACC TCCTCGGCGA GGAGGGAGGT CGTGCGGGTG GCGAACATTG GAACGATCCT
GTGTTGACCA AACCCCAGAT AGGGACGACG GAAGTCTGGG AGATTACGAA CAACACACCG
GACTCTCACC CAATCCACCT TCACCTAGTC GACTTCCAGG TAATCGGACG TGGCCCCGAC
GGCACAGAAC CGCCAGAACC GACAGAGCGC GGTAACAAGG ACACGGTCAA CGTTTATGGC
GGAGAAACCG TCCGAATCAT CAGCCGGTTC GGCGAATTCT CCGGACGGTA TGTCTGGCAC
TGTCACATCC TCGAACACGA GGATCAGGAA ATGATGCGCC CGTACGAGGT GATCCAGGGG
AACTCCTCGT AA
 
Protein sequence
MESTGSRKGA PYYHIEMAPG THEHHPDIPD TPIWGYKGPN DDEGKYPGKT IEATQNQRLK 
VEFSNDPLPE THLLTDSVDT AVHGTKPEDY EDRYPDWVSQ FEAFGGTFAF PEVRTVTHAH
GVHVESASDG LPEQWQSPGG IEGPQFQKAV YDYPNRQSPA TLWYHDHALG ITRLNVYAGL
AGFYLIRGRS DRRLGLPSGD QEIPLLFQDR MFHENGRYKY PAEFAPEFAG DVSVVNGKAW
PTFVVQPRQY RFRLLNGSNG RFFDISLENE NDGEVPTIYQ IGTDLGFLQD VVPIGSGQDT
TSLLLGPAER ADVIVDFSEY AGDTLTVKND AGFPFVSPDA DNNDGGGLPE LAQFRVADTD
PETPVVDPTT LKLPGPETFR EEATKTTRQM SLETTTLNGL DTHLLGEEGG RAGGEHWNDP
VLTKPQIGTT EVWEITNNTP DSHPIHLHLV DFQVIGRGPD GTEPPEPTER GNKDTVNVYG
GETVRIISRF GEFSGRYVWH CHILEHEDQE MMRPYEVIQG NSS
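
The relationship between the gene sequence and the protein sequence can be checked with a short script. This is a minimal sketch, not the database's own pipeline: it assumes the sequence is pasted in the space-separated 10-base blocks shown above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino acid assignments
# are those of the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove whitespace (block separators, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence from the record.
first_line = ("ATGGAGTCCA CCGGCAGTCG AAAGGGTGCA "
              "CCCTACTACC ACATCGAGAT GGCACCGGGG")
last_line = "AACTCCTCGT AA"
print(translate(first_line))  # MESTGSRKGAPYYHIEMAPG
print(translate(last_line))   # NSS
```

The two printed fragments correspond to the first 20 and last 3 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole record to be compared against the listed protein and GC content.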