Gene Hlac_1904 details


Gene Information

Locus tag: Hlac_1904
Symbol:
ID: 7399856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1906566
End bp: 1907762
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 67%
IMG OID: 643708975
Product: multicopper oxidase type 3
Protein accession: YP_002566552
Protein GI: 222480315
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2132] Putative multicopper oxidases
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
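
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (consistent with the reported 1197 bp) and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1906566, 1907762)
print(length, protein_length_aa(length))  # 1197 398
```

Both values agree with the Gene Length and Protein Length fields of this record.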


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAGAG ACGGCGACGA CACCGGGCGA CGCATCGCTT CCAGACGCGG CTTCCTCTCG 
GCGGCCGCCG CACTCGGCAC CGTCGGGCTC GCGGGGTGTG GCGCTCCGCG AGCGGATGCC
GGGGAGGGCG CGGAGAACGC GGTCACCGAC GAGACGGTCC AACAGCAGGT CGAGGAGTGG
TCAGGAAGCG ACTCTACGGG CGTGGAGACG GATCACCCGT ACACGTCCCC GCGAACGACG
ATCGACCTCG ACGAGCGGGA CGGGCAGATC ACGGTGTCCA CGACCCCGTG TCGCCACCAG
CTGCTCGGCG AGGACACGCA GGGCGGTCCG TGGGAGCTCC CCGAGGTCTG GGCGTGGCAG
ACGCCGGACA CGGATCCGAG CGTCCCCGGC CCGTTGCTCC GGGTAACCGA GGGAACCCAA
CTGGAGATCA CCTACGACAA CTCGGCGCAC AACCGCCCGC ACACCTTCCA CGTCCACGGG
CTCAGCAAGG ACTGGATGGA CGACGGCGTC CCGACGACGA CGGGCCAGCA GGTCGCGCCC
GGCGAGGAGT ATACCTACGA GATTGACGCG AACCAGCCGG GCACCCACTT TTACCACTGC
CACTACCAGA CGCAGAACCA TCTCGATATG GGGATGTACG GGATCCTTCG CGTCGACCCG
GAGGGGTACG AAGCCCCCGA CAAGGAGGCG TTCATGACGA TCAAAGACTG GGACACTCGC
CTGTCCGCCT CGACGGCTGG CGGCGACGTG GACTTCAGCC ACCGCGACCG CAACCCCGAC
GCCTTCACCG TGAACGGTCG TTCCGCGCCG TACACATTCC ACCCCGAACA GGGCTCCCCC
TTGATCGTCG AGGAGGGCGA TCAGGTGCGG ATCCACTTCG TCAACGCCGG CTACGAGTCA
CACGCGATGC ACAACCACAA CCACGGCTTC ACCGTGGTCG AGAAGGATGG CGGCGTCATC
CCCGAGGCCG CCAGGCACCG TGAGGACGTG ATCCCCATCG CACCCGCCGA GCGGAAGACG
ATCGAGTTCA CCGCCGACGC CGACCCGGGT GTCTACGCGC TCCACTGTCA CAAGGTGAAC
CACGCGATGA ACGGCGACAG CTACCCCGGC GGCATGATCG GCGGGATGGT GTACGAGAGC
GCGATGGACT CAGAGCAGTT CGCCTCCGTG ATGGAGATGG CGGGCTACGA AGCCTAG
 
Protein sequence
MTRDGDDTGR RIASRRGFLS AAAALGTVGL AGCGAPRADA GEGAENAVTD ETVQQQVEEW 
SGSDSTGVET DHPYTSPRTT IDLDERDGQI TVSTTPCRHQ LLGEDTQGGP WELPEVWAWQ
TPDTDPSVPG PLLRVTEGTQ LEITYDNSAH NRPHTFHVHG LSKDWMDDGV PTTTGQQVAP
GEEYTYEIDA NQPGTHFYHC HYQTQNHLDM GMYGILRVDP EGYEAPDKEA FMTIKDWDTR
LSASTAGGDV DFSHRDRNPD AFTVNGRSAP YTFHPEQGSP LIVEEGDQVR IHFVNAGYES
HAMHNHNHGF TVVEKDGGVI PEAARHREDV IPIAPAERKT IEFTADADPG VYALHCHKVN
HAMNGDSYPG GMIGGMVYES AMDSEQFASV MEMAGYEA
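
The sequences above are laid out in blocks of ten characters separated by spaces, so they need cleaning before any analysis. The following is a minimal sketch, not the database's own tooling, of how one might strip the layout, compute GC content, and translate the gene sequence. The helper names are hypothetical. The codon table is the standard compact encoding with bases in T, C, A, G order; translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons, which this sketch does not handle specially.

```python
BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        b1, b2, b3 = (BASES.index(base) for base in dna[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, in the page's block layout.
first_blocks = clean_sequence("ATGACGAGAG ACGGCGACGA CACCGGGCGA")
print(translate(first_blocks))  # MTRDGDDTGR
```

The translated fragment matches the first ten residues of the protein sequence listed above. Applying the same functions to the full gene sequence would be one way to cross-check the reported protein length and GC content.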