Gene Hlac_1747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1747 
Symbol 
ID7399618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1762684 
End bp1763943 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content61% 
IMG OID643708814 
Producthypothetical protein 
Protein accessionYP_002566399 
Protein GI222480162 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0945012 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCGA GAACTGAAAC CGCGGCGGTC GTCGTCGTCG GCTGCGGTCC CGGTGGCGCA 
GTCCTCGCGT ACCTCCTAGC CCGGAGCGGG ATTGACGTCG CGCTCGTTGA GCGTGCCGCC
ACGTTCGAGC GTGAATACCG AGGATTTGGC TGGAATCCCG GTGTGGTTCG TCTGTTCGAC
GAGATGGACC TCCTTGACGA TGTGCTTGCT CTGGCTCACG AGACGGTCAC AGAGGGTTCG
TTCTCGCTGT ACGGCGAGGA GAGTACGGTT CTCGACTTCG ATCTGCTCGA TACCGACTAT
CCGTATGCGC TGCTGATGGA GCAGCCAGCG CTGCTCGACT GTCTCGTCGA CCGCGCCAAC
TCCTCCGACA ACTTCACGTT TCACCCTGCG ACTACCGTCA CGGACATCTG TACGAACACC
GCAGATGGAA TTCGGGGGGT AACAGCCCGA GACCGAGACG CCGACGAGGA TGTCGAGTTC
AAAACGCAGT GCGTTGTCGG AGCCGATGGT CGCTACTCGA CAGTTCGCGC CAGCGCCGGC
ATCGATCCAG GGCTGTTCGA GTCGCCGATC GATCTCGTGT GGTTCAAGCT CCCTCGCGGC
GCTATCAATG CGACGACGCA GGGTCGAATC GACCGCAACG GTGTCCTCCT ATACTTCGGT
CTGGGTGGCG GCGATCTCCA AATTGGATAC CTCATCCGAA GCGGTGAGTG GCCCTCGATC
AGGCAAGCGG GGTTCGACGC GTTCCGGGAG CGGGTCGCAG AGATCGACCC GCGAGTCGGC
TCAACCATGG CTGTGCAGTT GGATGGGTTT CGTGATACCA GTCTCCTCGA CGTTGCTCCG
GGAATCGCGG ACAGTTGGAG TCGGGACGGG CTTCTTCTCA TCGGTGACGC CGCGCACACT
GCAAGTCCCA TCGGTGCACA GGGAAACCCG CTCGCCGTCG AGGACGCCGT CGTCGCGCAC
AGTCTTCTCG TCGAGAAACT CACTGGCACG GACGGAATTC TCGAACGCAA AACCCTCCAC
GAATTCGAGG TTCGACGACG TGCGCACGTC GAACAGGTTA TTTCGCTCCA GCGGCGCGCC
GCGACCAATC TCGCGTACTG GCTCGAGTAC GGTCGCTATG TCCCTCCACG CCTCGTTCGT
GGGATGACGA AGGCGGCCAG GGTGATTGTC CCTCGTTCGA GGTCGGTGCG GAACACGATC
GAAACGTTCG CACTCGGTCA ACGTTCGGTT TCGGTCGACC GATCTCACTT CATTGACTAA
 
Protein sequence
MTARTETAAV VVVGCGPGGA VLAYLLARSG IDVALVERAA TFEREYRGFG WNPGVVRLFD 
EMDLLDDVLA LAHETVTEGS FSLYGEESTV LDFDLLDTDY PYALLMEQPA LLDCLVDRAN
SSDNFTFHPA TTVTDICTNT ADGIRGVTAR DRDADEDVEF KTQCVVGADG RYSTVRASAG
IDPGLFESPI DLVWFKLPRG AINATTQGRI DRNGVLLYFG LGGGDLQIGY LIRSGEWPSI
RQAGFDAFRE RVAEIDPRVG STMAVQLDGF RDTSLLDVAP GIADSWSRDG LLLIGDAAHT
ASPIGAQGNP LAVEDAVVAH SLLVEKLTGT DGILERKTLH EFEVRRRAHV EQVISLQRRA
ATNLAYWLEY GRYVPPRLVR GMTKAARVIV PRSRSVRNTI ETFALGQRSV SVDRSHFID