Gene Hlac_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2044 
Symbol 
ID7402063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2035259 
End bp2036323 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content71% 
IMG OID643709115 
ProductLuciferase-like monooxygenase 
Protein accessionYP_002566692 
Protein GI222480455 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03558] luciferase family oxidoreductase, group 1 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0320399 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0772623 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCTCT CGATCGTCGA TCTCTCGCCG GTCCCGGCGG GCGGGACCGC CGCCGACGCG 
TACGCGAACA CGGTGACAGC CGCGGAGCAG GCCGAGCGGC TCGGCTACTC GCGGTTCTGG
GTCGCCGAGC ATCACGGCAT GGGCGATCAC CTGGCCGGGA CGACTCCCGA GGTCCTGCTC
GGTCGTCTCG CGGGCGCGAC AGACTCGATC CGGCTCGGCA CCGGTGCCGT CCTGCTCAAC
CACTACAGCC CGTACAAGGT CGCGGAGGCG TTCGGCACGC TCGACGGGCT CGCGCCGGGA
CGGATCGACG CCGGCCTCGG TCGGGCCAAC GGCTCGCCCG CGTCAGACCG GGCGCTCGGC
ACGGACCGAC GCGTCGAGAA CCCGGACGAG GACCACGAGG AACGTATCCG TGCGGTCGTG
AACCATCTGT ACGACGCCTT CCCCGATGAT CACCCGTACG CCGACCTGAC GGTCCCGCGC
TCCGGTGCGG CCCCGCCGGT CCCGTGGGTG CTCGGCTCCA GCCCGGCGAG CGCGGCGATC
GCCGGCAAGC TCGGCGCCCG GTTCTGTTTC GCTGCGTTCA TCCGACCCGG GTTCGCCGAG
CGCGCCTTCG CGGCCTACCG CGAGCAGTTC GAGCCCGCCG AACTCGGAGA CGGTCCCGAT
GAGCCGTACG GGATGGTCGC GGTCAACGCG GTCGCCGGTG AGACGGACGC GGCCGCGGCG
CGGCGCCGCG CGCCAGCGGA AGCGACCTTC AAGCGGATGC AGCGCGGCGT GGTGGGGACA
ATTCCCTCCG TCGAGGAGTC GATCGCGGAA CTCGGCGGCG TTCCCGAGCC GACGCCCGCG
ACGCTCGATC CCGACGAATG GCCGCGAGCG ATCTCCGGGA GCCCGGAGAC GATCGAGGGA
CTGCTCGAAC AGCTCGCTGA CCGGATCGGC GCCGACGAGG TGATGATTCA GCACACGGTC
CCCGATCACG AGGATTCGCT GGCGTCGCAC GCGCTGCTCG CGGAGGCTGT CGGGCTGGAG
GGCGTCGGGT CGGAGGGCGA CGGGTTGGTA GATATCGATC GGTGA
 
Protein sequence
MDLSIVDLSP VPAGGTAADA YANTVTAAEQ AERLGYSRFW VAEHHGMGDH LAGTTPEVLL 
GRLAGATDSI RLGTGAVLLN HYSPYKVAEA FGTLDGLAPG RIDAGLGRAN GSPASDRALG
TDRRVENPDE DHEERIRAVV NHLYDAFPDD HPYADLTVPR SGAAPPVPWV LGSSPASAAI
AGKLGARFCF AAFIRPGFAE RAFAAYREQF EPAELGDGPD EPYGMVAVNA VAGETDAAAA
RRRAPAEATF KRMQRGVVGT IPSVEESIAE LGGVPEPTPA TLDPDEWPRA ISGSPETIEG
LLEQLADRIG ADEVMIQHTV PDHEDSLASH ALLAEAVGLE GVGSEGDGLV DIDR