Gene Hlac_2192 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2192 
Symbol 
ID7401125 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2174847 
End bp2176043 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content66% 
IMG OID643709262 
Productlycopene cyclase domain protein 
Protein accessionYP_002566839 
Protein GI222480602 
COG category 
COG ID 
TIGRFAM ID[TIGR03462] lycopene cyclase domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.035344 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCATCG CTCGGCACGA ACACGGACTA CAGGCTGACC TGCGGGCGTT GCTCTCGCAG 
GTCCACCCGG TGTTCATGTT GCCGCCGCTT GCGGCCTCGT GGTTCGGTGC GGCCGTCGCC
GGCGAGTTCG CGCTCTCGGT GGGCGCGATT CACATGACCG CGATCTTCTT CGCGGTGTAT
ACGGCACACG TGAAGGACGG CTACGTCGAC TTCCATCGGC GCGGCGAGGA CGACGACCAT
CCGATGACGA TCCGCGGCTG TCGCCTCTCC CTTCTCGCCG CCGGCGTCGG CTTCGCCGTC
TGTACGCTCA CGCTCGGAGT ATTCGTCGGC CCGGGCGCCG CGCTCGTCAC CCTTCCGACC
TGGTTCATCG GGTATCTCCA CGCGCCACAG CTCGATACGA ATCCGCTCAC CACGACGCTG
GGGTACCCGA GCGGCATCGC GCTCGCGCTG CTCGGCGGGT TCTACGTCCA GACGACCGAG
ATGACCGCGG CGATACTCGG ATTCGCGCTC GTCTTCCTCG TGACACTCGC CGGGGTGAAG
ATCATCGACG ACGAACAGGA CTACGCGTAC GACAGATCGA TCGACAAACG GACCGTCTCG
GTACTGCTCG GCCGACCCCG AGCCCGGACG CTGGCGTTCT CTCTGCTGAT GGCCGGGCTC
GTCGGCGTTC TCTGGGGGAC AGTCGACGGA CTGTTTCCCC CGTCGGCGCC GGCCGCGGCG
CTCGCGTTCG CCCCGATAGC ACTGGTGGCC AGACGGGCCC GCCCGACGAT CGCGACGATG
CTACTGATCC GTGGCGCCTA CGTCTTCCTG GCCGTCCTGA TCGTGGCCGT CTGGTTTCGA
CCGCTGTCCG GCACCCCGCT TCCGGACATC ACCGTTCTCG GGTCGTACAC GTACCTCGCC
ACCGAGATCG TCTTCGGCGC GCTCGCGTTC GGCCTGCTCC GCTACGCCGG CGCGCTCCGT
CAGTCGGCCC GGACGATCGC CGCCCTGTAT CCGATCGCGT ACCTCTGGGA CTGGTACACG
CTGGAGATCG GCGTCTTCGA GATCACGATG CGCACCGGAT ACGACCTGTT CGGGATCCCG
ATCGAGGAGC ACCTCTTCAT GATCGTCGTG CCGGCACTTG TCCTCGGCAT TCACGAGACC
ATCCGGACGC TCTCGGCCGA GTCGGACGAC GCGTCTCGAA GCGATACTCA CAGGTGA
 
Protein sequence
MAIARHEHGL QADLRALLSQ VHPVFMLPPL AASWFGAAVA GEFALSVGAI HMTAIFFAVY 
TAHVKDGYVD FHRRGEDDDH PMTIRGCRLS LLAAGVGFAV CTLTLGVFVG PGAALVTLPT
WFIGYLHAPQ LDTNPLTTTL GYPSGIALAL LGGFYVQTTE MTAAILGFAL VFLVTLAGVK
IIDDEQDYAY DRSIDKRTVS VLLGRPRART LAFSLLMAGL VGVLWGTVDG LFPPSAPAAA
LAFAPIALVA RRARPTIATM LLIRGAYVFL AVLIVAVWFR PLSGTPLPDI TVLGSYTYLA
TEIVFGALAF GLLRYAGALR QSARTIAALY PIAYLWDWYT LEIGVFEITM RTGYDLFGIP
IEEHLFMIVV PALVLGIHET IRTLSAESDD ASRSDTHR