Gene Hlac_3222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3222 
Symbol 
ID7399348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012028 
Strand
Start bp469821 
End bp471803 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content63% 
IMG OID643707019 
ProductSqualene cyclase-like protein 
Protein accessionYP_002564641 
Protein GI222476120 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAGT ATCCAACGCG ACGGCGGTTC CTTGCCGGAT CGGTGGGGGC GATGTCGACA 
CTCCCCGTCG GGACCGCGAG TGGAGACATC TGGGACGATG AAGATGAGAT CTTGGGCGAT
GATGACGACG ATGACGATAT CGAGGAACCC GACTGGGACG ACCTCTGGCG TGCGATTAAT
CGAACAGAGA CGTATCTTCG CCGGGAGTTG TCGGAGGACA ACACGTGGTA CGACGACGAC
CTGACACACT GGGATACGAC GGTTCGGTTC GAGGGCGCCG CCGATCTCAG AACCACCGTC
TACTACGCGC TGTTGACCGA CCGGATCTGG GGGAACGAAG ACGAACGCGA CGACTGTCTC
TCGTATCTGC TCGACCGGCG GTCGACCGAC GGCGGCTGGG ACGACACGGT AACCAACTTC
GGGATGCTGC TGCTGTTCGA CCGGATGGAC GACCACGAAT ACGAGTCGGT GGTCGAAGAT
ATCTTCCGGG AGATTGATGC GAAAAACCAG TCGCTGATGG CCTCGAAAGA CGAGGCAGGC
TTCACCGATA GGTTCCGAAT GCGATTGCTG TATGCGGTGC TCTCCGATGA CCACAGCTTC
GACGAACTGT TCCCACGGGA GTTACCGATC CGGGCTGGTG ATCTGATGGC GCTCACGTCG
GCCTTCGACG GAGAGGCGGT TTCCGCGCAG GATCAGACGG CTCGGCCGAG CTACATCACC
AGACAGTTAG CGTACTTCCT GCTCGGCGCG ACCGCCTACG ACCGGGAGCT CGACGAGGAC
GAAACACAAC TGGTCGAGGT CGCCGAGAAC GTGCTCCTGA GCCGACGCCT GACGAACGGC
GCGTGGAACA CCGTTCCGAC GACGCTCTTT GCGACGTTCG CACTTGTCGA ACGGGGCTAT
TCATGGTTCG ACGGCGAGGT TGGAACGCCC GTCCGCTGGA TCGCCGACAA CCGGCTGACC
GACGAGGGAC GCGTCGAGAG CTATCGACTC CCTGTCCGGG ATACGACATT GGTACAGAAA
GCGCTACTCG CCGCTGGCAG TTCGCCGGAC GCCGAGTTCC TGCAGGAATC GGCCCAGTGG
CTCTCCGAGG CGCGGACGCC ACAGACGGTC GGTCGTGAGT TGGACCGCGA GCCAGCGCCG
TTTCGACGCC ACCACGGACA CGGCTGGGGG TACATGCCGC ATGCGTTCTC TAACTGGGAC
GACACTGCGG TCGCACTCGG CTCGCTGTCG GCGCTCACCG ATGGTGATCT CGACGAACAG
ATCGACTTCC TCCGTCGGGT ACAAAACCGC GACGGAAGCT GGTCGGCGTA CACGACCGAC
TTCGCCCCCT TCCAGTCGAC GGAGATCGCC GACGAGGCCC GCTTGACGAT CGGCGACGAA
CAGTATCAGC TCCGCTTCGG CTATATTCCC GCGCCAGACA TCACTGGAAA CGCCCTTTCG
ACACTCGGTA TGCAGGGAGA CACCGTCGAC GACGATCGCC ACATCGACGA CGCCGTCTCC
TATCTGCGAG AGAATCGGGC GGACAACGGG CTGTGGCTCG GCGTCCGGGG GCAGGGCTAC
ACCTACGGCA CCGCCCGTGT GATGGAGGGG CTCCGAGCAG TCGACGTCGA CATGGATGAC
GACTTCGTGA CCGCTGCCCG CGGAACACTA CTCAGCCGAC AGAACGACGA CGGTGGCTGG
GGCGAACAGA CTCGCTACGA TCCGAGCCAG CCGGAATCGG GAAGCATCGA GTATCAACCG
GGCGAGTCGA CGCCGATACA GACCGGCTGG GCGCTGCAGG CCCTCCTTTA TGGTGGCATC
GGCCCCGATC GGGAGGCGGT CTGGGACGCC GTCGACTACC TGCTCGAAAC ACAGCAGTCC
GACGGCTCGT GGGAGGTCGA TCCCGTGCTG TACACCTTCG GCGGCCCGGG GTACAGCACC
GAAGCGATCA CACAGGCCGC AGTCCTCAGA GCACTCGGTC TGTACGAGTC GACCATCGAC
TAG
 
Protein sequence
MKEYPTRRRF LAGSVGAMST LPVGTASGDI WDDEDEILGD DDDDDDIEEP DWDDLWRAIN 
RTETYLRREL SEDNTWYDDD LTHWDTTVRF EGAADLRTTV YYALLTDRIW GNEDERDDCL
SYLLDRRSTD GGWDDTVTNF GMLLLFDRMD DHEYESVVED IFREIDAKNQ SLMASKDEAG
FTDRFRMRLL YAVLSDDHSF DELFPRELPI RAGDLMALTS AFDGEAVSAQ DQTARPSYIT
RQLAYFLLGA TAYDRELDED ETQLVEVAEN VLLSRRLTNG AWNTVPTTLF ATFALVERGY
SWFDGEVGTP VRWIADNRLT DEGRVESYRL PVRDTTLVQK ALLAAGSSPD AEFLQESAQW
LSEARTPQTV GRELDREPAP FRRHHGHGWG YMPHAFSNWD DTAVALGSLS ALTDGDLDEQ
IDFLRRVQNR DGSWSAYTTD FAPFQSTEIA DEARLTIGDE QYQLRFGYIP APDITGNALS
TLGMQGDTVD DDRHIDDAVS YLRENRADNG LWLGVRGQGY TYGTARVMEG LRAVDVDMDD
DFVTAARGTL LSRQNDDGGW GEQTRYDPSQ PESGSIEYQP GESTPIQTGW ALQALLYGGI
GPDREAVWDA VDYLLETQQS DGSWEVDPVL YTFGGPGYST EAITQAAVLR ALGLYESTID