Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_3222 |
Symbol | |
ID | 7399348 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012028 |
Strand | + |
Start bp | 469821 |
End bp | 471803 |
Gene Length | 1983 bp |
Protein Length | 660 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643707019 |
Product | Squalene cyclase-like protein |
Protein accession | YP_002564641 |
Protein GI | 222476120 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAGT ATCCAACGCG ACGGCGGTTC CTTGCCGGAT CGGTGGGGGC GATGTCGACA CTCCCCGTCG GGACCGCGAG TGGAGACATC TGGGACGATG AAGATGAGAT CTTGGGCGAT GATGACGACG ATGACGATAT CGAGGAACCC GACTGGGACG ACCTCTGGCG TGCGATTAAT CGAACAGAGA CGTATCTTCG CCGGGAGTTG TCGGAGGACA ACACGTGGTA CGACGACGAC CTGACACACT GGGATACGAC GGTTCGGTTC GAGGGCGCCG CCGATCTCAG AACCACCGTC TACTACGCGC TGTTGACCGA CCGGATCTGG GGGAACGAAG ACGAACGCGA CGACTGTCTC TCGTATCTGC TCGACCGGCG GTCGACCGAC GGCGGCTGGG ACGACACGGT AACCAACTTC GGGATGCTGC TGCTGTTCGA CCGGATGGAC GACCACGAAT ACGAGTCGGT GGTCGAAGAT ATCTTCCGGG AGATTGATGC GAAAAACCAG TCGCTGATGG CCTCGAAAGA CGAGGCAGGC TTCACCGATA GGTTCCGAAT GCGATTGCTG TATGCGGTGC TCTCCGATGA CCACAGCTTC GACGAACTGT TCCCACGGGA GTTACCGATC CGGGCTGGTG ATCTGATGGC GCTCACGTCG GCCTTCGACG GAGAGGCGGT TTCCGCGCAG GATCAGACGG CTCGGCCGAG CTACATCACC AGACAGTTAG CGTACTTCCT GCTCGGCGCG ACCGCCTACG ACCGGGAGCT CGACGAGGAC GAAACACAAC TGGTCGAGGT CGCCGAGAAC GTGCTCCTGA GCCGACGCCT GACGAACGGC GCGTGGAACA CCGTTCCGAC GACGCTCTTT GCGACGTTCG CACTTGTCGA ACGGGGCTAT TCATGGTTCG ACGGCGAGGT TGGAACGCCC GTCCGCTGGA TCGCCGACAA CCGGCTGACC GACGAGGGAC GCGTCGAGAG CTATCGACTC CCTGTCCGGG ATACGACATT GGTACAGAAA GCGCTACTCG CCGCTGGCAG TTCGCCGGAC GCCGAGTTCC TGCAGGAATC GGCCCAGTGG CTCTCCGAGG CGCGGACGCC ACAGACGGTC GGTCGTGAGT TGGACCGCGA GCCAGCGCCG TTTCGACGCC ACCACGGACA CGGCTGGGGG TACATGCCGC ATGCGTTCTC TAACTGGGAC GACACTGCGG TCGCACTCGG CTCGCTGTCG GCGCTCACCG ATGGTGATCT CGACGAACAG ATCGACTTCC TCCGTCGGGT ACAAAACCGC GACGGAAGCT GGTCGGCGTA CACGACCGAC TTCGCCCCCT TCCAGTCGAC GGAGATCGCC GACGAGGCCC GCTTGACGAT CGGCGACGAA CAGTATCAGC TCCGCTTCGG CTATATTCCC GCGCCAGACA TCACTGGAAA CGCCCTTTCG ACACTCGGTA TGCAGGGAGA CACCGTCGAC GACGATCGCC ACATCGACGA CGCCGTCTCC TATCTGCGAG AGAATCGGGC GGACAACGGG CTGTGGCTCG GCGTCCGGGG GCAGGGCTAC ACCTACGGCA CCGCCCGTGT GATGGAGGGG CTCCGAGCAG TCGACGTCGA CATGGATGAC GACTTCGTGA CCGCTGCCCG CGGAACACTA CTCAGCCGAC AGAACGACGA CGGTGGCTGG GGCGAACAGA CTCGCTACGA TCCGAGCCAG CCGGAATCGG GAAGCATCGA GTATCAACCG GGCGAGTCGA CGCCGATACA GACCGGCTGG GCGCTGCAGG CCCTCCTTTA TGGTGGCATC GGCCCCGATC GGGAGGCGGT CTGGGACGCC GTCGACTACC TGCTCGAAAC ACAGCAGTCC GACGGCTCGT GGGAGGTCGA TCCCGTGCTG TACACCTTCG GCGGCCCGGG GTACAGCACC GAAGCGATCA CACAGGCCGC AGTCCTCAGA GCACTCGGTC TGTACGAGTC GACCATCGAC TAG
|
Protein sequence | MKEYPTRRRF LAGSVGAMST LPVGTASGDI WDDEDEILGD DDDDDDIEEP DWDDLWRAIN RTETYLRREL SEDNTWYDDD LTHWDTTVRF EGAADLRTTV YYALLTDRIW GNEDERDDCL SYLLDRRSTD GGWDDTVTNF GMLLLFDRMD DHEYESVVED IFREIDAKNQ SLMASKDEAG FTDRFRMRLL YAVLSDDHSF DELFPRELPI RAGDLMALTS AFDGEAVSAQ DQTARPSYIT RQLAYFLLGA TAYDRELDED ETQLVEVAEN VLLSRRLTNG AWNTVPTTLF ATFALVERGY SWFDGEVGTP VRWIADNRLT DEGRVESYRL PVRDTTLVQK ALLAAGSSPD AEFLQESAQW LSEARTPQTV GRELDREPAP FRRHHGHGWG YMPHAFSNWD DTAVALGSLS ALTDGDLDEQ IDFLRRVQNR DGSWSAYTTD FAPFQSTEIA DEARLTIGDE QYQLRFGYIP APDITGNALS TLGMQGDTVD DDRHIDDAVS YLRENRADNG LWLGVRGQGY TYGTARVMEG LRAVDVDMDD DFVTAARGTL LSRQNDDGGW GEQTRYDPSQ PESGSIEYQP GESTPIQTGW ALQALLYGGI GPDREAVWDA VDYLLETQQS DGSWEVDPVL YTFGGPGYST EAITQAAVLR ALGLYESTID
|
| |