Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1206 |
Symbol | |
ID | 7399473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1214957 |
End bp | 1216024 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643708271 |
Product | Squalene/phytoene synthase |
Protein accession | YP_002565870 |
Protein GI | 222479633 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1562] Phytoene/squalene synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000204086 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00000600855 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGGAC GAGCACACGG TCCCGCTGAG CCCGACCTCT CGTGGTGTCA CGAGGCGGTT CAGGGCGTTT CTCGGACCTT CGCGCTGACG GTCGACGTGT TGGAGGAGCC GATGGCCTCG CATATCTGCG TGGGATACCT CCTCTGTCGG GTCGCCGATA CCGTCGAGGA CGCCGGACAC ATCCCTCCTG AAGCTCAAAG CGACGTGTTG CGGACGTACC GGCGCGCGAT CGACCCCGAC GACGACACCG ACATTGTGGC GTTCCGCGAC GCGGTCGACG AGTGGCTCCC GGCGCCCGCC GACCGCAACG ACGACTGGGA AGTCGTCGCC GAGGTACCGA CGATCGTGGC GACGTTCGCG GAGTTCGAGC CCGAGGCCCG CGACGCGATC GTCCCACCCG TCTTGGAAAT GGTTGACGGG ATGGCGATGT TCGTCGATCG GCACGCCACC GAGGGCGGGC TCCGTATCGA CGACCGCGAC GAACTAGAGC AGTACTGCTA CTACGCTGCC GGCACGGTCG GGAACCTCAT CACGAACCTG CTCACCCGTG GCGACGTCAC CGAGGACCGC GCGCGGCAAC TACGAGACAC AGCCGAGGAG TTCGGACTCC TCCTCCAACT CGTGAACGTC TCGAAAGACG TGTACGACGA CTACACCGAG GAGAACAACG TCTACCTTCC CGCCGAGTGG CTCGCCGACG AGGGGGTCGA CCAAGAGCGC GTCGTCCACC CGGAGAATCG GGAGTCGTCC GCCCGCGTCG TCAGCCGGAC CGCCGAGTAC GCCCGGTCGT TCCTCGACGA CGCGCAGGCG TACTTAGAGA CGATGCCGCT CTCGAACGGA AACACGATGG AGGCGTGGAC CGTCCCGTAC CTGCTCGCGG TCGGTACCCT CCGCGAACTC AGTTCCCGTC CCGAGGACGC GCTCACCGAA ACCGGCGTGA AAATCTCCCG CCAGGAGGTG TTCGCGGTGA TGGCCGTCGC CGGCGACGCC GGCCGCGACT CCCTCGCAGA TCTCCGGCAG ACGATCGCGC GGACTCCCTT CCACCGGGCC GTCGAGCCTG CGGACTGA
|
Protein sequence | MSGRAHGPAE PDLSWCHEAV QGVSRTFALT VDVLEEPMAS HICVGYLLCR VADTVEDAGH IPPEAQSDVL RTYRRAIDPD DDTDIVAFRD AVDEWLPAPA DRNDDWEVVA EVPTIVATFA EFEPEARDAI VPPVLEMVDG MAMFVDRHAT EGGLRIDDRD ELEQYCYYAA GTVGNLITNL LTRGDVTEDR ARQLRDTAEE FGLLLQLVNV SKDVYDDYTE ENNVYLPAEW LADEGVDQER VVHPENRESS ARVVSRTAEY ARSFLDDAQA YLETMPLSNG NTMEAWTVPY LLAVGTLREL SSRPEDALTE TGVKISRQEV FAVMAVAGDA GRDSLADLRQ TIARTPFHRA VEPAD
|
| |