Gene Hore_15920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_15920 
Symbol 
ID7312628 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1705287 
End bp1706252 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content36% 
IMG OID643612039 
ProductSqualene synthase 
Protein accessionYP_002509336 
Protein GI220932428 
COG category[I] Lipid transport and metabolism 
COG ID[COG1562] Phytoene/squalene synthetase 
TIGRFAM ID[TIGR01559] farnesyl-diphosphate farnesyltransferase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.13397 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTGATT CCTTAAAATT TTGCAAAAAA ATGCTGCCAA AGGTTTCAAG AAGTTTTGCC 
CTTACCATAC CATTACTTGA TGATAAGCTT TATATGCCTG TTTTGACTGT ATACCTTCAG
GATAGATTAC TGGATAATTT TGAAGATGAA GTTAAGGGGA TAGATATTGA TACCAGAAAA
GATTTAATGG ATAGTGTAGT TGCCCTTTTT GACCCGGATA ATGAGGGAAC AAATTATATT
TGCAACCGGT TAAAGGGCTA TTCTGACTAT ATTCCTGATA ATGATTTACA GGCCTTGACC
GAAGGGTGCC ATTTGTTAAA AGAAGTATAC CAGAATTTAC CCCATAAGGT TCAGAGGCTA
TCCTTTAAAT GGTTAAATGA GATGAATGAG GGAATGAAAA AATACTTAAC TCTCAAGATT
AAAACCTTTG AAGAACTTGA TGAATATTGT TATTATGTAG CCGGTACAGT TGGAGGATTT
TTAACTGAAC TGGTACTCCT TTACGGTGAT ATCCATGATC ATAAACAGAA GGAAGATCTC
TTAGAAAATT TCACAGAGGC CGGGTTGTTT CTGCAGAAAA TAAATATAAT CAGGGATATA
AAAATAGATT TAGAGGAACG GAACAAATCT TTCTGGCCCA TGGAAGAACT GGGATTATCG
GATGAAAAAA TTTTAAATCC TGATTATAAA GAAGATGCCA GCCGTGCCCT CAGGGTAATG
CTTTCTAATG CAAAGGGGCA TATTGAAGGA CTGGCTACAT ACTTTGAAGC CATACCTGAT
AATCTGCCGG GTTACAGGAA ATTTTTCAGT GTTAATAATG CACTGGGAAT AGCTACCCTC
GAGGAACTCG AAGGAAACCT TAAACTCTTT TACGGACGGG GCAAGGTGAA AGTCAGTAAA
ATGAAATTCC TGAAAATACT AAATGATCCT GTTAAATTCT TCAGACAGAT GGTAATAAGA
TATTAG
 
Protein sequence
MVDSLKFCKK MLPKVSRSFA LTIPLLDDKL YMPVLTVYLQ DRLLDNFEDE VKGIDIDTRK 
DLMDSVVALF DPDNEGTNYI CNRLKGYSDY IPDNDLQALT EGCHLLKEVY QNLPHKVQRL
SFKWLNEMNE GMKKYLTLKI KTFEELDEYC YYVAGTVGGF LTELVLLYGD IHDHKQKEDL
LENFTEAGLF LQKINIIRDI KIDLEERNKS FWPMEELGLS DEKILNPDYK EDASRALRVM
LSNAKGHIEG LATYFEAIPD NLPGYRKFFS VNNALGIATL EELEGNLKLF YGRGKVKVSK
MKFLKILNDP VKFFRQMVIR Y