Gene Hmuk_0902 details


Gene Information

Locus tag: Hmuk_0902
Symbol:
ID: 8410417
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 867091
End bp: 868146
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 70%
IMG OID: 645019236
Product: Squalene/phytoene synthase
Protein accession: YP_003176738
Protein GI: 257386965
COG category: [I] Lipid transport and metabolism
COG ID: [COG1562] Phytoene/squalene synthetase
TIGRFAM ID:
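
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon; neither convention is stated in the record. The function names are hypothetical helpers written for illustration.

```python
def cds_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming exactly one stop codon is included."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record for Hmuk_0902.
gene_len = cds_length_bp(867091, 868146)
print(gene_len)                     # 1056
print(protein_length_aa(gene_len))  # 351
```

Under these assumptions both results agree with the listed gene length (1056 bp) and protein length (351 aa).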


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value: 0.883554
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.674566
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGGAG TGCCGCAGCA GGCGACGCTG GCTGACGATC GGACCTGGGC GTTCGAGGCC 
GTCCAGTCGG TCTCCCGGAC GTTCGCGCTG AGTGTCGAGT TGCTGGACGA GCCGATGACG
GAGTGGGTCT GTACCGGCTA TCTCCTCTGC CGGACCGCAG ACACGATCGA GGACGAACCG
ACGATCCCGA TGGGCCGACG CGCCGAGCTC TTAGAGACCT TCGACGCGAT GCTGGCCGAA
GAGTCGGAGA CGACCGTCGA GGACTTTCTC TCGGCCGTCG AGCCGGAGAC GCCGGCCGAC
GGGGGCGACG ACTGGGCCGT CCTCGGTCAG ACCGACCGGA TCGTCCGCCT CTGGCGGTCG
TTTCCCGACC CCGTCCAGGA CGGGATGCGC TCGATCACCC GCGAGATGGC GACGGGCATG
GCGGACATCC TGCGCCGCCA CGAGGACAGC GGCGGCCTCC GTCTGGAGAC GCTCGACGAG
CTCGAAGAGT ACTGCTGGTA CGTCGCCGGC ACCGTCGGCC AGCTGTTCAT GAAGCTCCAG
ACCGCCCGAG CCGACCCCGA CGACCCCACG CCGGACCCCG AAGACGCCCG CGCGTTCGCA
CTCCTGCTCC AGCTCGTCAA CATCGCCAAG GACGTTCGCG CCGACTGGGA CGAAGAGCAC
AACGTCTACC TGCCCGGCGA GTGGCTCGCC GAGGAAGAAC TCGACCACGA GGCCGTCGCC
GAGCCCGAGC ACTCGACCGC GGTCGCCCGC GTCGTCGGCC GGGTCGTCGA CCAGGCCGCC
GACTACGCAC ACGGTGCCCA GCGGTACCTC TCGACGGTCC CGGAGGGAGA CAACGGCGGT
CTCCTGGAGG CGACGGCGCT GCCCTACCTG CTGGCACTCG GGACGATCCG CGAACTCCGC
GAACGGACCG TCGACGCCGT CGAACAGCCC GACGCGGTCA AGCTCGAACG CGAGGAGGTC
GAGGCGCTGT TCGCCGAGGC CGAGGACGGC TTCACCCGCG ACCAGGTCCG CGATCTCGCA
GCCACGGTGC GAGCCGGTCC GTACCACGAG CAGTAG
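
The gene sequence is displayed in space-separated blocks of 10 bases, so the spacing has to be removed before any computation. A minimal sketch of how the GC content could be recomputed from the displayed text follows; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first displayed row is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Strip the display whitespace (spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed row of the gene sequence (six blocks of 10 bases).
first_row = "ATGACCGGAG TGCCGCAGCA GGCGACGCTG GCTGACGATC GGACCTGGGC GTTCGAGGCC"
seq = clean_sequence(first_row)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.1%}")   # GC content of this row only
```

Passing the full gene sequence to the same functions gives a value that can be compared with the GC content listed in the record.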
 
Protein sequence
MTGVPQQATL ADDRTWAFEA VQSVSRTFAL SVELLDEPMT EWVCTGYLLC RTADTIEDEP 
TIPMGRRAEL LETFDAMLAE ESETTVEDFL SAVEPETPAD GGDDWAVLGQ TDRIVRLWRS
FPDPVQDGMR SITREMATGM ADILRRHEDS GGLRLETLDE LEEYCWYVAG TVGQLFMKLQ
TARADPDDPT PDPEDARAFA LLLQLVNIAK DVRADWDEEH NVYLPGEWLA EEELDHEAVA
EPEHSTAVAR VVGRVVDQAA DYAHGAQRYL STVPEGDNGG LLEATALPYL LALGTIRELR
ERTVDAVEQP DAVKLEREEV EALFAEAEDG FTRDQVRDLA ATVRAGPYHE Q
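
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds the codon table by hand rather than relying on a library; it assumes the codon-to-amino-acid assignments of translation table 11 match the standard code, with the two differing only in which start codons are permitted. Since this gene begins with ATG, alternative start codons are not handled. `translate` is a hypothetical helper name.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGACCGGAG TGCCGCAGCA GGCGACGCTG"))  # MTGVPQQATL
# Last 12 bases, ending in the TAG stop codon.
print(translate("CACGAGCAGTAG"))                      # HEQ
```

Both fragments agree with the corresponding ends of the listed protein sequence (MTGVPQQATL at the start, HEQ at the end).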