Gene Huta_2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_2049 
Symbol 
ID8384343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp2070073 
End bp2071113 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content64% 
IMG OID644973119 
ProductPolyprenyl synthetase 
Protein accessionYP_003130950 
Protein GI257053117 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0142] Geranylgeranyl pyrophosphate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000493992 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGATA CTGTCGACTC GCAGGCGGTC ATGGAGGCGA TCGAGTGGCG ACGCGGGCAG 
GTCAACGACG CGATTCCCGA GAACCTCCCG GTCGTCGAGC CCAAGAAGCT CTATGAGGCC
TCGCGGTACC TGCTGGACGC CGGCGGCAAG CGACTCCGGC CGACGATCCT GCTGCTCGTA
GCCGAGTCGA TCGCCGACGT CCTCCCGCGG AGCGAGCCGT ATCGAGAGTT CCCGGCCGCC
GAGGGCCCAA TCGACATGAT GTCCGCCGCG GTGAGCATCG AGATCATCCA GTCGTTTACG
CTCATCCACG ACGACATCAT GGACGACGAC GACATGCGCC GGGGCGTCCC GGCCGTCCAC
CGGGAGTTCG ACCTCTCAAC GGCGATCCTG GCGGGTGACA CTCTTTATGC GAAGGCCTTC
GAGAACATGC TCGAAACCGG CTCGACCGGC GACCGCTCGG TCCGGGCGCT GTCGGAACTC
GCGACGACCT GTACCCAGAT CTGTGAGGGC CAGTCGATGG ACATCGAGTT CGAGACCGAC
GAGACGGTCA CGACCGAGGA CTATCTGGAG ATGGTCGAAC TCAAGACGGC GGTGCTTTAC
GCCGCCGCGG CGTCGATCCC GCCGATCCTC ATGGGCGAGG ACGACTACGT CGATCCGCTC
TACCAGTACG GCCTCAACAT CGGCCGGGGC TTCCAGATCC AGGACGACCT GCTGGATCTG
ACGACGCCGA GCGAGAAACT CGGCAAGCAA CGGGGCAGCG ACCTCGTCGA GAACAAGCGG
ACGATCATCA CCGTCCACGC CCGCAATCAG GGCGTCGACG TCGAAAATCT CGTCCCGACC
GACGACGTCG ACGCCGTCGA CGAGGCGACC ATCGACGAGG CCGTTGCCGA GCTAGAGGAA
GCCGGCAGCA TCGACTTCGC CCGCGAGACG GCCGAGGGAC TCATCAGGGA CGGCAAGCGG
AACCTCGAAG TGCTCCCCGA CAACGAGTCC CGGGATCTGT TGGAAGGCAT CGCCGACTTC
CTGATCGAAC GCGAGTACTG A
 
Protein sequence
MTDTVDSQAV MEAIEWRRGQ VNDAIPENLP VVEPKKLYEA SRYLLDAGGK RLRPTILLLV 
AESIADVLPR SEPYREFPAA EGPIDMMSAA VSIEIIQSFT LIHDDIMDDD DMRRGVPAVH
REFDLSTAIL AGDTLYAKAF ENMLETGSTG DRSVRALSEL ATTCTQICEG QSMDIEFETD
ETVTTEDYLE MVELKTAVLY AAAASIPPIL MGEDDYVDPL YQYGLNIGRG FQIQDDLLDL
TTPSEKLGKQ RGSDLVENKR TIITVHARNQ GVDVENLVPT DDVDAVDEAT IDEAVAELEE
AGSIDFARET AEGLIRDGKR NLEVLPDNES RDLLEGIADF LIEREY