Gene Information |
Locus tag | Huta_2049 |
Symbol | |
ID | 8384343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2070073 |
End bp | 2071113 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644973119 |
Product | Polyprenyl synthetase |
Protein accession | YP_003130950 |
Protein GI | 257053117 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0142] Geranylgeranyl pyrophosphate synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000493992 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGATA CTGTCGACTC GCAGGCGGTC ATGGAGGCGA TCGAGTGGCG ACGCGGGCAG GTCAACGACG CGATTCCCGA GAACCTCCCG GTCGTCGAGC CCAAGAAGCT CTATGAGGCC TCGCGGTACC TGCTGGACGC CGGCGGCAAG CGACTCCGGC CGACGATCCT GCTGCTCGTA GCCGAGTCGA TCGCCGACGT CCTCCCGCGG AGCGAGCCGT ATCGAGAGTT CCCGGCCGCC GAGGGCCCAA TCGACATGAT GTCCGCCGCG GTGAGCATCG AGATCATCCA GTCGTTTACG CTCATCCACG ACGACATCAT GGACGACGAC GACATGCGCC GGGGCGTCCC GGCCGTCCAC CGGGAGTTCG ACCTCTCAAC GGCGATCCTG GCGGGTGACA CTCTTTATGC GAAGGCCTTC GAGAACATGC TCGAAACCGG CTCGACCGGC GACCGCTCGG TCCGGGCGCT GTCGGAACTC GCGACGACCT GTACCCAGAT CTGTGAGGGC CAGTCGATGG ACATCGAGTT CGAGACCGAC GAGACGGTCA CGACCGAGGA CTATCTGGAG ATGGTCGAAC TCAAGACGGC GGTGCTTTAC GCCGCCGCGG CGTCGATCCC GCCGATCCTC ATGGGCGAGG ACGACTACGT CGATCCGCTC TACCAGTACG GCCTCAACAT CGGCCGGGGC TTCCAGATCC AGGACGACCT GCTGGATCTG ACGACGCCGA GCGAGAAACT CGGCAAGCAA CGGGGCAGCG ACCTCGTCGA GAACAAGCGG ACGATCATCA CCGTCCACGC CCGCAATCAG GGCGTCGACG TCGAAAATCT CGTCCCGACC GACGACGTCG ACGCCGTCGA CGAGGCGACC ATCGACGAGG CCGTTGCCGA GCTAGAGGAA GCCGGCAGCA TCGACTTCGC CCGCGAGACG GCCGAGGGAC TCATCAGGGA CGGCAAGCGG AACCTCGAAG TGCTCCCCGA CAACGAGTCC CGGGATCTGT TGGAAGGCAT CGCCGACTTC CTGATCGAAC GCGAGTACTG A
|
Protein sequence | MTDTVDSQAV MEAIEWRRGQ VNDAIPENLP VVEPKKLYEA SRYLLDAGGK RLRPTILLLV AESIADVLPR SEPYREFPAA EGPIDMMSAA VSIEIIQSFT LIHDDIMDDD DMRRGVPAVH REFDLSTAIL AGDTLYAKAF ENMLETGSTG DRSVRALSEL ATTCTQICEG QSMDIEFETD ETVTTEDYLE MVELKTAVLY AAAASIPPIL MGEDDYVDPL YQYGLNIGRG FQIQDDLLDL TTPSEKLGKQ RGSDLVENKR TIITVHARNQ GVDVENLVPT DDVDAVDEAT IDEAVAELEE AGSIDFARET AEGLIRDGKR NLEVLPDNES RDLLEGIADF LIEREY
|
| |
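The coordinate, length, and GC fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source database: it assumes Start bp and End bp are 1-based inclusive coordinates, that Gene Length includes the terminal stop codon (the gene sequence above ends in TGA), and that sequences are written in space-separated blocks as shown here. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; spaces between sequence blocks are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


if __name__ == "__main__":
    length = gene_length(2070073, 2071113)  # Start bp and End bp from the record
    print(length, protein_length(length))
```

Under these assumptions the coordinates give 1041 bp and 346 residues, matching the Gene Length and Protein Length fields. Passing the Gene sequence field to `gc_fraction` allows the same kind of check against the GC content field.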