Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2029 |
Symbol | |
ID | 8384323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 2046713 |
End bp | 2047951 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644973099 |
Product | GTP cyclohydrolase II |
Protein accession | YP_003130930 |
Protein GI | 257053097 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase |
TIGRFAM ID | [TIGR00505] GTP cyclohydrolase II [TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGTCA TCGAATCGGA CCGGGATCGC TCGCGCTTCG ACGACATCGA GACGGCGATC GACGCCATCG AGGACGGCGA GATGATCCTG TTGGTCGACG AGCAGAGTCG CGAGGACGAG GGCGACCTGT ACGTGCCCGC GGAGGCGATC ACCCCCGAGC AGATGAATTT CATGCTCAAG CACGGCCGTG GACTGGTCTG TGCGCCTGTC AGCCCCGAAA TCACGGACGA TCTGGGCCTC GAACAGATGG TCCCCGCCTC CGAGAACACC GAGGAGATGA GTACCCGCTT CACCGTCTCG GTCGACGCCG CCTCGACGGG GACGGGCATC TCGGCGTACG ATCGCGCCGA GACGGTCCAG GCGCTGGTCG ATCCCGAGAC CGAACCGGCA GATCTCGACA AACCAGGACA CATCTTCCCC CTGGAGGCCA AAGCCGACGG CGTGCTCGAC CGCGAGGGTC ACACCGAAGC GGCGGTCGAC CTCGCCCGAA TCGCCGGCTA CCGGCCCGGC GGCGTGATCT GCGAGGTCGT CGACGACGAC GGCACGATGG CCCGGGAGGA TCGGCTGCTG GAATTCGCCG ACGAACACGA GTTGCCGATC GTAACCGTTG CCGATGTCCT CGAGTATCGC CACCTGACCG AGACGCTCGT CTCCCGGGAA GTCGACACCC GGCTGCCGAC GGCGTTCGGT ACCTTCGACA TGTACGGGTA CGACTACCGC GGCGAGACCC ACGTCGCGCT GGTCAACCTC GACGACGTCG ATCCCGAGAC CGACCGGCCG CTGGTCCGGA TCCACTCGAA GTGTCTGACC GGCGACGCCT TGCACTCCCT GAAGTGTGAC TGCGGGTTCC AGCTCGAAGA GACCATGCAA CGCATCAGCG ACGAGGGTGG CGTCCTGCTG TACCTCGATC AGGAGGGACG CGGGATCGGC CTGCTCAACA AACTCAAGGC CTACGAGTTA CAGGAGCACG GCTACGACAC CGTCGAGGCC AACGTCGAAC TCGGGTTCGA ACCCGACGAG CGACGCTTCG ACGCGGCCGC CCAGATGCTC AGGGACATCG GGCTGGATCG CGTGCGTCTG CTGACGAACA ACCCGCGGAA GGCCGCCGCG CTCGAACGGT TCGAGTTCGA CGTCGAAATC GACTCCCTGG AGATCGAACC CAACCCCGAG AACGAGGCCT ATCTCGCGAC CAAAGCCGAA AAGCTCGACC ATCAACTCGA CGTGTTCAAC TCCGACTGA
|
Protein sequence | MTVIESDRDR SRFDDIETAI DAIEDGEMIL LVDEQSREDE GDLYVPAEAI TPEQMNFMLK HGRGLVCAPV SPEITDDLGL EQMVPASENT EEMSTRFTVS VDAASTGTGI SAYDRAETVQ ALVDPETEPA DLDKPGHIFP LEAKADGVLD REGHTEAAVD LARIAGYRPG GVICEVVDDD GTMAREDRLL EFADEHELPI VTVADVLEYR HLTETLVSRE VDTRLPTAFG TFDMYGYDYR GETHVALVNL DDVDPETDRP LVRIHSKCLT GDALHSLKCD CGFQLEETMQ RISDEGGVLL YLDQEGRGIG LLNKLKAYEL QEHGYDTVEA NVELGFEPDE RRFDAAAQML RDIGLDRVRL LTNNPRKAAA LERFEFDVEI DSLEIEPNPE NEAYLATKAE KLDHQLDVFN SD
|
| |