Gene Huta_2029 details

Gene Information

Locus tag: Huta_2029
Symbol:
ID: 8384323
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 2046713
End bp: 2047951
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 64%
IMG OID: 644973099
Product: GTP cyclohydrolase II
Protein accession: YP_003130930
Protein GI: 257053097
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase
TIGRFAM ID: [TIGR00505] GTP cyclohydrolase II
            [TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase
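
The coordinate and length fields in this record agree with each other, which can be checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Start bp and End bp fields of the record.
start_bp = 2046713
end_bp = 2047951

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the result is 1239 bp and 412 aa, matching the Gene Length and Protein Length fields.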


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGTCA TCGAATCGGA CCGGGATCGC TCGCGCTTCG ACGACATCGA GACGGCGATC 
GACGCCATCG AGGACGGCGA GATGATCCTG TTGGTCGACG AGCAGAGTCG CGAGGACGAG
GGCGACCTGT ACGTGCCCGC GGAGGCGATC ACCCCCGAGC AGATGAATTT CATGCTCAAG
CACGGCCGTG GACTGGTCTG TGCGCCTGTC AGCCCCGAAA TCACGGACGA TCTGGGCCTC
GAACAGATGG TCCCCGCCTC CGAGAACACC GAGGAGATGA GTACCCGCTT CACCGTCTCG
GTCGACGCCG CCTCGACGGG GACGGGCATC TCGGCGTACG ATCGCGCCGA GACGGTCCAG
GCGCTGGTCG ATCCCGAGAC CGAACCGGCA GATCTCGACA AACCAGGACA CATCTTCCCC
CTGGAGGCCA AAGCCGACGG CGTGCTCGAC CGCGAGGGTC ACACCGAAGC GGCGGTCGAC
CTCGCCCGAA TCGCCGGCTA CCGGCCCGGC GGCGTGATCT GCGAGGTCGT CGACGACGAC
GGCACGATGG CCCGGGAGGA TCGGCTGCTG GAATTCGCCG ACGAACACGA GTTGCCGATC
GTAACCGTTG CCGATGTCCT CGAGTATCGC CACCTGACCG AGACGCTCGT CTCCCGGGAA
GTCGACACCC GGCTGCCGAC GGCGTTCGGT ACCTTCGACA TGTACGGGTA CGACTACCGC
GGCGAGACCC ACGTCGCGCT GGTCAACCTC GACGACGTCG ATCCCGAGAC CGACCGGCCG
CTGGTCCGGA TCCACTCGAA GTGTCTGACC GGCGACGCCT TGCACTCCCT GAAGTGTGAC
TGCGGGTTCC AGCTCGAAGA GACCATGCAA CGCATCAGCG ACGAGGGTGG CGTCCTGCTG
TACCTCGATC AGGAGGGACG CGGGATCGGC CTGCTCAACA AACTCAAGGC CTACGAGTTA
CAGGAGCACG GCTACGACAC CGTCGAGGCC AACGTCGAAC TCGGGTTCGA ACCCGACGAG
CGACGCTTCG ACGCGGCCGC CCAGATGCTC AGGGACATCG GGCTGGATCG CGTGCGTCTG
CTGACGAACA ACCCGCGGAA GGCCGCCGCG CTCGAACGGT TCGAGTTCGA CGTCGAAATC
GACTCCCTGG AGATCGAACC CAACCCCGAG AACGAGGCCT ATCTCGCGAC CAAAGCCGAA
AAGCTCGACC ATCAACTCGA CGTGTTCAAC TCCGACTGA
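
The GC content field can be recomputed from a gene sequence laid out like the one above. This is a minimal sketch: `gc_percent` is a hypothetical helper, not part of the database, and how the record rounds its own figure is not stated. The example input is only the first row of the sequence, so its value is not expected to equal the figure for the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    # Drop the spaces and line breaks that group the sequence in blocks of ten.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)

# First row of the gene sequence above, used as a small example input.
first_row = "ATGACAGTCA TCGAATCGGA CCGGGATCGC TCGCGCTTCG ACGACATCGA GACGGCGATC"
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence as one string gives the value to compare with the GC content field.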
 
Protein sequence
MTVIESDRDR SRFDDIETAI DAIEDGEMIL LVDEQSREDE GDLYVPAEAI TPEQMNFMLK 
HGRGLVCAPV SPEITDDLGL EQMVPASENT EEMSTRFTVS VDAASTGTGI SAYDRAETVQ
ALVDPETEPA DLDKPGHIFP LEAKADGVLD REGHTEAAVD LARIAGYRPG GVICEVVDDD
GTMAREDRLL EFADEHELPI VTVADVLEYR HLTETLVSRE VDTRLPTAFG TFDMYGYDYR
GETHVALVNL DDVDPETDRP LVRIHSKCLT GDALHSLKCD CGFQLEETMQ RISDEGGVLL
YLDQEGRGIG LLNKLKAYEL QEHGYDTVEA NVELGFEPDE RRFDAAAQML RDIGLDRVRL
LTNNPRKAAA LERFEFDVEI DSLEIEPNPE NEAYLATKAE KLDHQLDVFN SD
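
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below shows one way to reproduce it. `translate` and `CODON_TABLE` are illustrative names, and the code assumes the amino acid assignments of the standard genetic code, which NCBI translation table 11 shares with table 1 (the two differ in their permitted start codons). It does not handle ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used for display.
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence above.
print(translate("ATGACAGTCA TCGAATCGGA CCGGGATCGC"))  # prints MTVIESDRDR
```

The ten residues printed match the first block of the protein sequence above.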