Gene Hore_10010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_10010 
Symbol 
ID7314589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1091911 
End bp1093125 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content41% 
IMG OID643611440 
Product3,4-dihydroxy-2-butanone 4-phosphate synthase 
Protein accessionYP_002508752 
Protein GI220931844 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase
[COG0807] GTP cyclohydrolase II 
TIGRFAM ID[TIGR00505] GTP cyclohydrolase II
[TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACAA TTGAAGAAGC AATTGAAGAT ATTAAACAGG GGAAAATGGT TATAGTAGTT 
GATGATGAAG ACAGGGAAAA TGAAGGAGAT CTGGTTATGG CTGCCGAAAA AGTAACTCCT
GAAAGTATTA ATTTTATGAT CAGGTTTGCC CGCGGGCTGG TCTGTGTTCC CATGGAAGAG
GAAAGACTTA GAGAACTTGA TTTACCTATG ATGGTTGTTA ATAATACCGA CCCCCAAGAA
ACAGGGTTTA CTCTTTCAGT AGACCACCGG GATACAACAA CTGGAATCTC AGCCAGGGAA
CGGGCTTTGA CTGTTAAAGA ACTGGTTAGC GATAATAGTA AACATACAGA TTTCAGGCGA
CCCGGTCATA TTTTCCCCCT CCGCGCCAGG CCAGGCGGGG TTTTAAGGAG GGCAGGACAT
ACTGAAGCTG CAGTAGATCT GGCCAGGCTG GCTGGTCTGA AACCAGCCGG TGTTATTTGT
GAGATTATAA AAGAAGATGG TACAATGGCC AGGTTACCTG ATCTGGAGGA GTTTTCTAAA
AAGCATGGAC TTAAAATAGT TACAATTGAA GATTTAATAA AATATCGTAA GTCTAAAGAA
AAACTTGTCA AAAAAGAGGC TGAAGCCCAA CTTCCAACCA GGTTTGGTGA TTTTAAAATA
AAAGTTTATA AGTCTATCAT TGGTAACCAG GAACATATTG CCCTGGTAAA AGGTGAGGTA
GCTGGCAAAA AGAATGTTCT GGTAAGAGTA CACTCAGAAT GTTTAACAGG TGATACCTTT
GGCTCTTTAA GATGTGATTG TGGTCAACAG CTGGCCGCAG CTTTGAAAAT GATTGATAAA
GAAGGCCATG GTGTTTTGCT ATACATGAGA CAGGAGGGAA GAGGGATTGG TCTTTTAAAC
AAAATAAAGG CATATGCTTT ACAGGATAAG GGGATGGATA CTGTTGAAGC CAATCTGGCC
CTTGGCTTTC CTCCTGATCT GAGGGATTAC GGGATAGGAG CCCAGATTCT GGCAGACCTG
GGGTTAACCT CTATACGTCT TCTTACCAAT AACCCACGAA AGATTATCGG ACTGGAAGGG
TATGGTCTTA TGGTTGTAGA CAGGGTTCCC ATTGAAATTG AACCCAATGA AACCAATAAA
TATTACCTTG AGATAAAAAG GGATAAAATG GGTCATTTAT TAAATTTAGA AGAAAGGGGT
AAAAATAATG GGTAA
 
Protein sequence
MNTIEEAIED IKQGKMVIVV DDEDRENEGD LVMAAEKVTP ESINFMIRFA RGLVCVPMEE 
ERLRELDLPM MVVNNTDPQE TGFTLSVDHR DTTTGISARE RALTVKELVS DNSKHTDFRR
PGHIFPLRAR PGGVLRRAGH TEAAVDLARL AGLKPAGVIC EIIKEDGTMA RLPDLEEFSK
KHGLKIVTIE DLIKYRKSKE KLVKKEAEAQ LPTRFGDFKI KVYKSIIGNQ EHIALVKGEV
AGKKNVLVRV HSECLTGDTF GSLRCDCGQQ LAAALKMIDK EGHGVLLYMR QEGRGIGLLN
KIKAYALQDK GMDTVEANLA LGFPPDLRDY GIGAQILADL GLTSIRLLTN NPRKIIGLEG
YGLMVVDRVP IEIEPNETNK YYLEIKRDKM GHLLNLEERG KNNG