Gene Hore_10820 details


Gene Information

Locus tag: Hore_10820
Symbol:
ID: 7312824
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 1174736
End bp: 1176391
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 43%
IMG OID: 643611519
Product: dihydroxy-acid dehydratase
Protein accession: YP_002508831
Protein GI: 220931923
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
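
The coordinate and length fields above agree with each other, which a short Python sketch can confirm. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the stop codon while the protein length does not; the page does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1174736, 1176391)
print(length)                  # 1656
print(protein_length(length))  # 551
```

Both values match the Gene Length and Protein Length fields reported for Hore_10820.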


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGGGAGTG AAACAATAAC ACAAGGTTTT AAGAGGGCCC CCCATAGATC TTTATTATAT 
GCGTTAGGTC TAGATGAAAA GGAGTTAGAA AAACCAATAA TAGGTATTGC CAGCTCTTAC
AGTGAAATCA TACCTGGTCA TAAGCACCTT GATAAGATTG CAGAAGCTGT AAAGTATGGG
GTTTACAGTG CCGGTGGGAC ACCGGTCATT TTTTCTACAA TCGGGGTCTG TGATGGTATT
GCTATGGGGC ATTCTGGTAT GAAATATTCA CTGGCCAGTC GAGAAATAAT AGCTGATTCA
GTGGAAACAG TTGTCAGAGC CCACCAGTTT GATGGGTTAG TTTTAGTACC CAACTGTGAT
AAGATTGTTC CTGGAATGTT GATGGCAGCG GCCAGATTAG ATATTCCAGC TATCGTTGTC
AGTGGAGGAC CCATGCTTGC CGGTGATTAT CAGGGCAAGT CGCTGGATCT TCATAATGTC
TTTGAAGCAG TTGGTGAAGT GAAAGCGGGT AAAATTACAG AAGGAGAACT GGAAAATATA
GAAAAAGCGG CCTGTCCCGG GTGTGGGTCA TGTGCCGGAA TGTTTACGGC AAATTCAATG
AACTGCTTAA CAGAAGTGCT GGGGATGGCT TTACCCGGAA ACGGAACTAT TCCTGCAGTT
TATGCTGAAA GGATCAGGCT TGCCAAAAAG TCAGGTAGAC AGATTATAAA TCTTGTCGAA
AAAAATATTA AACCTTCAGA TATTATGACC CGGGAGGCTT TTAAAAATGC TATCTGTGTT
GATATGGCCC TTGGATGTTC TACCAATACA GCCCTGCATC TACCGGCAAT AGCCCACGAG
GCTGGTCTTG ATTTAGAACT TGATTTATTT AACGATATAA GTAGGAAAGT ACCTCACATT
TGTAGTCTGA CACCAGCTGG AATTTATCAC ATAGAAGACT TATACAGGGT TGGCGGTATT
CCAGCTGTTA TGAAAGAACT CAGTGAGAAG GATTTAATAC AGCTTGATCA GCTTACCGTA
ACTGGAGATA CTGTTGGCAC TAACATAAGT AGAGTAGGTT ATATTGATCA TAAAATTATA
CGTCCTGTAA GTAATCCCTA TCACAATCAG GGAGGGCTGG CTGTCTTAAA GGGGAATATT
GCTCCCGGTG GTTCAGTAGT AAAGCAGGCA GCAGTAGCTG ACAGTATGAT GGTCCATAGA
GGTCCAGCCC GGGTTTTTAA AGGTGAAGAG GAAGCTGTTG ATGCCATCAT TAATGGTCAG
ATCAGCGAAG GGGATGTTGT AGTTATAACT TACGAAGGTC CCAGGGGCGG ACCCGGAATG
AGAGAGATGT TAACCCCTAC CTCCGCCCTG GCTGGTCTTG GCCTTGATGA TAAAGTTGCC
CTTATTACTG ATGGGCGTTT TTCCGGTGCT ACCCGGGGAG CTGCCATTGG TCATGTTTCT
CCTGAAGCAG CGTCAGGTGG ACCTATTGGA ATTATCCAGG ACGGCGATAT TATAGAAATT
GATATTCCTG CTAAATCTCT AAATGTAGAC ATATCAGAGG AAGAATTTGA GAAGAGAATG
AGTAATTTTA ATCCTGAATT ACCTGACATA TCAGGTTATC TGGGTCGATA TGCTAAACAT
GTTTCTTCTG CAAGTACCGG AGCAGTTTTA GAATGA
 
Protein sequence
MGSETITQGF KRAPHRSLLY ALGLDEKELE KPIIGIASSY SEIIPGHKHL DKIAEAVKYG 
VYSAGGTPVI FSTIGVCDGI AMGHSGMKYS LASREIIADS VETVVRAHQF DGLVLVPNCD
KIVPGMLMAA ARLDIPAIVV SGGPMLAGDY QGKSLDLHNV FEAVGEVKAG KITEGELENI
EKAACPGCGS CAGMFTANSM NCLTEVLGMA LPGNGTIPAV YAERIRLAKK SGRQIINLVE
KNIKPSDIMT REAFKNAICV DMALGCSTNT ALHLPAIAHE AGLDLELDLF NDISRKVPHI
CSLTPAGIYH IEDLYRVGGI PAVMKELSEK DLIQLDQLTV TGDTVGTNIS RVGYIDHKII
RPVSNPYHNQ GGLAVLKGNI APGGSVVKQA AVADSMMVHR GPARVFKGEE EAVDAIINGQ
ISEGDVVVIT YEGPRGGPGM REMLTPTSAL AGLGLDDKVA LITDGRFSGA TRGAAIGHVS
PEAASGGPIG IIQDGDIIEI DIPAKSLNVD ISEEEFEKRM SNFNPELPDI SGYLGRYAKH
VSSASTGAVL E
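
The gene sequence starts with TTG, yet the protein starts with M. This is consistent with translation table 11, where TTG can act as an initiation codon. The sketch below is a minimal illustration rather than the database's own pipeline. It assumes the table 11 amino-acid assignments (the same as the standard code) and an assumed set of table 11 start codons, and it translates only the first row of the gene sequence. The helper names are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); table 11 shares these
# amino-acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed table 11 initiation codons; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the 10-base grouping spaces and line breaks, then upper-case."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if index == 0 and codon in START_CODONS:
            residue = "M"  # alternative start codons still initiate with Met
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_row = "TTGGGGAGTG AAACAATAAC ACAAGGTTTT AAGAGGGCCC CCCATAGATC TTTATTATAT"
print(translate_cds(first_row))  # MGSETITQGFKRAPHRSLLY
```

The output matches the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.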