Gene Hore_13100 details


Gene Information

Locus tag: Hore_13100
Symbol:
ID: 7313631
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 1406323
End bp: 1407333
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 36%
IMG OID: 643611750
Product: L-threonine 3-dehydrogenase
Protein accession: YP_002509055
Protein GI: 220932147
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
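
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1406323
end_bp = 1407333

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1011 336
```

Under these assumptions the computed values agree with the reported Gene Length (1011 bp) and Protein Length (336 aa).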


Plasmid Coverage information

Num covering plasmid clones: 46
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAAAGTT TAGTTTTTTA TGGACCAGGA AATGTAAAAA TTGAAGATAA AGAGATTCCT 
AAAATAGATG AAAATGAAGT ACTTGTTAAA GTTAAAGCAG CTGGTATCTG TGGTACTGAT
AGACATATTT ACAGAGGAGA GGCACCTGCC CGGACTCAGG TAATACTGGG GCATGAAAAT
GCAGGAGAGA TAATTGAGAC TGGACGACAG GTGCGTAGTC TCAAAAAAGG TGATAAAGTT
TGCATAGATC CCAATATATT TTGTGGTCAA TGTTATTATT GTCACCGTGG AGAAGTTCAT
TTATGTAAAG AATTGCAGGC AATCGGAGTT ACAAGAAATG GTGGTTTTGC TGAGTATTTG
GTTGCTCCTG CTACTAATGT TTATAAAGTT AAGGAGAATG TTAGTTATAA AGAAATGGCT
CTTGTAGAAC CACTGGCCTG TTGCCTCCAT GGGATTGACC TTGCCGGGAT AAGGCCTGGA
GATTTTGTTG TGATTTTGGG GGCAGGAGCT ATTGGTTTAA TTTTACTTCA GCTGGCATTA
CATAGTGGTG CCAGTGAGGT AATAGTTAGT GAACCAAATT CAAAAAAACG AAAACTAGCC
CTGAAATTAG GGGCCAGTAA AGTATTTGAT CCCTTTAATG ATAATCTTGA AAAAGAAATT
AAAATAATTA AAAGGGAAGG AGCAGATGTT GTAATAGAGG CCGTGGGTAA TATCCATACT
TTTAAGCAAT CTATTCAGTT AGCCCGAAAG GGAGGTAGTG TTCTTTTATT TGGTGTTCCT
CCTAAAGATA AGGTTATAGA AATTAACCCT TTTGAAATTT ATAAAAGGGA ATTAAAATTA
AAGGGGGCAT ATATTAATCC CTTTGTTACT GATAGAGCTG TTAGAATACT GGAATCAGGG
ATGGTTAATT TAAAAAAGTT AGTAACTTCT CAATACACAC TTGAAGAGCT ACCAAATGTT
TTGGACGGAA ATTTAGATGA GAGTAATGTC AAATCGCTAG TTATATATTA A
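
The GC content field could in principle be recomputed from the gene sequence above. The following is a minimal sketch, not the method the database itself uses: the function name `gc_content` is hypothetical, and it assumes the sequence contains only A, C, G, and T plus the whitespace used to group bases into blocks of ten.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # Remove the spaces and line breaks used to group bases in the listing.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# First ten bases of the gene sequence: two G, no C.
print(gc_content("TTGAAAAGTT"))  # 20.0
```

Passing the full gene sequence, with its spaces and line breaks left in, to `gc_content` gives a value to compare against the reported GC content.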
 
Protein sequence
MKSLVFYGPG NVKIEDKEIP KIDENEVLVK VKAAGICGTD RHIYRGEAPA RTQVILGHEN 
AGEIIETGRQ VRSLKKGDKV CIDPNIFCGQ CYYCHRGEVH LCKELQAIGV TRNGGFAEYL
VAPATNVYKV KENVSYKEMA LVEPLACCLH GIDLAGIRPG DFVVILGAGA IGLILLQLAL
HSGASEVIVS EPNSKKRKLA LKLGASKVFD PFNDNLEKEI KIIKREGADV VIEAVGNIHT
FKQSIQLARK GGSVLLFGVP PKDKVIEINP FEIYKRELKL KGAYINPFVT DRAVRILESG
MVNLKKLVTS QYTLEELPNV LDGNLDESNV KSLVIY
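
The gene sequence begins with TTG rather than ATG, while the protein begins with M. This is consistent with translation table 11, where TTG is one of the alternative initiation codons. The sketch below illustrates that relationship. The names `translate_cds`, `CODON_TABLE`, and `START_CODONS` are hypothetical, and the start-codon set is my reading of NCBI table 11, so check it against the NCBI genetic code listing before relying on it.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same elongation code as the standard table).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: initiation codons of translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a recognized first codon as M and stopping at a stop codon."""
    clean = "".join(dna.split()).upper()
    protein = []
    for index in range(0, len(clean) - 2, 3):
        codon = clean[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence (60 bases, 20 codons).
first_line = "TTGAAAAGTT TAGTTTTTTA TGGACCAGGA AATGTAAAAA TTGAAGATAA AGAGATTCCT"
print(translate_cds(first_line))  # MKSLVFYGPGNVKIEDKEIP
```

The output matches the first twenty residues of the protein sequence listed above.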