Gene Hore_20680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_20680 
Symbol 
ID7314392 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2237364 
End bp2238350 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content44% 
IMG OID643612512 
ProductUDP-glucose 4-epimerase 
Protein accessionYP_002509808 
Protein GI220932900 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1087] UDP-glucose 4-epimerase 
TIGRFAM ID[TIGR01179] UDP-glucose-4-epimerase 


Plasmid Coverage information

Num covering plasmid clones55 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATAC TGGTGACAGG AGGAGCCGGT TATATTGGGA GTCATGTAGT GAAAAGCCTG 
TTTGAGGCTG GTTATAATGT TGTTACCCTG GATAATCTTG AGAAGGGTCA CCGGGAAGCT
GTCCTGGGCG GTGAGTTTAT TAAGGGTGAT CTCAAGGACA GAGAGCTGTT AGACAGCATA
ATGAAAGATT ATGAAATAGA TGGTGTCATT CATCTGGCTG CCCACAGTCT GGTAGGAGAG
TCAATGGAAA ACCCGGGGAA GTATTATAAA AATAATGTTT CCAATGGCTT AAATTTACTG
GAAGCTATGG TTGATAATGA TGTGAAATAC CTGGTTTTTT CTTCTACAGC TGCAGTTTAT
GGGGAACCCA GGGAAGTCCC CATCACAGAA GATCATCCAA CAGCTCCGAC AAATACCTAT
GGGGAGAGTA AACTCTTTTT TGAAAAGATG ATGAAACGGT ATGATGAAAT TTATGGACTT
AAGTATGTAT CCCTCCGTTA CTTTAATGCA GCCGGGGCCG ATCTATCAGG TAAAATTGGG
GAAGACCATG ACCCTGAGAC CCATTTGATT CCCATTGTAC TTCAGAAAGC ACTGGGTTTA
CGGGATAAGC TATATATTTT CGGGAATGAT TACCCGACCA GGGATGGAAC TTGTATCCGG
GATTATATCC ATGTCAATGA CCTGGCTGAT GCCCATGTCC TGGCTATTGA AGGTTTAACA
CGGGGTCTGG AGAGCCGTAT TTATAACCTT GGTAATGGTG AAGGTTATTC TGTAAAAGAG
GTAATTGAAA CTGCCAGCAG GGTTATCGGC AAACCGATTG AAGCCGGGGT TGGTGACAGG
CGACCCGGGG ATCCAGCTGT TCTGGTGGCA AGTTCAGATA AAATTAAAGA GGAGCTGGGA
TGGGATCCAC AGTATCCTGA CCTGGAAACT ATAATTGAAA CTGCCTGGCA ATGGCATAAA
AGGGGTGGTT TTAATGAAAA TGAATAA
 
Protein sequence
MNILVTGGAG YIGSHVVKSL FEAGYNVVTL DNLEKGHREA VLGGEFIKGD LKDRELLDSI 
MKDYEIDGVI HLAAHSLVGE SMENPGKYYK NNVSNGLNLL EAMVDNDVKY LVFSSTAAVY
GEPREVPITE DHPTAPTNTY GESKLFFEKM MKRYDEIYGL KYVSLRYFNA AGADLSGKIG
EDHDPETHLI PIVLQKALGL RDKLYIFGND YPTRDGTCIR DYIHVNDLAD AHVLAIEGLT
RGLESRIYNL GNGEGYSVKE VIETASRVIG KPIEAGVGDR RPGDPAVLVA SSDKIKEELG
WDPQYPDLET IIETAWQWHK RGGFNENE