Gene Hore_20670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_20670 
Symbol 
ID7314391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2236262 
End bp2237251 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content42% 
IMG OID643612511 
Productgalactose-1-phosphate uridylyltransferase 
Protein accessionYP_002509807 
Protein GI220932899 
COG category[C] Energy production and conversion 
COG ID[COG1085] Galactose-1-phosphate uridylyltransferase 
TIGRFAM ID[TIGR00209] galactose-1-phosphate uridylyltransferase, family 1 


Plasmid Coverage information

Num covering plasmid clones65 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGAAT TAAGGTGGAA CCCGATTTTA AAGGAGTGGG TTATTACAGC AACCCACCGG 
CAGAACAGGA CTTTTAAACC ACCGAAGGAT TTTTGTCCCC TTTGTCCAAC CAGGGAAGGT
GGCTTTCCCA CCGAAGTCCC GGCTGAAGAT TATGATATAG TTGTTTTTCA AAATAGATTT
CCATCCCTGC AGCCAGATGC CCCCGAGCCC GATATAGAGG GTTCCGAATT GTATCCTGTA
GATCTGGCAC AGGGTATATG TGAAGTCGTT TTATTTACAT CAGAGCATGA AGGGGTTATG
TCCCAACAGC CGTTAAGTAA ATTTGAAAAA CTGGTTAAGG TCTGGAAGGA TCGTTATCAG
GAACTGGGAA AAAAGGATTT TATAGATTAT GTATATATTT TTGAAAACAA AGGGGAGGAA
GTCGGGGTTA CTTTACACCA TCCTCATGGC CAGATATATG CCTATCCCTT TATTCCCCCT
ATAATAGAGC GGGAGTTAAA CTCAAGTAAG GAACATCTGG AAAAGGAAGG GGAATGCCTT
TTCTGCAGGG TTCTCCGGGA AGAAAAGGAG GATGGCAGGC GGATAATAGC CAGTAATAAG
TCTTTTACTG CCGTTATTCC CTTTTTTGCC CGATATACCT ATGAAGTTCA CATTTATGCC
AACAAACATT TACCCTCAAT GGCTGAGTTC GGACCTGAGG AGGAAAAGGA CCTGGCCCGG
ATATTAAAGT TATTAATTAT GAAATATGAT AATCTCTTTG AGTTTGTTTT CCCTTATATT
ATGTGTATTC ACCAACAACC TACTGATGGT AGTGGTTTTG ACTATTCCCA TTTCCATATA
GAGTTCTATC CACCATACCG GACAAAAGAC AAGTTAAAAT ACCTGGCCGG TAGTGAAGCC
GGGGCAGGTA CTTTCATCAA CGGTTCTCTG GCTGAAAATA AAGCAGCTGT ATTGAGGGAA
ACCAGTCCAG TTTCCTTTGA AGATATGTAG
 
Protein sequence
MSELRWNPIL KEWVITATHR QNRTFKPPKD FCPLCPTREG GFPTEVPAED YDIVVFQNRF 
PSLQPDAPEP DIEGSELYPV DLAQGICEVV LFTSEHEGVM SQQPLSKFEK LVKVWKDRYQ
ELGKKDFIDY VYIFENKGEE VGVTLHHPHG QIYAYPFIPP IIERELNSSK EHLEKEGECL
FCRVLREEKE DGRRIIASNK SFTAVIPFFA RYTYEVHIYA NKHLPSMAEF GPEEEKDLAR
ILKLLIMKYD NLFEFVFPYI MCIHQQPTDG SGFDYSHFHI EFYPPYRTKD KLKYLAGSEA
GAGTFINGSL AENKAAVLRE TSPVSFEDM