Gene Hore_19750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_19750 
Symbol 
ID7312790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2123239 
End bp2125698 
Gene Length2460 bp 
Protein Length819 aa 
Translation table11 
GC content39% 
IMG OID643612421 
Productglycosyltransferase 36 
Protein accessionYP_002509717 
Protein GI220932809 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3459] Cellobiose phosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATACG GGTATTTTGA TGACCGGGCC AGGGAATATG TAATTAATAA TCCAGAAACT 
CCGTATCCCT GGATAAATTA TCTGGGTTCA AAAAATTATT TCTGTCTTCT TTCCAATACT
GCAGGAGGGT ATAGTTTTTA CAAGGATGCC AGATTACTCA GGATTACCAG ATATCGATAC
AATAATGTTC CGATTGATGC AGGTGGTAAG TATTACTATA TATATGAAGA CGGAGAATAC
TGGTCCCCTA CCTGGGCTCC GGTTAAGGCA AAGCTTGATT ACTACCAGTG TCAGCATGGC
CTGGGTTATA GTGTAATAAC CGGTGAACGG AAAAACATTT CAACTGAAAT ATTATATTTT
GTTCCAGTAG ATGATAACTG TGAAATACAC CGTTTAAAAA TAAAAAATAA TGGTTCATCT
GTTAGAAAAA TAAAACTGTT TTCCTTTGTT GAGTTTTGTC TCTGGGATGC ATATGATGAT
ATGACTAATT TCCAGAGGAA CTTAAGTACT GGTGAGGTAG AAGTAGAGGG TTCAACCATA
TTCCATAAAA CAGAATATAG AGAGAGGAGA AACCATTTTT CCTTTTTTAC AGTCAATAGT
AATATTTCAG GTTTTGATAC CGATAGGGAA TCATTTATAG GTCTGTATAA CGGTTTCCAT
GAACCACAGG TTGTTGTCAG TGGCAGACCA GGTAATTCTA TTGCCAGTGG GTGGTCACCA
ATTGGCTCAC ACTGCCTGGA AATAGTATTA CAACCGGGTG AAGAACAAAG TTATATTTTT
ATACTTGGCT TTATGGAAAA TAATCCTGAT AACAAGTGGT CTTCATGCGG GGGAATTAAT
AAAGATAAAG CCTATAGTAT GATCAGGCGC TATAATTCTG ATATCAAAGT TGACAATGCA
TTGCAACAGC TTAAATCTTA CTGGGATGAC CTCCTTTCAG GCTTTTATCT TGAATCTGGT
GATGACAGGT TAGACAGGAT GGTTAATACC TGGAATCAAT ATCAGTGTAT GGTTACCTTT
AATCTGGCCA GGAGTGCATC ATATTTCGAA TCAGGTATTA GTAGAGGAAT AGGATTTAGG
GATTCCAATC AGGACCTTCT GGGGGTTGTT CACATGGTTC CTGAAAGGGC AAGAGAAAGG
TTGATAGATC TGGCCCGTAC ACAGTTTGAG GACGGGAGCA CGTACCATCA ATACCAGCCC
CTTACCAAAG AAGGTAATAA TGAAATAGGA AGTGGGTTTA ATGATGACCC CTTATGGCTA
ATTTTAGGTA CTGCAGCTTA TATCAAGGAA ACCGGCGATA TTTCTATTCT TAATGAAGAG
GTTAGCTTTA ATAATGGTAA GGAGGCAACT TTATTTGATC ATTTACTGGC ATCATTTAAC
CATGTTGTCT TAAATAAGGG GCCGCATGGA TTGCCATTGA TTGGCAGGGC TGACTGGAAT
GATTGTTTAA ACTTAAATTG TTTTTCGACT GACCCTGGAG AGTCATTCCA GACGACAACT
AATAAAGATG ATGGGCAGGC TGAATCTGTT CTGATTGCCG GGATGTTTGT GACGATAGGT
CCTGAATTTG TTAAGATGTG CAGATTTATC GGTAAAGATG AAATTGCTGA CAGAGCTCAA
TCAGAAATTG AGGACATGAA AAAGGCAGTA ATGGAGCACG GGAGAGACAA AAACTGGTTT
CTGAGGGCAT ACGATTATTA TGGAAATAAG GTTGGTAGTA TTGATAACAA TGAAGGTCAA
ATATATATAG AACCTCAGGG TTTTTGTGTT ATGGGGGGGC TGGGTATTGA ATCAGGGTTT
GCCCGTAAGG CACTTGATTC TGTCAAAGAG AGGCTTGATA CAGAATATGG CCTTGTATTA
CTGGATCCGC CCTATAAACA ATATTACCCT GAACTTGGTG AAATTTCTTC ATACCCGCCT
GGATATAAGG AAAATGGAGG TATTTTCTGT CATGCCAACC CCTGGATAAT GATTGCAGAA
ACAGTCCTGG GGAGAGGGAA TAATGCGTTT GAATACTATA AAAAGATAGC CCCTGCATAT
CTTGAAGAAA TCAGTGATAT CCACAGGATG GAGCCATATG TATATGCCCA GATGATTGCA
GGCAAAGATG CAGTTAACCA TGGTGAAGCA AAGAATTCGT GGTTAACCGG AACTGCTGCC
TGGAATTATG TGGCTATCAC ACATTATATA CTGGGAATAA GGCCGGAATA TGAAGGTTTG
AAAATAGATC CATGTATCCC TGAAGAATGG TCTGGATTTT ATGTAGAGAA GAGATTTAGA
GGTAAAGTGT ATAAAATTCA TGTAAATAAC CCTGCAAAAG TTAGTAAAGG GGTAAAACAT
ATCAAGGTAG ATGGAGAAAT AATTGAAGGT AATTTAATTA GAATTCCGCT AGAAAATCAA
GCCAGAGAAA ATGTCAATGA TAATACAACT CAGCACATTG TAGAAGTAAT TATGGGCTGA
 
Protein sequence
MKYGYFDDRA REYVINNPET PYPWINYLGS KNYFCLLSNT AGGYSFYKDA RLLRITRYRY 
NNVPIDAGGK YYYIYEDGEY WSPTWAPVKA KLDYYQCQHG LGYSVITGER KNISTEILYF
VPVDDNCEIH RLKIKNNGSS VRKIKLFSFV EFCLWDAYDD MTNFQRNLST GEVEVEGSTI
FHKTEYRERR NHFSFFTVNS NISGFDTDRE SFIGLYNGFH EPQVVVSGRP GNSIASGWSP
IGSHCLEIVL QPGEEQSYIF ILGFMENNPD NKWSSCGGIN KDKAYSMIRR YNSDIKVDNA
LQQLKSYWDD LLSGFYLESG DDRLDRMVNT WNQYQCMVTF NLARSASYFE SGISRGIGFR
DSNQDLLGVV HMVPERARER LIDLARTQFE DGSTYHQYQP LTKEGNNEIG SGFNDDPLWL
ILGTAAYIKE TGDISILNEE VSFNNGKEAT LFDHLLASFN HVVLNKGPHG LPLIGRADWN
DCLNLNCFST DPGESFQTTT NKDDGQAESV LIAGMFVTIG PEFVKMCRFI GKDEIADRAQ
SEIEDMKKAV MEHGRDKNWF LRAYDYYGNK VGSIDNNEGQ IYIEPQGFCV MGGLGIESGF
ARKALDSVKE RLDTEYGLVL LDPPYKQYYP ELGEISSYPP GYKENGGIFC HANPWIMIAE
TVLGRGNNAF EYYKKIAPAY LEEISDIHRM EPYVYAQMIA GKDAVNHGEA KNSWLTGTAA
WNYVAITHYI LGIRPEYEGL KIDPCIPEEW SGFYVEKRFR GKVYKIHVNN PAKVSKGVKH
IKVDGEIIEG NLIRIPLENQ ARENVNDNTT QHIVEVIMG