Gene Information |
Locus tag | Hore_19750 |
Symbol | |
ID | 7312790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 2123239 |
End bp | 2125698 |
Gene Length | 2460 bp |
Protein Length | 819 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643612421 |
Product | glycosyltransferase 36 |
Protein accession | YP_002509717 |
Protein GI | 220932809 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3459] Cellobiose phosphorylase |
TIGRFAM ID | |
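The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 2123239
end_bp = 2125698

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the reported Gene Length (2460 bp) and Protein Length (819 aa).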
| |
Plasmid Coverage information |
Num covering plasmid clones | 52 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATACG GGTATTTTGA TGACCGGGCC AGGGAATATG TAATTAATAA TCCAGAAACT CCGTATCCCT GGATAAATTA TCTGGGTTCA AAAAATTATT TCTGTCTTCT TTCCAATACT GCAGGAGGGT ATAGTTTTTA CAAGGATGCC AGATTACTCA GGATTACCAG ATATCGATAC AATAATGTTC CGATTGATGC AGGTGGTAAG TATTACTATA TATATGAAGA CGGAGAATAC TGGTCCCCTA CCTGGGCTCC GGTTAAGGCA AAGCTTGATT ACTACCAGTG TCAGCATGGC CTGGGTTATA GTGTAATAAC CGGTGAACGG AAAAACATTT CAACTGAAAT ATTATATTTT GTTCCAGTAG ATGATAACTG TGAAATACAC CGTTTAAAAA TAAAAAATAA TGGTTCATCT GTTAGAAAAA TAAAACTGTT TTCCTTTGTT GAGTTTTGTC TCTGGGATGC ATATGATGAT ATGACTAATT TCCAGAGGAA CTTAAGTACT GGTGAGGTAG AAGTAGAGGG TTCAACCATA TTCCATAAAA CAGAATATAG AGAGAGGAGA AACCATTTTT CCTTTTTTAC AGTCAATAGT AATATTTCAG GTTTTGATAC CGATAGGGAA TCATTTATAG GTCTGTATAA CGGTTTCCAT GAACCACAGG TTGTTGTCAG TGGCAGACCA GGTAATTCTA TTGCCAGTGG GTGGTCACCA ATTGGCTCAC ACTGCCTGGA AATAGTATTA CAACCGGGTG AAGAACAAAG TTATATTTTT ATACTTGGCT TTATGGAAAA TAATCCTGAT AACAAGTGGT CTTCATGCGG GGGAATTAAT AAAGATAAAG CCTATAGTAT GATCAGGCGC TATAATTCTG ATATCAAAGT TGACAATGCA TTGCAACAGC TTAAATCTTA CTGGGATGAC CTCCTTTCAG GCTTTTATCT TGAATCTGGT GATGACAGGT TAGACAGGAT GGTTAATACC TGGAATCAAT ATCAGTGTAT GGTTACCTTT AATCTGGCCA GGAGTGCATC ATATTTCGAA TCAGGTATTA GTAGAGGAAT AGGATTTAGG GATTCCAATC AGGACCTTCT GGGGGTTGTT CACATGGTTC CTGAAAGGGC AAGAGAAAGG TTGATAGATC TGGCCCGTAC ACAGTTTGAG GACGGGAGCA CGTACCATCA ATACCAGCCC CTTACCAAAG AAGGTAATAA TGAAATAGGA AGTGGGTTTA ATGATGACCC CTTATGGCTA ATTTTAGGTA CTGCAGCTTA TATCAAGGAA ACCGGCGATA TTTCTATTCT TAATGAAGAG GTTAGCTTTA ATAATGGTAA GGAGGCAACT TTATTTGATC ATTTACTGGC ATCATTTAAC CATGTTGTCT TAAATAAGGG GCCGCATGGA TTGCCATTGA TTGGCAGGGC TGACTGGAAT GATTGTTTAA ACTTAAATTG TTTTTCGACT GACCCTGGAG AGTCATTCCA GACGACAACT AATAAAGATG ATGGGCAGGC TGAATCTGTT CTGATTGCCG GGATGTTTGT GACGATAGGT CCTGAATTTG TTAAGATGTG CAGATTTATC GGTAAAGATG AAATTGCTGA CAGAGCTCAA TCAGAAATTG AGGACATGAA AAAGGCAGTA ATGGAGCACG GGAGAGACAA AAACTGGTTT CTGAGGGCAT ACGATTATTA TGGAAATAAG GTTGGTAGTA TTGATAACAA TGAAGGTCAA ATATATATAG AACCTCAGGG TTTTTGTGTT ATGGGGGGGC TGGGTATTGA ATCAGGGTTT 
GCCCGTAAGG CACTTGATTC TGTCAAAGAG AGGCTTGATA CAGAATATGG CCTTGTATTA CTGGATCCGC CCTATAAACA ATATTACCCT GAACTTGGTG AAATTTCTTC ATACCCGCCT GGATATAAGG AAAATGGAGG TATTTTCTGT CATGCCAACC CCTGGATAAT GATTGCAGAA ACAGTCCTGG GGAGAGGGAA TAATGCGTTT GAATACTATA AAAAGATAGC CCCTGCATAT CTTGAAGAAA TCAGTGATAT CCACAGGATG GAGCCATATG TATATGCCCA GATGATTGCA GGCAAAGATG CAGTTAACCA TGGTGAAGCA AAGAATTCGT GGTTAACCGG AACTGCTGCC TGGAATTATG TGGCTATCAC ACATTATATA CTGGGAATAA GGCCGGAATA TGAAGGTTTG AAAATAGATC CATGTATCCC TGAAGAATGG TCTGGATTTT ATGTAGAGAA GAGATTTAGA GGTAAAGTGT ATAAAATTCA TGTAAATAAC CCTGCAAAAG TTAGTAAAGG GGTAAAACAT ATCAAGGTAG ATGGAGAAAT AATTGAAGGT AATTTAATTA GAATTCCGCT AGAAAATCAA GCCAGAGAAA ATGTCAATGA TAATACAACT CAGCACATTG TAGAAGTAAT TATGGGCTGA
|
Protein sequence | MKYGYFDDRA REYVINNPET PYPWINYLGS KNYFCLLSNT AGGYSFYKDA RLLRITRYRY NNVPIDAGGK YYYIYEDGEY WSPTWAPVKA KLDYYQCQHG LGYSVITGER KNISTEILYF VPVDDNCEIH RLKIKNNGSS VRKIKLFSFV EFCLWDAYDD MTNFQRNLST GEVEVEGSTI FHKTEYRERR NHFSFFTVNS NISGFDTDRE SFIGLYNGFH EPQVVVSGRP GNSIASGWSP IGSHCLEIVL QPGEEQSYIF ILGFMENNPD NKWSSCGGIN KDKAYSMIRR YNSDIKVDNA LQQLKSYWDD LLSGFYLESG DDRLDRMVNT WNQYQCMVTF NLARSASYFE SGISRGIGFR DSNQDLLGVV HMVPERARER LIDLARTQFE DGSTYHQYQP LTKEGNNEIG SGFNDDPLWL ILGTAAYIKE TGDISILNEE VSFNNGKEAT LFDHLLASFN HVVLNKGPHG LPLIGRADWN DCLNLNCFST DPGESFQTTT NKDDGQAESV LIAGMFVTIG PEFVKMCRFI GKDEIADRAQ SEIEDMKKAV MEHGRDKNWF LRAYDYYGNK VGSIDNNEGQ IYIEPQGFCV MGGLGIESGF ARKALDSVKE RLDTEYGLVL LDPPYKQYYP ELGEISSYPP GYKENGGIFC HANPWIMIAE TVLGRGNNAF EYYKKIAPAY LEEISDIHRM EPYVYAQMIA GKDAVNHGEA KNSWLTGTAA WNYVAITHYI LGIRPEYEGL KIDPCIPEEW SGFYVEKRFR GKVYKIHVNN PAKVSKGVKH IKVDGEIIEG NLIRIPLENQ ARENVNDNTT QHIVEVIMG
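The relationship between the two sequences above can be spot-checked by translating the first and last 30 nucleotides of the gene sequence. This is a minimal sketch, not the database's own pipeline: the `translate` helper is hypothetical, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for internal codons. The gene sequence is listed in coding orientation (it begins with ATG), so no reverse complement is applied even though the gene lies on the minus strand.

```python
# Codon table in NCBI order: first, second and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; spaces are ignored, '*' marks a stop."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any incomplete trailing codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)


# First and last 30 nucleotides, copied from the Gene sequence field.
first_30 = "ATGAAATACG GGTATTTTGA TGACCGGGCC"
last_30 = "CAGCACATTG TAGAAGTAAT TATGGGCTGA"

print(translate(first_30))  # MKYGYFDDRA
print(translate(last_30))   # QHIVEVIMG*
```

The two outputs match the first ten and last nine residues of the protein sequence, with the trailing `*` corresponding to the TGA stop codon.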