Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_19760 |
Symbol | |
ID | 7312791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 2125774 |
End bp | 2128128 |
Gene Length | 2355 bp |
Protein Length | 784 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643612422 |
Product | glycosyltransferase 36 |
Protein accession | YP_002509718 |
Protein GI | 220932810 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3459] Cellobiose phosphorylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTCATACG GATATTTTGA CAGTGAAAAC AAAGAATATG TTATAACCAA CCCACGGACC CCTACACCAT GGATTAACTA TATTGGTGGT GGAAATTATG GTGGAATTGT TTCCCAGACA GGGGGAGGAT ATAGTTTTGA TGGTGACCCC AGGTTCAAAA GGGTTTTAAG ATATAGATAT AATAGTATTC CTGAAGACCA ACCCGGAAGG TATATTTATC TAAGGGATAT TGATAACAAT AATTACTGGT CAGCTACCTG GCAGCCTGTT AAAGCTGATT TTGATGAGTA TGTATGCCGG CATGGACTTG GTTATACAAC AATTGAGCAA AAAAAGGATG ACATAGTTAC CAGTGTTACA TATTTTGTTC CGGGTAATGA ATCCCTGGAA ATATGGCAGT TAAATGTTAA AAATTTGGGT GAAGAAACCA GACATCTCTC CCTGTATACC TATGCGGAAT TTTCCTTTTT TGATGCAGTA AAGGATCAGC AGAATGTGGA CTGGACACAA CAGATACAAC AGGGTGAGTT CGAGGATAAT ATCCTATTCT GGAATGCATT TATGAAAAAC TGGGAATACA TTTTTATGAC AAGCAGTATT CAGGTAACGT CTTATGAAAC CAGCCGTGAA AGGTTTGTTG GTTGTTATCG TGACCTGTCC AATCCAATTG CTGTAGAGGA GAGTAATTGC AGCAATTATC TTGCCCAGAG GGGAAATGGT GTTGGTTGCT TAAATCATGA AATCAAACTT GAGCCTGGAT GTGAAAAAGA AATTATTTAT ATCCTTGGTA CTACTCCCTC AAAGCAAACT GTTAAAGAAA AAATATCCAG TTTTCTCGAC CCGGAACAGG TAGAGCAGGA AAAAGAAGGT TTAAAAAAAT ACTGGGATAA TTTTTTAAGC AGCTGTGAGG TTGATACACC TGACCCTGAA ATGAATTTAA TGTTAAATAC CTGGAACCAG TACCAGTGTA AAACAACATT TAACTGGTCA AGGTTTGTAT CACTGTATCA GTTAGGAATA AACAGAGGAA TGGGATTCAG GGATAGTGCC CAGGATGTTC TGGGTGTTAT GCATGCTATA CCGGATGAGT GCAGGGAATT AATTATTAAA CTCTTCAAGA TACAGCATCA GGATGGCCAT GCCTATCATT TATACTACCC ACTGACTGGA GAGGGTACAA CAGGTGAAGC AGGGGTGGAT GGGAGTGTTG ACTGGTATTC AGATGACCAC CTGTGGATAA TTATTTCTGT AGCTGCTTAC CTAAAGGAGA CTGGTGACTT TGACTTTTTA ACAGAGGTAG TCCCCTATGC AGATAATCAG GGAGAAGGTA CCGTTCTTGA ACATTTATGT AAGGCACTGG AATTCACGGA AAACCATAAG GGTAAGCACA ATATTCCCCT TGCTGGTTTT GCAGACTGGA ATGACACCAT TAACCTGGAT AATGGTAATG GGGTTGCAGA ATCTGTCTGG ACTGGTATGC TTTATTGTTA TGCATTGAAA GAGATGCTTG ACCTGCTTGA GTATTTAGGA GAAAACCAAC TTGTTACCCG TTATCAAGAG TATTACCAGT CACAAAGGGA AGCTATCAAT AAATATGCAT GGGATGGGGA CTGGTATTTA AGGGCCTATG ATGATAACGG GGAGCCATTA GGTTCCCAAC ATTGTGAACA TGGAAAAATA TTCCTTAATC CCCAGTCATG GTCTATAATG GCTGGAATAG CAGACAATAA GCAACAACAA AGATTATTAC AGAAAGTAGA TGAGATGTTA AATACTGAAT TTGGTGTTGT GCTGGTATAT CCCGCCTATC ATAAGTATGA CCCGGAAAAA GGTGGAATAA CAACTTACCC ACCCGGGGCG AAAGAAAATG GAGGCATATT CCTGCATACT AATCCCTGGC TTATTATTGC AGAAACAATG CTGGGAAATG GTAACAGGGC ATACCAGTAT TACCGGAAGT TGTTACCCCC ATTAAAGAAT GATATTGCAG ATCGATATGA GATCGAACCC TATGTATACT GTCAGAACAT ATTAGGAAAA GAACATCCCC AGTTTGGGCT TGGCAGAAAC TCATGGTTAA CCGGTACGGC AGCATGGATG TACAGGGCCG CAGTATATTA TATTCTTGGT GTGAGACCCA CTTATGACGG TTTAATTATT GACCCGGTTG TCCCTTCTTC CTGGGAAAAC TTTAGTATAA AAAGAAAATT CAGAGGTAAA GTAATTGAGA TAAATTGTAT CAAATCGGAT CGGAATTATA TCAGGATTAA TGGTGTCGGG GAAAAATCGG GTAATAAAGT ATCGCTGAGT GAGTTAACTG AAGATATCAA CAGACTTGAA GTATATTATA CATAA
|
Protein sequence | MSYGYFDSEN KEYVITNPRT PTPWINYIGG GNYGGIVSQT GGGYSFDGDP RFKRVLRYRY NSIPEDQPGR YIYLRDIDNN NYWSATWQPV KADFDEYVCR HGLGYTTIEQ KKDDIVTSVT YFVPGNESLE IWQLNVKNLG EETRHLSLYT YAEFSFFDAV KDQQNVDWTQ QIQQGEFEDN ILFWNAFMKN WEYIFMTSSI QVTSYETSRE RFVGCYRDLS NPIAVEESNC SNYLAQRGNG VGCLNHEIKL EPGCEKEIIY ILGTTPSKQT VKEKISSFLD PEQVEQEKEG LKKYWDNFLS SCEVDTPDPE MNLMLNTWNQ YQCKTTFNWS RFVSLYQLGI NRGMGFRDSA QDVLGVMHAI PDECRELIIK LFKIQHQDGH AYHLYYPLTG EGTTGEAGVD GSVDWYSDDH LWIIISVAAY LKETGDFDFL TEVVPYADNQ GEGTVLEHLC KALEFTENHK GKHNIPLAGF ADWNDTINLD NGNGVAESVW TGMLYCYALK EMLDLLEYLG ENQLVTRYQE YYQSQREAIN KYAWDGDWYL RAYDDNGEPL GSQHCEHGKI FLNPQSWSIM AGIADNKQQQ RLLQKVDEML NTEFGVVLVY PAYHKYDPEK GGITTYPPGA KENGGIFLHT NPWLIIAETM LGNGNRAYQY YRKLLPPLKN DIADRYEIEP YVYCQNILGK EHPQFGLGRN SWLTGTAAWM YRAAVYYILG VRPTYDGLII DPVVPSSWEN FSIKRKFRGK VIEINCIKSD RNYIRINGVG EKSGNKVSLS ELTEDINRLE VYYT
|
| |