Gene Hore_19760 details

Gene Information

Locus tag: Hore_19760
Symbol:
ID: 7312791
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 2125774
End bp: 2128128
Gene Length: 2355 bp
Protein Length: 784 aa
Translation table: 11
GC content: 40%
IMG OID: 643612422
Product: glycosyltransferase 36
Protein accession: YP_002509718
Protein GI: 220932810
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3459] Cellobiose phosphorylase
TIGRFAM ID:
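
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2125774, 2128128)
print(length, protein_length_aa(length))  # 2355 784
```

Under these assumptions the result agrees with the listed gene length (2355 bp) and protein length (784 aa).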


Plasmid Coverage information

Num covering plasmid clones: 50
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTCATACG GATATTTTGA CAGTGAAAAC AAAGAATATG TTATAACCAA CCCACGGACC 
CCTACACCAT GGATTAACTA TATTGGTGGT GGAAATTATG GTGGAATTGT TTCCCAGACA
GGGGGAGGAT ATAGTTTTGA TGGTGACCCC AGGTTCAAAA GGGTTTTAAG ATATAGATAT
AATAGTATTC CTGAAGACCA ACCCGGAAGG TATATTTATC TAAGGGATAT TGATAACAAT
AATTACTGGT CAGCTACCTG GCAGCCTGTT AAAGCTGATT TTGATGAGTA TGTATGCCGG
CATGGACTTG GTTATACAAC AATTGAGCAA AAAAAGGATG ACATAGTTAC CAGTGTTACA
TATTTTGTTC CGGGTAATGA ATCCCTGGAA ATATGGCAGT TAAATGTTAA AAATTTGGGT
GAAGAAACCA GACATCTCTC CCTGTATACC TATGCGGAAT TTTCCTTTTT TGATGCAGTA
AAGGATCAGC AGAATGTGGA CTGGACACAA CAGATACAAC AGGGTGAGTT CGAGGATAAT
ATCCTATTCT GGAATGCATT TATGAAAAAC TGGGAATACA TTTTTATGAC AAGCAGTATT
CAGGTAACGT CTTATGAAAC CAGCCGTGAA AGGTTTGTTG GTTGTTATCG TGACCTGTCC
AATCCAATTG CTGTAGAGGA GAGTAATTGC AGCAATTATC TTGCCCAGAG GGGAAATGGT
GTTGGTTGCT TAAATCATGA AATCAAACTT GAGCCTGGAT GTGAAAAAGA AATTATTTAT
ATCCTTGGTA CTACTCCCTC AAAGCAAACT GTTAAAGAAA AAATATCCAG TTTTCTCGAC
CCGGAACAGG TAGAGCAGGA AAAAGAAGGT TTAAAAAAAT ACTGGGATAA TTTTTTAAGC
AGCTGTGAGG TTGATACACC TGACCCTGAA ATGAATTTAA TGTTAAATAC CTGGAACCAG
TACCAGTGTA AAACAACATT TAACTGGTCA AGGTTTGTAT CACTGTATCA GTTAGGAATA
AACAGAGGAA TGGGATTCAG GGATAGTGCC CAGGATGTTC TGGGTGTTAT GCATGCTATA
CCGGATGAGT GCAGGGAATT AATTATTAAA CTCTTCAAGA TACAGCATCA GGATGGCCAT
GCCTATCATT TATACTACCC ACTGACTGGA GAGGGTACAA CAGGTGAAGC AGGGGTGGAT
GGGAGTGTTG ACTGGTATTC AGATGACCAC CTGTGGATAA TTATTTCTGT AGCTGCTTAC
CTAAAGGAGA CTGGTGACTT TGACTTTTTA ACAGAGGTAG TCCCCTATGC AGATAATCAG
GGAGAAGGTA CCGTTCTTGA ACATTTATGT AAGGCACTGG AATTCACGGA AAACCATAAG
GGTAAGCACA ATATTCCCCT TGCTGGTTTT GCAGACTGGA ATGACACCAT TAACCTGGAT
AATGGTAATG GGGTTGCAGA ATCTGTCTGG ACTGGTATGC TTTATTGTTA TGCATTGAAA
GAGATGCTTG ACCTGCTTGA GTATTTAGGA GAAAACCAAC TTGTTACCCG TTATCAAGAG
TATTACCAGT CACAAAGGGA AGCTATCAAT AAATATGCAT GGGATGGGGA CTGGTATTTA
AGGGCCTATG ATGATAACGG GGAGCCATTA GGTTCCCAAC ATTGTGAACA TGGAAAAATA
TTCCTTAATC CCCAGTCATG GTCTATAATG GCTGGAATAG CAGACAATAA GCAACAACAA
AGATTATTAC AGAAAGTAGA TGAGATGTTA AATACTGAAT TTGGTGTTGT GCTGGTATAT
CCCGCCTATC ATAAGTATGA CCCGGAAAAA GGTGGAATAA CAACTTACCC ACCCGGGGCG
AAAGAAAATG GAGGCATATT CCTGCATACT AATCCCTGGC TTATTATTGC AGAAACAATG
CTGGGAAATG GTAACAGGGC ATACCAGTAT TACCGGAAGT TGTTACCCCC ATTAAAGAAT
GATATTGCAG ATCGATATGA GATCGAACCC TATGTATACT GTCAGAACAT ATTAGGAAAA
GAACATCCCC AGTTTGGGCT TGGCAGAAAC TCATGGTTAA CCGGTACGGC AGCATGGATG
TACAGGGCCG CAGTATATTA TATTCTTGGT GTGAGACCCA CTTATGACGG TTTAATTATT
GACCCGGTTG TCCCTTCTTC CTGGGAAAAC TTTAGTATAA AAAGAAAATT CAGAGGTAAA
GTAATTGAGA TAAATTGTAT CAAATCGGAT CGGAATTATA TCAGGATTAA TGGTGTCGGG
GAAAAATCGG GTAATAAAGT ATCGCTGAGT GAGTTAACTG AAGATATCAA CAGACTTGAA
GTATATTATA CATAA
 
Protein sequence
MSYGYFDSEN KEYVITNPRT PTPWINYIGG GNYGGIVSQT GGGYSFDGDP RFKRVLRYRY 
NSIPEDQPGR YIYLRDIDNN NYWSATWQPV KADFDEYVCR HGLGYTTIEQ KKDDIVTSVT
YFVPGNESLE IWQLNVKNLG EETRHLSLYT YAEFSFFDAV KDQQNVDWTQ QIQQGEFEDN
ILFWNAFMKN WEYIFMTSSI QVTSYETSRE RFVGCYRDLS NPIAVEESNC SNYLAQRGNG
VGCLNHEIKL EPGCEKEIIY ILGTTPSKQT VKEKISSFLD PEQVEQEKEG LKKYWDNFLS
SCEVDTPDPE MNLMLNTWNQ YQCKTTFNWS RFVSLYQLGI NRGMGFRDSA QDVLGVMHAI
PDECRELIIK LFKIQHQDGH AYHLYYPLTG EGTTGEAGVD GSVDWYSDDH LWIIISVAAY
LKETGDFDFL TEVVPYADNQ GEGTVLEHLC KALEFTENHK GKHNIPLAGF ADWNDTINLD
NGNGVAESVW TGMLYCYALK EMLDLLEYLG ENQLVTRYQE YYQSQREAIN KYAWDGDWYL
RAYDDNGEPL GSQHCEHGKI FLNPQSWSIM AGIADNKQQQ RLLQKVDEML NTEFGVVLVY
PAYHKYDPEK GGITTYPPGA KENGGIFLHT NPWLIIAETM LGNGNRAYQY YRKLLPPLKN
DIADRYEIEP YVYCQNILGK EHPQFGLGRN SWLTGTAAWM YRAAVYYILG VRPTYDGLII
DPVVPSSWEN FSIKRKFRGK VIEINCIKSD RNYIRINGVG EKSGNKVSLS ELTEDINRLE
VYYT
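
The gene sequence begins with TTG while the protein begins with M, which is consistent with translation table 11, where TTG is one of several alternative initiation codons. The following is a minimal sketch of that translation, not the method used to produce this record. It assumes the internal codon assignments of table 11 match the standard genetic code and that the initiator set is ATG, GTG, TTG, CTG, ATT, ATC and ATA. The function name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order (assumed identical to the standard code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a CDS, reading an alternative start codon as M."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    usable = len(seq) - len(seq) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # the first codon is read as methionine
            continue
        aa = CODON_TABLE.get(codon, "X")  # X for codons with ambiguous bases
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the gene sequence listed above.
print(translate_cds("TTGTCATACG GATATTTTGA CAGTGAAAAC"))  # MSYGYFDSEN
```

The output matches the first ten residues of the protein sequence listed above.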