Gene Hore_20170 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_20170 
Symbol 
ID7314341 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2173706 
End bp2175931 
Gene Length2226 bp 
Protein Length741 aa 
Translation table11 
GC content31% 
IMG OID643612461 
Producttranscriptional antiterminator, BglG 
Protein accessionYP_002509757 
Protein GI220932849 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG1762] Phosphotransferase system mannitol/fructose-specific IIA domain (Ntr-type)
[COG3711] Transcriptional antiterminator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGTTTAA AACCCCGGGT TTGTCAATTA TTGAAATATT TAATTGACCA GGAGAAAGCT 
GTCTCTATTA AACAATTGGC TGATAGATTT AATGTTAGTA GTAGAACGAT CAGGTATGAT
CTAGATGATA TTGAATCTAG TATCTCTTCC TATGATGCTA AATTAATAAG AAAAACACGA
ATTGGGGTTT ATTTAGAGGG AAAAGAAGAA GAGTTAAATA AGATACAAGA AGAATTAGCT
AATCTTTATG GCCTTGAAAG AATTTTATCT CCTAAAGAGC GTCAGCATCT AATTTTATTT
AGACTATTTC AGGCTAATGA ACCGATTATT ATCAAAGAAT TAGAAATAAT GTTACGCATC
AGTAAGTCAA CAATAATTAA AGATTTAGAT GAAGTTGAAA ACTGGTTATC AAAGCACAAT
TTAATTCTTA TTAGAAAGAC CAATTATGGT TTAGAAATAA AAGGTGACGA GATAGATATT
AGACATGCCA TGATGAATGT TTTAGAAGAA ACAGCAAATG AAAAAGAATT AGTTGGATTT
TTACGACAAA TTCAGAAAAA AGCATTGGAA GAAAGAAATT TAGAGCCGGG GTTTTTTAAA
GAATTTGATA AATTAGTATC TGGTATTGAT TTGACCAAGA TTGAAAGTGT AATTTCTTTT
GCCGAAAAGC AATTAGGATT TCAATTTGCT GATGAAGCGT ATGCTAGTTT GTTAGTTCAT
TTAGCTTTAG CAATAAGCAG ATTATTAGAA GGAAAAGATA TTCAGCTACC CGAGGAACGA
TTAAAGGTTA TTAAAAAAAG TGATGAATAT AAGATAGCCA AGAAGATAGG AAAGATCATG
GAGCAAATTT TTGACATAAG CATTCCAGAT TCGGAGATAG GTTATGTTAC CTTACATTTA
ATGGGGGCTA AATTATGGCA AAAGATAGGT GATGACGATT ATAAAAAACT TATCAACCAG
GATTTAGACC AGGAGCTGAT TATTTTAACT AAAGAAATGG TTAAGGTAGC TGAAGACTAT
TTAGGGGTTA AGTTAATTGA TGATACTCAA TTAATAATTG GTCTGGCGTT ACATTTAAAA
CCAACTATTA ATCGTATAAA ATATGACCTA CCATTAAAAA ATCCCTTACT GTTAGATGTT
AAAAGCAGGT ACGGGGAAAT ATTTAAAGCA GCTAAAAGGG CAGCTAAGAT ATTACAGAGC
AAGTTGCAAA AATCAATTAG TTCAGATGAA ATTGGGTATA TTACCCTTCA TCTTGGAGCT
GCTTTAGAGA GGAGTAAATC TAGCAGAAAA TTAAGGGTAG TCTTAGTTTG TTCAAGTGGA
GTAGGAACAA CTAACTTGTT ATCTTCTAGA TTAAGTAAAG AATTTTCTGA GATCAAAATT
TGTAATGTTG TTTCTGTTAT TCAATTAGAG AATAATGAAG TTGATTTAAA AAACATTGAT
TTAATTATTA CAACTATCCC GTTAGATATT GATGATATTT TAGTATTACA GGTTAATCCT
CTATTAAGCC AGAAGGATAA GAAAAATGTT AAAGCTATTA TTCAAAGTAA GCGAGATATT
TTTGATTCCA CTGAAATTAA AGAGGACTCT GAATCTATAA CAGATTATTC CTTTGATATT
GAAGAAGTAA TTGAGGTTAT AGAGCAAGAA GCAATTGTAG GAGATAAAGA TAGCCTAAAA
AAAGATTTAA GAGATTTTTT TGCTTCTAAA GGAGTTAAAG TTTTAGATAA AATAGATGAT
ACAGCTGACA GGCAAAGTAT AGAAGAGAGT GGTAAAGGAC TATTAGAATT GTTAACTGAA
AATAATATAG CAGTAATTGA TAAGGTAGAT AATTGGAGAA CAGGAGTTAG AGTAGCAGCA
AAGCCTTTAG TAGATCAGGG TCATATATTA GAAGAATATG TTGAGCGAAC GATAGAAGTC
ATTGAACAAA AAGGTGCTTA CGTAGTAATA TCCCCTCATA TTAGTTTAAT ACATGCTAGA
CCTGAAGATG GAGTTGTAAA AAAGAGTATG GGATTAGGGA TTATAAAAGA GGGAGTTAAT
TTTGGTCATG ATTATGATCC TGTTCATTTA ATTTTTACAT TAGCTCCTAT AGATGAAGTT
TCCCATCTCT CTGCTTTATC TGAACTTTTG AAATTAATTA ATGAATATGG TTTTGTAGAG
AAAATGCTGA CAGTAGATAA CCAACATGAT GTTTTAATTA AGATAGAAGA AATGTTAATT
AAGTAA
 
Protein sequence
MSLKPRVCQL LKYLIDQEKA VSIKQLADRF NVSSRTIRYD LDDIESSISS YDAKLIRKTR 
IGVYLEGKEE ELNKIQEELA NLYGLERILS PKERQHLILF RLFQANEPII IKELEIMLRI
SKSTIIKDLD EVENWLSKHN LILIRKTNYG LEIKGDEIDI RHAMMNVLEE TANEKELVGF
LRQIQKKALE ERNLEPGFFK EFDKLVSGID LTKIESVISF AEKQLGFQFA DEAYASLLVH
LALAISRLLE GKDIQLPEER LKVIKKSDEY KIAKKIGKIM EQIFDISIPD SEIGYVTLHL
MGAKLWQKIG DDDYKKLINQ DLDQELIILT KEMVKVAEDY LGVKLIDDTQ LIIGLALHLK
PTINRIKYDL PLKNPLLLDV KSRYGEIFKA AKRAAKILQS KLQKSISSDE IGYITLHLGA
ALERSKSSRK LRVVLVCSSG VGTTNLLSSR LSKEFSEIKI CNVVSVIQLE NNEVDLKNID
LIITTIPLDI DDILVLQVNP LLSQKDKKNV KAIIQSKRDI FDSTEIKEDS ESITDYSFDI
EEVIEVIEQE AIVGDKDSLK KDLRDFFASK GVKVLDKIDD TADRQSIEES GKGLLELLTE
NNIAVIDKVD NWRTGVRVAA KPLVDQGHIL EEYVERTIEV IEQKGAYVVI SPHISLIHAR
PEDGVVKKSM GLGIIKEGVN FGHDYDPVHL IFTLAPIDEV SHLSALSELL KLINEYGFVE
KMLTVDNQHD VLIKIEEMLI K