Gene Hore_20180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_20180 
Symbol 
ID7314342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2176340 
End bp2177605 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content36% 
IMG OID643612462 
Productputative sugar-specific permease SgaT/UlaA 
Protein accessionYP_002509758 
Protein GI220932850 
COG category[S] Function unknown 
COG ID[COG3037] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones56 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATTTT TAGATTTTTT AGTTAATGAT ATATTAAGTG AGCCAGCTGT TTTAATTGGA 
GTAATGACAT TTATTGGTCT GGTAGCTGCT AAAAAGAACT TCTCTCAAAT TATGTCAGGG
ACTTTTAAAA GTATTATAGG TTTTGTTATC CTGGGGGCAG GGGCAGGGGT TTTAGTTCAA
AGTTTAAATA ATTTAGGGCC AATCATTACA GAAGCCTTTA ATATTCATGG GGTAGTTCCA
ACTAATGAAG CTGTAGTAGC TGTAGCACAA AAAACATTAG GAAAAGAAAC TGCTTTAATT
ATGGGATTTG GCTTTCTAGC CAATCTGGCT TATGCTCGTT TTACTCCATT GAAGTATATA
TTTTTGACAG GACATCATAC GTTCTTTATG GCGGCTTTAT TGGCAGCTGT ATTGGGAACT
GCAGGCTTAA CTGGAGCTCC ATTAGTAATT GTAGGGTCAG CAATATTAGG TTTTCTAATG
GTTTTGATGC CGGCATTGGC TGACTCGTTT ATGAAAGAAA TAACTGGCAG TGATGATATT
GCTTTAGGAC ATTTTGGAAC TACTGCTTAC GTTGTTTCTG GTTTTATAGG CAAGTTAGTA
GGGAATCCAG AAGATTCTAC AGAAGATATT GAAGTTCCAA AGTCATTGGG CTTTTTAAAA
CAGTCATTAT TGTCGACTGC TATAACAATG ACAGTTATCT TTTTAATTAT TGTTCTAAAG
GCTGGTCCAG AAATTGTTAG TAAATATGCA GGAGACCAGA GCTTATTTAT GTTTGCTGTA
ATGCAAGGGA TTACTTTTGC TGCAGGAGTT AGTATTATCA TGTCAGGTGT AAGAATGATT
TTGGGAGAAA TTGTCCCAGC TTTTGAAGGA ATCGTTGAAA AAGTAGTTCC AGATGCTAAA
CCAGCATTAG ATTGCCCGGT TACTTTCAAT TTTGCTCCTA CAGCTGTAAC TATTGGTTTC
TTATCTAGTT TTTTAGGTGG AATTGTAGGA ATGTTTTTGC TAGGGCCATT AGGACTGGCT
TTAATTATTC CAGGACTAGT ACCTCATTTT TTCTGTGGGG CTACTGCTGG AGTATTTGGT
AATGCTACTG GTGGGAAAAA GGGAGCTGTT TTAGGAGCAT TTGTTCATGG AATTATGATT
ACTTTCTTGC CAGCTTTATT ACTTCCGGTG TTAGGAAATT TAGGATTTGC TAATATTACT
TTTGGAGATG CTGATTTTGG AGTTGTAGGT ATTATTATTG GAACGATAGC TAAATTATTT
AGTTAA
 
Protein sequence
MQFLDFLVND ILSEPAVLIG VMTFIGLVAA KKNFSQIMSG TFKSIIGFVI LGAGAGVLVQ 
SLNNLGPIIT EAFNIHGVVP TNEAVVAVAQ KTLGKETALI MGFGFLANLA YARFTPLKYI
FLTGHHTFFM AALLAAVLGT AGLTGAPLVI VGSAILGFLM VLMPALADSF MKEITGSDDI
ALGHFGTTAY VVSGFIGKLV GNPEDSTEDI EVPKSLGFLK QSLLSTAITM TVIFLIIVLK
AGPEIVSKYA GDQSLFMFAV MQGITFAAGV SIIMSGVRMI LGEIVPAFEG IVEKVVPDAK
PALDCPVTFN FAPTAVTIGF LSSFLGGIVG MFLLGPLGLA LIIPGLVPHF FCGATAGVFG
NATGGKKGAV LGAFVHGIMI TFLPALLLPV LGNLGFANIT FGDADFGVVG IIIGTIAKLF
S