Gene Hore_04610 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_04610 
Symbol 
ID7314440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp493986 
End bp495281 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content39% 
IMG OID643610884 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002508214 
Protein GI220931306 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones58 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAAAC TATTACTTTT GTTAATGGTG GTTAGTTTAT TGGCAGGGGT AGCTGCGACA 
GGGTTGGTTA TGGCAAAAGA ACCAATTGAA ATTAAGTTTG TTAGTCTGGC CTGGCAAAAG
CAATCTATTG AGGCTAATAA AGAAATTGTA GCTGAGTGGA ATAGAACTCA CCCTGATGTA
CAGGTAAAAT ATATTCAGGG GACATGGGGT TCAATCCATG ATTATATGAT TACTGCTTTT
GAAACAGGTT CTGTACCTGA TGTTTTTCAC TATGAATCTG CTGCGATAGT GGGTTTTGCC
CAAAAGGGGT ATCTGGCAGA ACTTAATTCA TTAATGTCTG AAGATTTAAA GAATGACATA
CTTGATGAAG CCTGGAAGAC TACCCAGCTT GAAAATGGTA AAATCTATGG TGTACCATTC
CTGTGGGAAT CTCAGATTAC ATTATATAAC AAAGCCCTGT TTAAAGAAGC CGGGATTACT
CCACCAACTA TTGATAATCC ATGGACCTGG GAAGACTTAA GAGAAGCTGC TAAAAAGCTG
ACCAAAGATA CTGATAATGA CGGTGAAATT GATCAATGGG GTGTTGGTTT AGGTTTAAAA
AGTCCGGCTA AAAAAATGCT CAGATTATCT GTGGGCTTTG GTGGAAAGTT CTTTAAAAAG
GAAAATGGTG AATATCATGT TGAGGTAGGG GAAGCAGAAA AGAAATTGTT AAAACAGTTT
TATGCTATGC TCTATGAAGA TAAAACAGCT CCTCTATCAG GTATAGGTCA ATCAGGTAGT
AGTATGATTC CTGGTTTTCT TGCCGGTAAA TATGCAATGG TACCCAGTGT TGGTGTCTGG
GCCAGGCAGC AGGTTGTTGT TAATGCCCCT GAAAATTTTG AATGGGGAGT AATTCCCCCA
ATTAAGGCCA AAACTCAGGC CCAGGGTGTT GGTACCCAGA CTTTAAGTAT TCCATCTGCA
TCTAAATATA AAAAAGAAGC CATGGAATTT ATTGAATTTT TCTTGAACAC CAGGAATATG
GCAAGACTGG CAAAAGGAGA CTGGATGCTT CCTACCAGAA AATCAACTAT GAATTTACCT
ATGTTCCAGA CCGATGAAAA TGGCTGGAAG GTTGCCATGA ATTCAGCTAA GTGTCTTGAA
GCAGGTCCCT GGCAAAATAT ACCAGGATTT CCTGAATGGA AAAACAGGGT TGGTAATCCA
GTTATTCAGC TATACTTAAA AGATAAAATA TCATTAGAAG ATGCTGCTAA AAGGCTAGAG
AGAGAAGGAA ACAGGATCCT GCAACGTTAT AAATAA
 
Protein sequence
MKKLLLLLMV VSLLAGVAAT GLVMAKEPIE IKFVSLAWQK QSIEANKEIV AEWNRTHPDV 
QVKYIQGTWG SIHDYMITAF ETGSVPDVFH YESAAIVGFA QKGYLAELNS LMSEDLKNDI
LDEAWKTTQL ENGKIYGVPF LWESQITLYN KALFKEAGIT PPTIDNPWTW EDLREAAKKL
TKDTDNDGEI DQWGVGLGLK SPAKKMLRLS VGFGGKFFKK ENGEYHVEVG EAEKKLLKQF
YAMLYEDKTA PLSGIGQSGS SMIPGFLAGK YAMVPSVGVW ARQQVVVNAP ENFEWGVIPP
IKAKTQAQGV GTQTLSIPSA SKYKKEAMEF IEFFLNTRNM ARLAKGDWML PTRKSTMNLP
MFQTDENGWK VAMNSAKCLE AGPWQNIPGF PEWKNRVGNP VIQLYLKDKI SLEDAAKRLE
REGNRILQRY K