Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_04610 |
Symbol | |
ID | 7314440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | + |
Start bp | 493986 |
End bp | 495281 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643610884 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002508214 |
Protein GI | 220931306 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 58 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAAAC TATTACTTTT GTTAATGGTG GTTAGTTTAT TGGCAGGGGT AGCTGCGACA GGGTTGGTTA TGGCAAAAGA ACCAATTGAA ATTAAGTTTG TTAGTCTGGC CTGGCAAAAG CAATCTATTG AGGCTAATAA AGAAATTGTA GCTGAGTGGA ATAGAACTCA CCCTGATGTA CAGGTAAAAT ATATTCAGGG GACATGGGGT TCAATCCATG ATTATATGAT TACTGCTTTT GAAACAGGTT CTGTACCTGA TGTTTTTCAC TATGAATCTG CTGCGATAGT GGGTTTTGCC CAAAAGGGGT ATCTGGCAGA ACTTAATTCA TTAATGTCTG AAGATTTAAA GAATGACATA CTTGATGAAG CCTGGAAGAC TACCCAGCTT GAAAATGGTA AAATCTATGG TGTACCATTC CTGTGGGAAT CTCAGATTAC ATTATATAAC AAAGCCCTGT TTAAAGAAGC CGGGATTACT CCACCAACTA TTGATAATCC ATGGACCTGG GAAGACTTAA GAGAAGCTGC TAAAAAGCTG ACCAAAGATA CTGATAATGA CGGTGAAATT GATCAATGGG GTGTTGGTTT AGGTTTAAAA AGTCCGGCTA AAAAAATGCT CAGATTATCT GTGGGCTTTG GTGGAAAGTT CTTTAAAAAG GAAAATGGTG AATATCATGT TGAGGTAGGG GAAGCAGAAA AGAAATTGTT AAAACAGTTT TATGCTATGC TCTATGAAGA TAAAACAGCT CCTCTATCAG GTATAGGTCA ATCAGGTAGT AGTATGATTC CTGGTTTTCT TGCCGGTAAA TATGCAATGG TACCCAGTGT TGGTGTCTGG GCCAGGCAGC AGGTTGTTGT TAATGCCCCT GAAAATTTTG AATGGGGAGT AATTCCCCCA ATTAAGGCCA AAACTCAGGC CCAGGGTGTT GGTACCCAGA CTTTAAGTAT TCCATCTGCA TCTAAATATA AAAAAGAAGC CATGGAATTT ATTGAATTTT TCTTGAACAC CAGGAATATG GCAAGACTGG CAAAAGGAGA CTGGATGCTT CCTACCAGAA AATCAACTAT GAATTTACCT ATGTTCCAGA CCGATGAAAA TGGCTGGAAG GTTGCCATGA ATTCAGCTAA GTGTCTTGAA GCAGGTCCCT GGCAAAATAT ACCAGGATTT CCTGAATGGA AAAACAGGGT TGGTAATCCA GTTATTCAGC TATACTTAAA AGATAAAATA TCATTAGAAG ATGCTGCTAA AAGGCTAGAG AGAGAAGGAA ACAGGATCCT GCAACGTTAT AAATAA
|
Protein sequence | MKKLLLLLMV VSLLAGVAAT GLVMAKEPIE IKFVSLAWQK QSIEANKEIV AEWNRTHPDV QVKYIQGTWG SIHDYMITAF ETGSVPDVFH YESAAIVGFA QKGYLAELNS LMSEDLKNDI LDEAWKTTQL ENGKIYGVPF LWESQITLYN KALFKEAGIT PPTIDNPWTW EDLREAAKKL TKDTDNDGEI DQWGVGLGLK SPAKKMLRLS VGFGGKFFKK ENGEYHVEVG EAEKKLLKQF YAMLYEDKTA PLSGIGQSGS SMIPGFLAGK YAMVPSVGVW ARQQVVVNAP ENFEWGVIPP IKAKTQAQGV GTQTLSIPSA SKYKKEAMEF IEFFLNTRNM ARLAKGDWML PTRKSTMNLP MFQTDENGWK VAMNSAKCLE AGPWQNIPGF PEWKNRVGNP VIQLYLKDKI SLEDAAKRLE REGNRILQRY K
|
| |