Gene Information |
Locus tag | Hore_20870 |
Symbol | |
ID | 7313320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 2260408 |
End bp | 2261700 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643612534 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002509827 |
Protein GI | 220932919 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
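The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state this, but the reported 1293 bp only works out under that convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative, not part of the record.

```python
# Values copied from the record; names are hypothetical.
start_bp = 2260408
end_bp = 2261700

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: one 3 bp codon per residue, plus one terminal stop codon
# that is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1293
print(protein_length)  # 430
```

Both results match the Gene Length and Protein Length rows of the record.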
Plasmid Coverage information |
Num covering plasmid clones | 53 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG GGTTATCTTT AATACTTGTT ATTTCATTAA TGCTGGTATT AAGTAGTGTT GTTTTTGCAG AAGAACAGGT TGAAATTACA ATTGCTGGTG GTAGTGTAGG TATCGAGCTC GACCTGACCA AAGAAGCAGC TCAATTATAT ATGGAAAGAC ACCCCAATGT AAAGGTTAAT GTACTGGACA CACCTGATTT AGCCAATGAC AGGCTCGGGT TATATCTTCA GTTTTTGGAA GCAAAAAGCC CCAAAATTGA CGTCTACCAG ATTGATGTTA TCTGGCCCGG GGATTTAGCT GAACATTTTG TTGATCTTTA TGAGTATGGG GCTGAAAAAT ATGTTGATGA TCACTTCCAG CCTATTGTAG AAAATAATAC TGTTGAGGGC AGACTGGTAG CTATGCCCTG GTTTACTGAT GCCGGTCTTC TTTACTACCG TAAGGATCTT CTTGAAAAAT ATGACCTGGA AGTCCCCAAA ACATGGGAAG AATTAGAAAG GGCGGCCAAG ATTATTCAGA CCGGAGAAAG GGCTGCCGGA AACCAGGACT TCTGGGGTTA TATCTGGCAG GGTAATGCTT ATGAAGGTTT AACATGTGAT GCCTTAGAAT GGGTTGCCTC CAATGGTGGA GGAACTATTA TCAGTCCTGA CAAAAAAATT ACTATTAACA ATGAAAAGGC AATAGAAGCC ATCGAAATGG CCGCTGACTG GGTAGGCTGG ATTTCTCCTC CAGGAACAAC TGGTCTTGTT GAAGAAAGCA CCCGTAAGAT GTGGGAAGCA GGTAATGCCG CCTTTATGAG AAACTGGCCT TACTGTTATA AACTTGGTAA TGCTGAAGGG TCTGCCATCA AAGGCAAGTT TGATGTAGCT CCCCTACCGG CCGGTGATAG TGGTAACGGG GCTGCTACCC TCGGTGGTTG GAACCTGGCT GTAAGCAAGT ACAGTGAACA CCCTGAAGTT GCCGCTGATT TTGTTTTCTT CCTGACCGGT TATGAAATTC AGAAACTCCG GGCTACCAAA GGTTCCTTTA ATCCGACCAT TAAAGCCCTT TATGAAGATG AAGAAGTCCT GGAAGCTAAC CCCTTCTTTG GTAAGCTTTA TGATGTTTTT GTAAATGCTG TTGCCCGTCC TTCTACTGCC ACTGCTCCTA ACTATAATGA AGTCTCCAGG TTATTCTTCC AGGCTGTACA TTCAGTCCTT TCTGGTGAAA TGGATGCCAG GACTGCAGTG GAATACTTAG AATTAGATCT TCAGGATTTA ACCGGTTTTG AAATTGGTGA ACCTCAAAAA TAA
|
Protein sequence | MKKGLSLILV ISLMLVLSSV VFAEEQVEIT IAGGSVGIEL DLTKEAAQLY MERHPNVKVN VLDTPDLAND RLGLYLQFLE AKSPKIDVYQ IDVIWPGDLA EHFVDLYEYG AEKYVDDHFQ PIVENNTVEG RLVAMPWFTD AGLLYYRKDL LEKYDLEVPK TWEELERAAK IIQTGERAAG NQDFWGYIWQ GNAYEGLTCD ALEWVASNGG GTIISPDKKI TINNEKAIEA IEMAADWVGW ISPPGTTGLV EESTRKMWEA GNAAFMRNWP YCYKLGNAEG SAIKGKFDVA PLPAGDSGNG AATLGGWNLA VSKYSEHPEV AADFVFFLTG YEIQKLRATK GSFNPTIKAL YEDEEVLEAN PFFGKLYDVF VNAVARPSTA TAPNYNEVSR LFFQAVHSVL SGEMDARTAV EYLELDLQDL TGFEIGEPQK
|
| |
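The sequence fields are displayed in space-separated blocks of ten characters, so they need light cleaning before any computation. The following is a minimal sketch, not the database's own tooling: the helpers `clean_sequence`, `gc_fraction`, and `translate` are hypothetical names, and the codon table is built from the amino acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs only in its alternative start codons). The demonstration uses only the first three blocks of the gene sequence; the same calls should apply to the full sequence.

```python
# Codon table in the conventional TCAG ordering (table 11 amino acid
# assignments, identical to the standard code).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N; this sketch does not
    handle them.
    """
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three blocks of the gene sequence, copied from the record.
prefix = clean_sequence("ATGAAAAAAG GGTTATCTTT AATACTTGTT")
print(translate(prefix))  # MKKGLSLILV
```

The translated prefix agrees with the first block of the protein sequence in the record. Applying `gc_fraction` to the full cleaned gene sequence should give a value comparable to the reported GC content, and `len(clean_sequence(...))` gives a quick check against the Gene Length row.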