Gene Information |
Locus tag | Hore_03340 |
Symbol | |
ID | 7314720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | + |
Start bp | 344409 |
End bp | 345728 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643610757 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002508090 |
Protein GI | 220931182 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGTTAAAA AAACTTTTCT ATTAATGCTG GTTGCTTTGT TTTTAGTTTC GTTGACAGCT GTAGTAGGTG CTGCTGAATT TGATTGGAAG CGTTTTGAAG GTGAAACAAT TAAGTTATTA CTTAACAAGC ACCCTTATAC AGACGGTGTT TTAAAAGAAT TAGACAAATT TGAAAAAATG ACCGGGATAA ATGTGGAATA TGATATTTTA CCAGAGGAAC AGTATTTTAA TAAAGTCACT GTTACCCTAT CTTCAGGTTC CAGTGAATAT GATATTTTCA TGACAGGTGC TTATCAGGTT TGGCAGTATG CACCTCCAGG ATGGATGGAA CCTTTAGATA AATATATTGA AGATAATAGT TTAACCAGTC CAGATTGGGA TAAAAATGAT TTTATTCCCG GTATTATTAA TTCCTTGAAA TGGAACCTTC AACCGGGAAG TGAACTTGGA ACCGGTAAGC AATGGGCTTT ACCACTAGGT TGGGAAGAAA ATTGTTTAGT TTATAGAAAA GATATTTTTG AAAAATATAA CCTAAAAGTT CCTGAAACTT TAGATGAAGT AATTGAAGTA GGAAAAATAA TAGAAGAAAA GACTGACCTT ACAGGTATAG TAGTTAGGGG AACCCGTAGC TGGGCTACGA TTCACCCTGG ATTTTACTCT GGTCTGGTTG CTTCTGGTGG TTTTGATTAT GATGGTAATC TAAAACCACA GATGAATAGT GAAGTAGCCC TTGAGTTTAC AAAAAAATGG GTAGAAATGA TTAAGGAAGT GGGTCCCGAA CAATGGACCA CCTATACCTG GTATGATGTT TCTAATGCAC TTGGGTCTGG TAAAGCAGCT ATGATGTATG ATGCTGATAC TCTTGGCTTC TGGGCTGACC ACGCTGATGA AAGATTAGCA TGGGCCCCTG GTCCTGGTTT AAATAAAAAA GCTGATAAAA CAAATGTCTG GATCTGGTCT CTGGCTATGA GTGCCCATTC AAATTCAAAA GAACCAGCAT GGTACTTCTT ACAATGGGTA ACCGGTAAAG AATTCCTGAC TACTGCTGCT GTAAAACACG ATGGATTAAA CCCGGTACGC AAATCCATCT GGGAAAATCC AGATTTTAAA GAAAGGTTAG CTAAGTTTGA TGGTTATTAT AAAACCTATA AAAAAATTGA TGAAGCTGCA GTTCAATTTA CACCTCAACC GATGTTCTTC CAGACTACAA CTGAATGGGC AGCGGCTCTG CAGAAAATAG TAAATGGTGA AGATGCTGAA GAAGTATTAA ATAAATTAGT AGAAGATTTA GATAAGAAAA TGGAGTCAAT TAGACGTTAA
Protein sequence | MVKKTFLLML VALFLVSLTA VVGAAEFDWK RFEGETIKLL LNKHPYTDGV LKELDKFEKM TGINVEYDIL PEEQYFNKVT VTLSSGSSEY DIFMTGAYQV WQYAPPGWME PLDKYIEDNS LTSPDWDKND FIPGIINSLK WNLQPGSELG TGKQWALPLG WEENCLVYRK DIFEKYNLKV PETLDEVIEV GKIIEEKTDL TGIVVRGTRS WATIHPGFYS GLVASGGFDY DGNLKPQMNS EVALEFTKKW VEMIKEVGPE QWTTYTWYDV SNALGSGKAA MMYDADTLGF WADHADERLA WAPGPGLNKK ADKTNVWIWS LAMSAHSNSK EPAWYFLQWV TGKEFLTTAA VKHDGLNPVR KSIWENPDFK ERLAKFDGYY KTYKKIDEAA VQFTPQPMFF QTTTEWAAAL QKIVNGEDAE EVLNKLVEDL DKKMESIRR
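The coordinates and lengths in the record above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers introduced here for illustration.

```python
# Values copied from the record above.
START_BP = 344409
END_BP = 345728
GENE_LENGTH_BP = 1320
PROTEIN_LENGTH_AA = 439


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of amino acids for an unspliced CDS.

    Assumes the final codon is a stop codon and is not translated.
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


if __name__ == "__main__":
    # Both checks hold for the values listed in the record.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the Gene sequence field to `gc_percent` (the space-separated blocks can be left in place) gives a value that can be compared with the GC content field.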
| |