Gene Information |
Locus tag | Hore_16190 |
Symbol | |
ID | 7312655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 1736004 |
End bp | 1737209 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643612066 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002509363 |
Protein GI | 220932455 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.00305219 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG TTTTAACTTT ACTCTCTGTT TTAGTTTTAG TTGTTGGTAT GACTTTAAGT GCCAGTGCTG TAAAGCTGGT AGTCTGGGAA TCTCCCGGTC CTGAAGAAGA ATTTATTCAG GAAATGGGTA AAATTTACAC TGAACAAACC GGTGTTGAAA TTGAGGTGCA ACCAGTAGAC CAGATTAACC AGGATGATAA ACTGGCCCTG GATGGTCCTG CTGGAAAGGG AGCCGATATT GTTGTCTGGC CCCATGATGG TATCGGTCGT TCTGTAGAGC AGGGTTTAAT CTGGCCTATT CCTGAAGATA AGGTGGATAC CAGTGCCTTT ACAGAGTCTT CTCTCAACGC GCTGACTTAC AAGGGTAAGC TATATGGATT ACCATATGCT GTTGAAAGTG TGGCCCTGTT ATATAACAAG GATCTGTTAC CAGAAGTACC TGAAACCTTT GATGAGTTTT TGGCTAAAGT AAAAGAACTA AATAAACCGG CTGAGGGCCA GTTTGGTTTT ATGGCCAACA TCGGTGACCT CTACCACGTT TTCGGGTTTA TCTCCGGTTA TGGTGGTTAT ATCTTTAAAC AGACCGAAAA TGGTCTTGAC ATAAATGATA TCGGTCTGGA TAGCCCTGGT GCTATTAAAG CCATGAAGTT TATAAAGAGC TTCAGGACCT CAGGTTTAAT GCCTGAAGGT ACTACCGGTG ATGTTATGAA TGGTCTCTTT TCCCAGGGTT CTCTGGCAGC TGTTATTGAC GGACTATGGG CTTTAGAAGG TTATCGTGAA GCCGGGGTTA ACTTTGGTGT TGCTCCCCTG CCCAGGCTTG ATAATGGTGA ATATCCCCAT ACCTTCATAG GCGTTAAAGG TTACTACATC AGTGCCTTCA GTGAACATAA AGAAGAAGCC CTGAAATTCA TTCAGTGGTT AACCACTAAA GAGAATTCCT TTAAACATTA TCAGAAGACA TATGTAATTC CTCCACGTAA AGATGTAATG GAAATGCCTG AATTTAAAGA AAATAAAGTT GTTGAAGCTT TTGCTATTCA GGCTTCAAGG GGTATGCCAA TGCCGAATGT ACCTGAAATG ATGGCTGTAT GGGAACCGGC TAATAATGCC CTTTCTTTCA TCCTTCAGGA TCAGGTTACA CCTGAAGAAG CAGCTAAACT CTGTGTTCAG AGAATCCAGG ATAATATTGA AATGATGAAA GAATAA
|
Protein sequence | MKKVLTLLSV LVLVVGMTLS ASAVKLVVWE SPGPEEEFIQ EMGKIYTEQT GVEIEVQPVD QINQDDKLAL DGPAGKGADI VVWPHDGIGR SVEQGLIWPI PEDKVDTSAF TESSLNALTY KGKLYGLPYA VESVALLYNK DLLPEVPETF DEFLAKVKEL NKPAEGQFGF MANIGDLYHV FGFISGYGGY IFKQTENGLD INDIGLDSPG AIKAMKFIKS FRTSGLMPEG TTGDVMNGLF SQGSLAAVID GLWALEGYRE AGVNFGVAPL PRLDNGEYPH TFIGVKGYYI SAFSEHKEEA LKFIQWLTTK ENSFKHYQKT YVIPPRKDVM EMPEFKENKV VEAFAIQASR GMPMPNVPEM MAVWEPANNA LSFILQDQVT PEEAAKLCVQ RIQDNIEMMK E
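The coordinates and lengths listed under Gene Information can be cross-checked against each other with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon. The function names are illustrative and do not belong to any database API.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1736004
END_BP = 1737209
GENE_LENGTH_BP = 1206
PROTEIN_LENGTH_AA = 401


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues in a complete CDS: one per codon, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def clean_sequence(text: str) -> str:
    """Remove the spaces that separate the 10-character display blocks."""
    return "".join(text.split())


if __name__ == "__main__":
    # Compare the computed values with the ones listed in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons agree with the listed 1206 bp and 401 aa. `clean_sequence` can be applied to the Gene sequence or Protein sequence field before counting characters.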