Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_19770 |
Symbol | |
ID | 7312792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 2128418 |
End bp | 2129689 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643612423 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002509719 |
Protein GI | 220932811 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 64 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGT TTTTTCTGTT GTTAGTATTA ACAACTCTAT TTGTTGGTAC TCTGGCTGTT TCTGCAGGTG CTACCGAAAT TACTATGTGG GCAATGAATA ATGCTCCATC TGAATTAAAC ATTGCCTGGT TTAATGAAAA GGCTGCAGAA TTTGAAGAAC TGACCGGTAT CAAGGTTAAC TTTGAAGAGA TAGCCTGGTC CAGCTGCATG GAGGTTATTT CAACTGCACT GGCAACCGGT GAGGGCGCGA ATGTAATGCA GGTTGGAACA ACCCAGACAC CTTTCTTTGC AGCTACCGGT GGATTGGTCG AAATAGACAT TCGTGAATTT GGTGGAAAAG ATAATTTTAT GGAAGGCAAT TTAAAGTCCA CAGTACTGGA TGGTAAGTAC TATGGTGTAC CGTGGATTGC AGAAACCAGA GTCCTGTTTT ATAACACAGA AATGTTTGAA AAAGCAGGTG TTGAACCTCC CCAGACATGG GAAGAACTTA TTGAAGTTGG TGAAAAGATA GTTGATGTAT ATGGAGAGGG AACAGCTATT GCTATTGCTG GTACAAATGC ATGGGACCTG ATCCATAACT GGGCTCCGAT GCTATGGACC AGGGGAGGAG ATTTCCTGAC ACCAGACTGG AAACGGGCTG CCTTTAACCT ATCTGAAGCA GGGTATGAAG CAGTAGAATA TTATGTAGAC CTGGTAAGGA ATGGTTTAGC AAGTACAGCA TGTGCTGAAT ATGACCAGTC CCAGGCTGAT TCAGCTTTTG CCAATGGTGA TGTGGCAATG GCCTTCCAGG GACCATGGAA TATTTCAGGT ATAAAGAATG ATAACCCCGA TCTTCCATTT GCAGCTGCTG AACTTCCAGC TGGACCTTAT GGTAGGGCTT CCTTTGCCGG TGGTAGTAAC CTCGTAGTCC GGAAAAATGC TCCACAGGAT GAAATTGAAG CATCAATTAA GTGGATTAAA TTCCTGTTAA GTGATACTAA CCTGACAGAG TATGTTAAAC TTTCTAACAT GTTACCAGCA ACCAAGGATG CATTTTCTGA TCCATTCTTC CAGAGTGAGA TAATGCAGGT ATTTGAAAAA TCATTGAGCT ATGCACATGC TTATCCATCT TTACCTGCAT GGGGTGAAAT TGAACTGGCT ATGAGGACCA GTTTTCAGAA CATTCTTACC GATTATATTG ATGGTGTATA TGATGATAAT ACCGCCAAAA AATACCTTGA TGCTGCTGCA TTAGAGGTAA ACAACATATT AAAGGAACAT AGTGATAAGT AA
|
Protein sequence | MKKFFLLLVL TTLFVGTLAV SAGATEITMW AMNNAPSELN IAWFNEKAAE FEELTGIKVN FEEIAWSSCM EVISTALATG EGANVMQVGT TQTPFFAATG GLVEIDIREF GGKDNFMEGN LKSTVLDGKY YGVPWIAETR VLFYNTEMFE KAGVEPPQTW EELIEVGEKI VDVYGEGTAI AIAGTNAWDL IHNWAPMLWT RGGDFLTPDW KRAAFNLSEA GYEAVEYYVD LVRNGLASTA CAEYDQSQAD SAFANGDVAM AFQGPWNISG IKNDNPDLPF AAAELPAGPY GRASFAGGSN LVVRKNAPQD EIEASIKWIK FLLSDTNLTE YVKLSNMLPA TKDAFSDPFF QSEIMQVFEK SLSYAHAYPS LPAWGEIELA MRTSFQNILT DYIDGVYDDN TAKKYLDAAA LEVNNILKEH SDK
|
| |