Gene Information |
Locus tag | Hore_00780 |
Symbol | |
ID | 7314296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | + |
Start bp | 90301 |
End bp | 91536 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643610496 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002507834 |
Protein GI | 220930926 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
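The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are inclusive coordinates and that the gene length includes a single terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 90301
end_bp = 91536
protein_length_aa = 411

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is counted in the
# gene length but does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 1236
print(remainder)                # 0
print(expected_protein_length)  # 411
```

Under these assumptions the computed values match the reported Gene Length (1236 bp) and Protein Length (411 aa).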
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.0480859 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAAAC GTATCATTTC TATTTTAACA GTTGGCTTGT TATTAGTGGC AATGTTAACT GGAACGGTAA TGGCCGGGAA AGTTAAATTA CGCTTTTTGC AGCCCGGGGG AAATTTATAT AAGCAGAGTG TTGAGTTTGC CAAGGAGTAT ATGAAACTAC ACCCCAATGT AGAGATAGAA GTAATTGAGG TTGGCTGGAG TGATGCCTAT TCCAAGATAA TGACCATGGT GGCCGCCGGA AATGCCCCGG ATATTATGTA TATTGGAACC AGGTGGATAC CGGCTTTGGC CCAGATGAAT GCTATTCAGC CCCTTGATAA ATTTATCAGT GAAGAGAAGA AGGACCTGTA TTTTGATTCT CTGTTAAAAG GTACCTATTA CCAGGGAAAA CTCTATGCTT TACCACGCTC TTTTTCTACA AAAGCCTTAA TTTACCGGAC TGATTTAATC CCGGAACCAC CTGAAACCTG GGATGAGCTG GTTGAGGTTG CCAAAAGGGT TCAAAAGGAA CATGAAGGTA TATATGGTTT TGGAATAGCA GGAGCAAAAC ATGTTTCTAC CACTACCCAG TTTTTTAATT ATGTCTATCA GAATGGTGGC TCAATCTTCG ATAGTGAGGG AAATATTTTG CTCGATAGTC CCCAGTCAGT TAAGGCCCTT CAGTTTTATG TAGATCTTTA TCGTAAACAT AAAGTGGTTC CCAATCCTAT TGAATATAAC CGTGAAGAAC TACCGAACCT CTTTAAGACC GGGAAAATAG CCATGTTTGT CTGTGGTCCC TGGGCCAAAC CAATGATTGG ACTTGATCCT GATAATGAAA AAGTACCTTA TGCCAGTGCT CCCCTGCCCC GGGGAAGGTA TATGGCAACT ACCCTTGTTT CTGATTCCCT GGTATTATCT TCCCAGAGTG AACATATTGA TGAAGCCTGG AAGTACTTAA ACTGGATAAC CAGCCTGGAG AACCAGAAAA AACATGACCT TATTAATGGA ATGGCTCCGG CTATGGAAAA AGAACTTGAA GACCCGGCAT TTACAGAGGA TCCTTTTTTC AAAACATATG TTGATATGAT TCCTAAAGGT CAGCCCCAGC CTCTACCTCT GGCCTGGGAA CCTTTCCAGG ATGTAATCAC CGGGGCTATT CAAAAGGCTT TACTCGGAAT GGCAACACCT GAAGAAGCCC TAAAGGAAGC AGTTACCAGG ATTGAAGCTG AAAATCTGGC ACCGGTTAAA CACTAA
|
Protein sequence | MSKRIISILT VGLLLVAMLT GTVMAGKVKL RFLQPGGNLY KQSVEFAKEY MKLHPNVEIE VIEVGWSDAY SKIMTMVAAG NAPDIMYIGT RWIPALAQMN AIQPLDKFIS EEKKDLYFDS LLKGTYYQGK LYALPRSFST KALIYRTDLI PEPPETWDEL VEVAKRVQKE HEGIYGFGIA GAKHVSTTTQ FFNYVYQNGG SIFDSEGNIL LDSPQSVKAL QFYVDLYRKH KVVPNPIEYN REELPNLFKT GKIAMFVCGP WAKPMIGLDP DNEKVPYASA PLPRGRYMAT TLVSDSLVLS SQSEHIDEAW KYLNWITSLE NQKKHDLING MAPAMEKELE DPAFTEDPFF KTYVDMIPKG QPQPLPLAWE PFQDVITGAI QKALLGMATP EEALKEAVTR IEAENLAPVK H
|
| |
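The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is illustrative only: it builds the standard codon-to-amino-acid mapping (translation table 11 uses the same amino acid assignments as the standard code), translates the first 30 and last 18 bases of the gene sequence, and provides a helper for GC fraction. The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers, not part of any database tool. To reproduce the reported GC content, the full gene sequence would need to be passed to `gc_fraction`.

```python
from itertools import product

# Standard codon table, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and final 18 bases of the gene sequence above.
head = "ATGTCAAAAC GTATCATTTC TATTTTAACA"
tail = "GCACCGGTTAAACACTAA"

print(translate(head))  # MSKRIISILT
print(translate(tail))  # APVKH
```

The two translated fragments match the first ten and last five residues of the protein sequence listed above.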