Gene Information |
Locus tag | Hore_20520 |
Symbol | |
ID | 7314376 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 2215698 |
End bp | 2216951 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643612496 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002509792 |
Protein GI | 220932884 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
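
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which the listed values imply) and that the unspliced CDS ends in a single stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record: Start bp, End bp.
length = gene_length(2215698, 2216951)
print(length, protein_length(length))  # 1254 417
```

Both results agree with the Gene Length (1254 bp) and Protein Length (417 aa) fields of the record.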
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 0.561553 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAA GGGTATTAAT GGTTTTGAGC CTGGTCCTGG TATTGTTGGT TTCTCTGTCC ACTGTTTCTA TGGCTAAAAA AGTCTTAACT ATTAACAGTT ATATCAGTGA CCCTGTCCCC AAAGAAGCTT TTGAAGATGT TATTAAGGCC TTTGAAGAAG CACACCCTGA TATTGATGTC CGGGTAAGTA CTACTGCTCA TGAAGATTTT AAAAAGGCCC TGAGGATCTG GTTAAGTTCT GATAATCCAC CAGATGTCAT CACCTGGTTT GCCGGTAACA GGGCAAAATA TTTTATTAAC AAGGGTTTAA TTATGGCTAT AACTGATGTC TGGGAAGAAG CGGATCTTTA CAATAAATTC CCCCGGGCTT TCAGGAGTAT AAGTTTTGTT AATGGAAAAG CTTATTTCCT TCCTTATAAC TGGTACTGGT GGGGAATGTT TTACCGTAAG TCCATCTTTG ATAAGTATGG GCTTGAAGAA CCCCGGACCT GGGATGAATT TTTAGATGTC TGTGAAACCC TCAAACAAAA CGGTATTACT CCGATAACAA TCGGGACCAA ATACCGCTGG ACTGCTACTG GGTGGTTTGA TTATCTCAAC ATGAGGGTTA ATGGGCCCGA ATTTCATATC AGGTTGATGG AAGGTAAAGA GAAATATAAT GATCCCCGGG TTAAGAAGGT ATTTGAGTAC TGGCGTCAGT TACTGGATAG GGGATATTTT GTTGACAATG CGGCTGCCTA TTCCTGGCAG GAAGGTGTAA GGTTTATGGT TAAGGGAGAA GCTGCAATGT ACTTGATGGG TCAGTTTATT CTGGATGCTG TTCCTGAGGA GGTAGCTAAA GACCTTGACT TTTTCCGCTT CCCTATAATT AATGAAGATG TACCTATTGG GGAAGATACT CCTACTGATG GGTTTATGAT TCCTAAGAAA GCTAAAAACC CGGAACTTGC TAAAGAATTC CTCAAGTTCC TGGCTTCCAG AGAAGGGCAG ATGATCTTTA TAGAAAAAAC AGGCCGTATC GGGGTTAATA ATGAAATTCC AATGGATTCC TACCCGCCTC TAACCCAGAA GGGTGTTAAG ATGATTCAGG GAACCGATGC CCTGGCCCAG TTCTATGACA GGGATACACC TCCAACTATG GCTGATAAAG GTATGAACGG ATTAATGAAT TTCTGGGCAT ACCCTGATCA GATAGATAAA ATTCTTGATA ACCTTGAAAG GCAGAGACAG ATGATTTTTT CAGAACAGGA ATAA
|
Protein sequence | MSKRVLMVLS LVLVLLVSLS TVSMAKKVLT INSYISDPVP KEAFEDVIKA FEEAHPDIDV RVSTTAHEDF KKALRIWLSS DNPPDVITWF AGNRAKYFIN KGLIMAITDV WEEADLYNKF PRAFRSISFV NGKAYFLPYN WYWWGMFYRK SIFDKYGLEE PRTWDEFLDV CETLKQNGIT PITIGTKYRW TATGWFDYLN MRVNGPEFHI RLMEGKEKYN DPRVKKVFEY WRQLLDRGYF VDNAAAYSWQ EGVRFMVKGE AAMYLMGQFI LDAVPEEVAK DLDFFRFPII NEDVPIGEDT PTDGFMIPKK AKNPELAKEF LKFLASREGQ MIFIEKTGRI GVNNEIPMDS YPPLTQKGVK MIQGTDALAQ FYDRDTPPTM ADKGMNGLMN FWAYPDQIDK ILDNLERQRQ MIFSEQE
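
Both sequences above are printed in space-separated blocks of ten characters, so the grouping has to be removed before any downstream analysis. The following is a minimal sketch of that cleanup together with a GC-fraction helper. The helper names are hypothetical, and the record's GC content of 41% is not recomputed here; only the first two blocks of the gene sequence are used as a short example.

```python
def normalize(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (whitespace ignored)."""
    cleaned = normalize(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    return (cleaned.count("G") + cleaned.count("C")) / len(cleaned)


# First two ten-base blocks of the gene sequence in this record.
first_blocks = "ATGAGTAAAA GGGTATTAAT"
print(normalize(first_blocks))    # ATGAGTAAAAGGGTATTAAT
print(gc_fraction(first_blocks))  # 0.25
```

Passing the full gene sequence string to `gc_fraction` would give the value to compare against the GC content field.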