Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_15560 |
Symbol | |
ID | 7312591 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 1666490 |
End bp | 1667746 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643612002 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002509300 |
Protein GI | 220932392 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGGA GTCTTATTTT AACACTATCA GTTTTCCTGG TACTGGTCTT GTCCGTATCC GCATTTGCTG CTACTGAAAT TACCGTATGG TATCACTCCG GCAGGGGTGG GGAAAGAGAA GTAATTGAGG ATCAGGTTAA AAGGTTTAAT GCCATGCAGG ATGAAGTCAA GATAAAACTT GTTCAGTTGC CTGAAGGTAG TTATAACGAA CAGGTTCAGG CTGCTGCCAT GTCAGGGGAC CTGCCCGATG TACTTGACCT TGATGGTCCT TTTATTGCCA ACTATGCCTG GTCCGGGTAC CTTCGTCCCT TAGAAGATTA TGTTAGTCCG GAACTTAAAG AGGATCTCTT ACCCTCTATT TTAGCCCAGG GAACCTACCA GGGTCACCTG TATGCTCTGG GAACCTTTGA TTCAGGACTG GCTATCTGGG GTAACAAGGA ATATTTAGAA GATGTCGGAG CCCGTATTCC AACCAGTGTT GAAGATGCCT GGACATTTAC TGAATTTATG GATATCCTGA AAAAGCTTAA AGAACATCCT GATGTAAAAT ATCCACTCGA CTTTAAAATT AACTATGGTA AGGGTGAATG GTTCAGCTAC GGTTTCTCTC CTATTTTCCA GGCTTTTGGA GCCGATTTAA TTAATCGTGA TAACTTCACT ACTGCAGAAG GTGTTTTAAA CGGACCGGAA GCTATGGCTG CTGCCTGGTT CCAGGCCTTG TTTGAACAGG GTTATGCTAA TCCTAACCCT CCGGGAGATA CTGAGTTTAC AAATGGTGAT GCTGCATTAT CATGGTGTGG ACACTGGGGT TATAATCAGT ATAAGGATGC CCTCGGTGAT GATGTTGTCC TGATTCCCAT GCCCAAATTC GCAACCCAGG TAACAGGAAT GGGTTCCTGG GCCTGGAGTA TAACTCAAAA CTGTGAAAAT CCAGAAGCAG CCTGGAAGTT CATAGAGTTT ATATTACAGC CAGAAGAAAT AGTAAAGATG ACCAATGCCA ATGGTGCTGT ACCATCCAGA CTTTCTGCCG CCAAATTATC AGAACCCTAT AAGCCCGGTG GAGAATTAAG GATTTTTGTT GAACAGCTGC AGAAAATAGC TGTAGAACGC CCTGTAACTC CAGCTTATCC GACTATTACT GATGCCTTTG CTACTGCTAT TGATAACATT ATTAACGGTG GCGACATCAG GTATGAACTC AACGAAGCCG TTAGAGCAAT TGACGAAGAA ATTGAGTTTA TGGGTCTTGC TCAATAA
|
Protein sequence | MKRSLILTLS VFLVLVLSVS AFAATEITVW YHSGRGGERE VIEDQVKRFN AMQDEVKIKL VQLPEGSYNE QVQAAAMSGD LPDVLDLDGP FIANYAWSGY LRPLEDYVSP ELKEDLLPSI LAQGTYQGHL YALGTFDSGL AIWGNKEYLE DVGARIPTSV EDAWTFTEFM DILKKLKEHP DVKYPLDFKI NYGKGEWFSY GFSPIFQAFG ADLINRDNFT TAEGVLNGPE AMAAAWFQAL FEQGYANPNP PGDTEFTNGD AALSWCGHWG YNQYKDALGD DVVLIPMPKF ATQVTGMGSW AWSITQNCEN PEAAWKFIEF ILQPEEIVKM TNANGAVPSR LSAAKLSEPY KPGGELRIFV EQLQKIAVER PVTPAYPTIT DAFATAIDNI INGGDIRYEL NEAVRAIDEE IEFMGLAQ
|
| |