Gene Information |
Locus tag | Hore_20470 |
Symbol | |
ID | 7314371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 2208237 |
End bp | 2209457 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643612491 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002509787 |
Protein GI | 220932879 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
|
|
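The coordinate and length fields above are related by simple arithmetic, and a short check makes that explicit. The following is a minimal sketch, not part of the original record: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the unspliced CDS ends in a stop codon (both assumptions are consistent with the values listed).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(2208237, 2209457)
print(length, protein_length(length))  # 1221 406
```

Both results agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields.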
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0000000403753 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGG TTGCTTTACT TGTTGTTACT TTAATGCTTA TTTTTACACT GCCGACCATG GCGGTAACCA AATTAACTAT CTGGGAAAAA TATGATGAGG CTACCCAGGA TCCATATTTC CAGAGTTTAG TAGATGAATT TAACAAAACC CATGAAGACA TTGAAGTTGA AATGCTCCAC TATAATACTG AAGATTTGAG GGAAAACTTC CAGAACGCTA TGATGGCCGG AGAAGGTCCT GATGCTACCA TCAGTCCCTT TGACCATGTC GGTGTTTTTG CCGTTTTAGG TATTGCAGAG CCAATTACTG ATATTTTACC ACAGGACATT AAAGATGCCC TGGTTGACAA TGCTTTACCT GCCATGAGCC TTGATGGAGA AATGTACGGT GTTCCTATTG ATATGGGTAA CCACTTAATG CTACTTTATA ATAAGAAATT TGTCAAAGAA CCACCCAAAA CCTGGGACGA GTTAATTAAA ATTGCCAAAA AACATACTAA AGACCTGGAC GGTGACGGAG TAAATGACCA GTTTGGTCTG GTTTATAACC TGACTGAACC TTTCTGGTTT ATTCCTTTCA TGAGTGGTCA CGGTGGCTGG GTTATGGACG AAAACCGTCA GCCAACCTTA AATACTGATG GTGTAGTTAA TGCCTTTAAA TTTATCAGGG ATCTTAAATT TGAACATAAA ATTGTCCCTG AAGAATGTGA CTATGATATT GCAGATAGCC TCTTTAAAGA AAATAAAGCC GTATTTCTTA TCAATGGTGA CTGGTCCTTA AATGGTTATA AAGCAGTAGA AGGCCTTGAT TTTGGTACAG CAGCCATACC TAAATTCAAA GATTATGAGT GGCCTAAACC GATGATGAGT GGTAATGGTT TCATCATGGC CGAAGGTTTA TCTGAAGAAA AGCAGGAGGC CCTCTTTGAA TTTATCCGCT TTGTACTGGA GAAAGAAAAC CAGGTTCGTA TGGTAAAAGA ACTGAGTATC CTTCCCAGTA CCAAAGCTGC CAGAGAAGTA GAGTTTGAAG ACCCGATATT AAGGGGTTCT ATTGAACAGC TTAACCATAC AAAACCAATG CCTATTGTTC CTGAAATGAG GGCGATCTGG GATGCCCTGA GAGCACCAAT CCAGAATGTT ATGAATGGTA GTGCTACTCC TGAAGATGCT GCCAAAGAAG CTCAGGATCT GGCTGAAGAA GGCGTAGGTG CAATGCATTA A
|
Protein sequence | MKKVALLVVT LMLIFTLPTM AVTKLTIWEK YDEATQDPYF QSLVDEFNKT HEDIEVEMLH YNTEDLRENF QNAMMAGEGP DATISPFDHV GVFAVLGIAE PITDILPQDI KDALVDNALP AMSLDGEMYG VPIDMGNHLM LLYNKKFVKE PPKTWDELIK IAKKHTKDLD GDGVNDQFGL VYNLTEPFWF IPFMSGHGGW VMDENRQPTL NTDGVVNAFK FIRDLKFEHK IVPEECDYDI ADSLFKENKA VFLINGDWSL NGYKAVEGLD FGTAAIPKFK DYEWPKPMMS GNGFIMAEGL SEEKQEALFE FIRFVLEKEN QVRMVKELSI LPSTKAAREV EFEDPILRGS IEQLNHTKPM PIVPEMRAIW DALRAPIQNV MNGSATPEDA AKEAQDLAEE GVGAMH
|
| |
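The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is an illustration under stated assumptions rather than the database's own method: the function names `clean`, `translate`, and `gc_content` are hypothetical, and the codon table is built by hand from the amino-acid assignments that NCBI translation table 11 shares with the standard code. It does not handle the alternative start codons that table 11 permits, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence in this record.
prefix = "ATGAAAAAGG TTGCTTTACT TGTTGTTACT"
print(translate(prefix))  # MKKVALLVVT
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.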