Gene Information |
Locus tag | Hore_14550 |
Symbol | |
ID | 7313995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 1546129 |
End bp | 1547427 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643611895 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002509199 |
Protein GI | 220932291 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
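The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based and inclusive and that the gene length includes the terminal stop codon. Both assumptions are consistent with the values reported in this record.

```python
# Values copied from the Gene Information fields above.
START_BP = 1546129
END_BP = 1547427
GENE_LENGTH_BP = 1299
PROTEIN_LENGTH_AA = 432


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)          # True
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

The strand field ("-") does not affect these checks, since the span length is the same on either strand.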
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000000000299418 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAAAG GTTTAACCTT AGTTTTAACC CTGGTATTGG TTTCTGTAAT GGCTTTTGCC TCCACAGCTT TAGCTGCTGA CAAGACTTTA ACAATTATGG GAGTCTGGGG AGGCCATGAA AGGGATGCCT TTGAAAAGGT TATTGAAACT TTTGAAACGG CAACCGGAAT TGATGTTCAG TTTGAAGGAA CAAGGGACCT GCCCACTCTG TTAACAACAC GTCTGGAAGC AGGAAACCCG CCTGATATTG TTGCCCTTCC CAATCCCGGT AATATGAAAG AACTTGCTGC TGAAGGTCAT CTGGTTGACC TGAGGAAAGT TCTTGATATG GATACCTTAA GGGAAGATTA CGGACAGACA TGGATTGACT TAGGTTCCTA CAATGATGGT CTATATGGAA TTTTTATTTC TGCAGATGTT AAGAGTTTAG TCTGGTATAA TCCGAAACAG TTTGAAGCTA AAGGTTATGA TATTCCCAAG ACCTGGGATG AGATGGAAAG ATTAATGAAC AACATGGTTG CTAAAGGTGA TATCCCATGG TCTATCGGTC TGGAATCCGG TGCTGCCAGT GGCTGGCCTG GAACTGACTG GATCGAAGAC ATTATGTTAA GAACAGCCGG TCCTGAAGTT TATGACCAGT GGGTAAATCA CGATATTCCC TGGACTGACG AAAGGGTTAA AAAAGCCTTT GAAATTTTTG GTAAAATTGC CCGTAATCCT AAATTCACCT GGGGAGGACC TACTGCTGTA TTAGCTACTA ACTTTGGTGA TGCTGCTAAC CCACTGTTCA CCAATCCTCC ACAGGCATAT ATGCACCGTC AGGCTAGCTT TATCACTGGA TTTATTACAG ATAATAATCC AGACCTTGTT GCCGGTAAAG ACTACAACTG CTTCATTCTT CCCCCAATCA ACGAAGAAGT AGGGACTCCG GTTCTTGGTG CTGCTGATAT GATGGGTATG ATTAATGATA CTCCTGAAGC CAGAGCTTTC ATGAGATATC TCGCCTCTCC TGGAGCCCAG ATGGTCTGGA TTGGGGCTGT TGGTAGTAAA ATCGGTATCA ACAAACGGAT TGACCTCAAT GTATACTCCA GTGAGTTAAT GAAGAATATC GCTAAAGGAT TAAGGGAAGC AGATGTATTC AGGTTTGATG GTTCTGACCT GATGCCCAAG GCTGTTGGTT CTGGTGCCTT CTGGCAGGGT GTAATGGATT ATGTTGGAGG TCAGGATCTT GACAGTGTTC TGGAACATAT CGAATCTGTT GCTGATGATG CCTACGATTC CGGAAAAACT ACAGACTAA
|
Protein sequence | MKKGLTLVLT LVLVSVMAFA STALAADKTL TIMGVWGGHE RDAFEKVIET FETATGIDVQ FEGTRDLPTL LTTRLEAGNP PDIVALPNPG NMKELAAEGH LVDLRKVLDM DTLREDYGQT WIDLGSYNDG LYGIFISADV KSLVWYNPKQ FEAKGYDIPK TWDEMERLMN NMVAKGDIPW SIGLESGAAS GWPGTDWIED IMLRTAGPEV YDQWVNHDIP WTDERVKKAF EIFGKIARNP KFTWGGPTAV LATNFGDAAN PLFTNPPQAY MHRQASFITG FITDNNPDLV AGKDYNCFIL PPINEEVGTP VLGAADMMGM INDTPEARAF MRYLASPGAQ MVWIGAVGSK IGINKRIDLN VYSSELMKNI AKGLREADVF RFDGSDLMPK AVGSGAFWQG VMDYVGGQDL DSVLEHIESV ADDAYDSGKT TD
|
| |
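The gene sequence above begins with GTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code) GTG is an accepted initiation codon and is read as methionine at the first position. The sketch below illustrates this relationship. It is not the database's own method: the function name `translate_cds` is hypothetical, the codon table is built from the standard NCBI layout (table 11 assigns the same amino acids as the standard code and differs in its start codons), and the input is assumed to be the coding strand as displayed in this record, even though the gene lies on the minus strand of the replicon.

```python
# Standard NCBI codon layout: bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding-strand CDS, stopping at the first stop codon.

    Spaces between 10-base blocks (as in the record above) are ignored.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            # Any accepted start codon is read as methionine at position 1.
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate_cds("GTGAAAAAAG GTTTAACCTT AGTTTTAACC"))  # MKKGLTLVLT
```

The printed residues match the first block of the protein sequence in this record. Passing the full gene sequence in the same way would allow the whole translation to be compared with the 432 aa protein sequence.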