Gene Information |
Locus tag | Tpen_1174 |
Symbol | |
ID | 4602040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1115043 |
End bp | 1116551 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639773950 |
Product | extracellular solute-binding protein |
Protein accession | YP_920575 |
Protein GI | 119720080 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
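The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table for Tpen_1174.
start_bp = 1115043
end_bp = 1116551
reported_gene_length = 1509      # bp
reported_protein_length = 502    # aa


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length (1509 bp) and Protein Length (502 aa).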
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGA AAAAAGGCCT GCAGAAGACC ACAGCGATAC TGCTCGTAGT AGTCCTGCTG GTAGGGCTAC TAGCCGGCTA CTTCATAGGA GTTTCCACTG CGCCGAAAGC CCCCGCGGAG GAGGTAGTGC CCAAATCCCA GTACGAGCAG CTACAGAAGG AGCTCGAGTC CGTAAAGGCT CAGCTACAGC AGATGGCCGC GCAGCAGGGC AAGCCTGTCG AGATAGTAAT CACCGCGTGG ACTCAGGGGC CCGAAAGGGA GTCGATATAC AGGCAGCTGA ACCTCGTAGA AGCGGCTAAC AGGCTGAACC AGATATTCAA GGTGGTGGGC GTCCCGGCGA CTGTCAAGGT TGAGGGAGAC TTCTCCACGG CGTCTTGGAC GGATTACAGG AAGAAGGTAT TCCTGGCGCT TGAAGGCGGG ACGGGGCCCT GCATATTCCA GATGGAGCAC GTGTGGTCTG CGGTTCTCGC TGAGAACGGG TGGATAATCC CGCTCGACGA CTACGTGAAG AAGTACTGGA ACTGGACGTA CTATGACATC ATCCCCGGCC TATGGTCCTC CGTGACCTAC AAGGGGAAGA TATGGGGCAT TCCGCAGGAC ACGGAGGCCA GGCCGATCTA CTTCAACAAG CTCCTCCTCA AGAAGCTCGG GTGGACCGAC GAGCAGATAA ACGCACTCCC TGAGAAGATC AGAAGGGGCG AGTTCACGCT CCAGGACATG TTGATGGTAG CGAAGGAGGC TGTCGACAAG GGAGTCGTGG CGCCTGGCTA CGGCATCTGG CACCGCCCGA CTGCTGGCCC TGACTGGCCC ATAGTATACC TGGCATTCGG AGGCAAGCTT TACGACGAGA CCAGCGGGAA GCTAGTAGCG GACATGAAGG TTTGGAAGAA GGTCTTCGAC TGGTTCTACG CGGCCTCGAT GCAGAAGTAT AAGGTGATAA CGGATAAGAT GACGTCGCTC GACTGGAACA GGGACGTTCA CCCAACAATA GTAGCCGGTA AAGTATTGTT CTGGATGGGA GGAACGTGGC ACAAGGGGCA GTGGGTCGGC TCCTTCAACC TCTCCGAGAG CAAGTTCTGG GAAATGTTCG GCTTCGCCCT CTACCCCGCA GGCGAGCCGG GACTTAAGCC TGTCACCCTC TCGCAACCAC AGGCTTACTT CATCTCGAAG ACATGCAAGT ACCCGGAGAT CGCGTTCCTC ATAATAACCC TGGCTACCGA CCCCTACCTC AACTCGTTGC ACGCCGTTAA AAGCGCACAC CTAGCGATAA TGTACCGGCA GCTGTCCGAC CCTGTATACA CGAAGGACAA GTTCCTCGCT ATGACGGGCT ACATGGTAGA GTACGCGCAG TACCAGCCGA TGCACCCGAG ATGGGGAGAC TACAACACGA TAATATTCAA TACGATAAAG GGTATCGAGA CAGGGCAGTT CGACGCCGAC CAGGCTCTGC AGGTCTTCAA GCAGAACCTC CAGTCCACGC TTGGCGATAA CGTAATAATA AAAGAATAA
|
Protein sequence | MSQKKGLQKT TAILLVVVLL VGLLAGYFIG VSTAPKAPAE EVVPKSQYEQ LQKELESVKA QLQQMAAQQG KPVEIVITAW TQGPERESIY RQLNLVEAAN RLNQIFKVVG VPATVKVEGD FSTASWTDYR KKVFLALEGG TGPCIFQMEH VWSAVLAENG WIIPLDDYVK KYWNWTYYDI IPGLWSSVTY KGKIWGIPQD TEARPIYFNK LLLKKLGWTD EQINALPEKI RRGEFTLQDM LMVAKEAVDK GVVAPGYGIW HRPTAGPDWP IVYLAFGGKL YDETSGKLVA DMKVWKKVFD WFYAASMQKY KVITDKMTSL DWNRDVHPTI VAGKVLFWMG GTWHKGQWVG SFNLSESKFW EMFGFALYPA GEPGLKPVTL SQPQAYFISK TCKYPEIAFL IITLATDPYL NSLHAVKSAH LAIMYRQLSD PVYTKDKFLA MTGYMVEYAQ YQPMHPRWGD YNTIIFNTIK GIETGQFDAD QALQVFKQNL QSTLGDNVII KE
|
| |