Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1588 |
Symbol | |
ID | 4600552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1538899 |
End bp | 1540257 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639774361 |
Product | extracellular solute-binding protein |
Protein accession | YP_920986 |
Protein GI | 119720491 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGC AACAAAACAG GTTGGTTTTA GCCTCTATCG CCGTAGGGAT AATAGCTCTC CTAGTGAGCC TCTACGCCGT TGTAACCGTT CAGTCGCTGG CATCCGCGAT TAGCGATCTC AAAGCCTCGA TAAGCAACCT CCAGGGCACG GTTTCGAGCC TCCAGCAACA AGTCGGGAAG ATATCCCAGC AACCCGCGCC CACTACACCC CCCGTACAGA AAGTAAAGCT GGTGGTGATC GGCCCGTGGG CGGGTGACGA GGCGAAGTAC TTCCAGGCCG TGATCGACGC CTACGTAAAG ACGCACCCCA ACGTCGAGAT TGAGTACAGG ACTATGCGCG CCGAGGACGT AGCGGCGACA ATGCCCATAC AGTTCGCCGC CGGCGTAGCT CCTGGAGACG TAATCTTCGG GTGGGCGTGG TTCATCGCCA AGATGGGCAA GGAGGGGCAT CTAGTAGACC TCACAGGCAT AATCAAGGAA AACGAGTACG TACCGGGAAT CGTCGACGCC GTGAAAGCCG ACGGCAAGAT CTGGGGTGCA CCGTTCACGA TGTGGCTAAA GCCTGGCTTC TGGTACAGGA AGTCCTTCTT CCAGAAGTAC GGGCTAACAG AGCCGAAGTC CTACGCCGAG TTCGTGCAAC TACTGGAGAA GATCAAGGGG ATACCCGGCG TGAAGAACCC CATAGCCACG GGAGACGGGG TCGGGTGGCC GATAAGCGAC ATTGTCGAAC ACTTCATAAT CGCGTACGGC GGTCCACAGA TGCAACTAAA CCTCATATCC GGGAAGACTA GGTTCACGGA TCCCAGCGTG AGGAAGGTGT TCTCCGACTA CCTCATACCG CTACTCCAGA AAGGCTACTT CAGCGAGCCG ATAGAGTGGA CCACTGTCAT ACCCAAGTGG TGGGCAGGCG AGTACGGGCT CTACTTCATG GGGACCTGGA TCACGGGGAT GGTCGAAGAC CCGAACGACC TAGACTTCTT CCCCTTACCC GAGAGTAAGG GCGTCGTTGG AGGCGCGGAC TACGCCTTCG TGCCGAAGTA CTCGAGGAAC GTTGACGCCG CGCTCGACTT CATCAAGTAC CTCGCCACGG AGGGGCAGGT CGTACACGCG AGCGTACCTT CCGGTAAGAT CCCGACGTGG ACGAAGGCAC CCGTCGAGAA GCTATGGAAG CCCATGCAGA GTGTATACAC CAAGATCACC GGCAAGGGGC TAGCCATACT GCCTGACCTC GACGACTCTA TCGGGGGAGA CTGGCAGAAG CTCTTCTGGG ATCAGCTGAA GCTGTTGTGG GTCAACCCAG GAGCCCTAGA CTCCGTCCTG AAGACGCTGG AGAGCAGTCA GCCGAAGCCG TCGGGTTAA
|
Protein sequence | MSQQQNRLVL ASIAVGIIAL LVSLYAVVTV QSLASAISDL KASISNLQGT VSSLQQQVGK ISQQPAPTTP PVQKVKLVVI GPWAGDEAKY FQAVIDAYVK THPNVEIEYR TMRAEDVAAT MPIQFAAGVA PGDVIFGWAW FIAKMGKEGH LVDLTGIIKE NEYVPGIVDA VKADGKIWGA PFTMWLKPGF WYRKSFFQKY GLTEPKSYAE FVQLLEKIKG IPGVKNPIAT GDGVGWPISD IVEHFIIAYG GPQMQLNLIS GKTRFTDPSV RKVFSDYLIP LLQKGYFSEP IEWTTVIPKW WAGEYGLYFM GTWITGMVED PNDLDFFPLP ESKGVVGGAD YAFVPKYSRN VDAALDFIKY LATEGQVVHA SVPSGKIPTW TKAPVEKLWK PMQSVYTKIT GKGLAILPDL DDSIGGDWQK LFWDQLKLLW VNPGALDSVL KTLESSQPKP SG
|
| |