Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpet_0485 |
Symbol | |
ID | 5171252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | + |
Start bp | 478691 |
End bp | 480550 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640562994 |
Product | extracellular solute-binding protein |
Protein accession | YP_001244085 |
Protein GI | 148269625 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGGA AACTTGTTTG GTTATCGTTT TTATTACTGA TATTCACGCT TGCTTTTTCT CAAGTAGCCA ATGTGCCAAG GTCGGACTTA CTAATTGTTC AACACTCTCA CGGTAGAGTC TCTGATCCTG CTAATTGCAA TATTTTCACG TCATCCTGGA GATATCCGGC CAGAGGGCTT CATCAGTTGA TCGTTAGACC TCTTTGGATG GTTGATCCTG CGAAAATGGA AATTATCAAT GTTCTTGCAG AAACTAGCCC TATTTACAAC GATGATTTCA CGGAAATGAC TGTTAAACTT CGCAAAGGGA TATATTGGAG TGATGGAGTG GAATTCACTG CCGATGATGT TGTCTTCGGA GTGAAATTAA CGATTCAAAA TGAAGGTATG TCTAACCACG TTCAACTCAA AGAATGGGTC AAAGATGTCG AGGCAAAAGA TAAATACACT GTCGTTTTTA AGCTAAACAA ATCCAATCCT AGATTCCATT ATTATTTTGT CGACAGATGG GGTTGCTGGA GACCATTCCC CAAACATATA TTTGAGAAAG TAGAAGATCC CGTTAAATTC AATTTCTATC CACCTGTTGG AACGGGTCCT TATGTTTTAA AATCCTATGA TCCAAATGGA TATTGGTTTT TGTATGAGAG AAGAGAAGAT TGGGAAAGAA CACCAGATGG TATACTTTAT GGAATGCCAC AACCTCGCTA TGTGCTCTTC CAGGCTTATG ATTCTCCTAG CCAAATGATT CTTGCCATGA GACAACACCA ACTGGACGTC ACATATACAT TTTCTTTGGA AATGGTTAAA TCTTTTTTGA AAATACCTAC TGTTAGAGTC TTCAGAAAAG ATTTCCCTTG GGGTGAAACA CTAGAACCAA CAGTTACAGG CATTACTCTG AATACAATGC GCTATCCTTA CAACATTCGT GATGTAAGAT GGGCATTGGT ACTTGCTATA AACATACTGG AAGTTGCAGA TCTAGTTGCT GATGGGGCTG TTAGATTTGC CCCATTACAT GTGCCACCAG AACCTGTCTG TGAAAAGTAT TACTATACAG CCTTGAAAGA TTTCTTAGAG AACTTCACAT TGCAAATTGA CGATCAGACG GTATTCAAAC CTTTTGATAC CACACTACCC GATAAGCTGG CAGAAATGGC AAGGAAGAAG GGTTATAAAG TCTCCAAAGG TCAACTAGAA GAGTTGTTTG GCATCGGTTG GTGGAGATAT GCTCCTGACG TAGCAGAAAA ATTACTGAAA AAACATGGTT TCAAGAGAAA TGAACAGGGA CAGTGGTTGT TACCCAACGG AACCCCATGG AAGATGGAAA TAATTGTTAA CCCAGACGCA AACAGGCCAG ACAATAGGGT ACCTGCAGCA ATTGCGCAAC AGTGGAAAAA ATTTGGAATC CAAATTGAAA TAAGACCAAC TTCTGATTCT ACTGTGCACG CTTATGGAGA ATTTGATGCG TGCAGTGCCT GGCCAGCAGT AGAAACATGG GGAGGAGTAG CAGATATTTA TAGAACATTA TCTCCTTTCG CGAGCAGATA TCAGCGACCA ATAGGCGAAT TCAACGCTGG TCATGCTTCC AGATGGTCTG ACCCTAGAAT GGATGAAATA TTAGAAAAAA TGAAAAAAAC ATCTCCATTT GATCCAGAAA CAATAGAGTT AGGTAAAGAA GGATTAAAAT TGTTAATTGA AGAAATGCCA AGCATTCCCG CTTTCCAACT GACATGGTTT GTGATTTACG ATGAATACTA CTGGACAAAC TGGTCAACCG TGGAGAATAT CTACGTACAT CCTGTCCATA CTTGGCCTAA TTTCGGTTTT GAATTGCCGT ATCTGAAAAG AACAAAATAA
|
Protein sequence | MKRKLVWLSF LLLIFTLAFS QVANVPRSDL LIVQHSHGRV SDPANCNIFT SSWRYPARGL HQLIVRPLWM VDPAKMEIIN VLAETSPIYN DDFTEMTVKL RKGIYWSDGV EFTADDVVFG VKLTIQNEGM SNHVQLKEWV KDVEAKDKYT VVFKLNKSNP RFHYYFVDRW GCWRPFPKHI FEKVEDPVKF NFYPPVGTGP YVLKSYDPNG YWFLYERRED WERTPDGILY GMPQPRYVLF QAYDSPSQMI LAMRQHQLDV TYTFSLEMVK SFLKIPTVRV FRKDFPWGET LEPTVTGITL NTMRYPYNIR DVRWALVLAI NILEVADLVA DGAVRFAPLH VPPEPVCEKY YYTALKDFLE NFTLQIDDQT VFKPFDTTLP DKLAEMARKK GYKVSKGQLE ELFGIGWWRY APDVAEKLLK KHGFKRNEQG QWLLPNGTPW KMEIIVNPDA NRPDNRVPAA IAQQWKKFGI QIEIRPTSDS TVHAYGEFDA CSAWPAVETW GGVADIYRTL SPFASRYQRP IGEFNAGHAS RWSDPRMDEI LEKMKKTSPF DPETIELGKE GLKLLIEEMP SIPAFQLTWF VIYDEYYWTN WSTVENIYVH PVHTWPNFGF ELPYLKRTK
|
| |