Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpet_0966 |
Symbol | |
ID | 5170994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | + |
Start bp | 998043 |
End bp | 999221 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640563484 |
Product | extracellular solute-binding protein |
Protein accession | YP_001244560 |
Protein GI | 148270100 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000011003 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGAGAC TGCTCGTTTT AATGCTTGTT GTGGTTTCTG CCCTTGTGTT AGCACAAACA AAGCTCACCA TCTGGTGTTC CGAAAAGCAG GTTGACATCC TCCAGAAACT CGGGGAAGAA TTCAAGGCAA AGTACGGAAT CCCTGTTGAA GTTCAGTACG TTGATTTTGG AAGCATCAAA TCCAAATTCC TGACGGCGGC TCCACAGGGA CAGGGTGCAG ACATCATTGT TGGAGCGCAC GACTGGGTAG GAGAACTCGC CGTCAACGGT TTGATCGAAC CCATTCCCAA CTTCTCTGAT CTGAAGAATT TCTATGACAC GGCTCTCAAA GCTTTCTCTT ACGGTGGAAA ACTCTACGGA GTCCCGTACG CCATGGAAGC GGTTGCTCTC ATCTACAACA AGGACTACGT TGATTCTGTT CCTAAGACCA TGGACGAGCT CATAGAAAAA GCAAAACAGA TAGATGAGGA ATACGGAGGA GAAGTCAGAG GTTTCATCTA CGATGTCGCC AACTTCTACT TCTCTGCGCC GTTCATTCTG GGTTACGGAG GATACGTCTT CAAGGAAACA CCTCAGGGAC TCGACGTGAC AGACATTGGA CTCGCGAACG AAGGAGCAAT CAAAGGTGCG AAACTCATAA AGAGAATGAT CGATGAAGGT GTTCTCACCC CGGGTGACAA CTACGGAACG ATGGATTCCA TGTTCAAAGA AGGTCTCGCG GCTATGATCA TCAACGGACC TTGGGCTATA AAATCTTACA AAGACGCGGG TATAAACTAC GGAGTTGCTC CCATTCCTGA GCTCGAACCG GGTGTTCCTG CCAAACCATT CGTTGGTGTT CAGGGATTCA TGATCAACGC CAAGTCTCCA AACAAAGTGA TCGCCATGGA ATTTCTCACG AACTTCATTG CGAGAAAAGA GACCATGTAC AAGATATACC TCGCAGATCC AAGACTTCCT GCAAGAAAAG ATGTCCTCGA ACTCGTCAAA GACAATCCTG ACGTTGTTGC GTTTACCCAG AGTGCTTCCA TGGGAACACC GATGCCAAAC GTGCCGGAAA TGGCTCCTGT CTGGTCTGCC ATGGGAGACG CTCTCAGCAT CATTATCAAC GGACAGGCCA GTGTCGAAGA TGCTCTCAAA GAGGCTGTGG AAAAAATCAA GGCACAGATA GAAAAATAA
|
Protein sequence | MKRLLVLMLV VVSALVLAQT KLTIWCSEKQ VDILQKLGEE FKAKYGIPVE VQYVDFGSIK SKFLTAAPQG QGADIIVGAH DWVGELAVNG LIEPIPNFSD LKNFYDTALK AFSYGGKLYG VPYAMEAVAL IYNKDYVDSV PKTMDELIEK AKQIDEEYGG EVRGFIYDVA NFYFSAPFIL GYGGYVFKET PQGLDVTDIG LANEGAIKGA KLIKRMIDEG VLTPGDNYGT MDSMFKEGLA AMIINGPWAI KSYKDAGINY GVAPIPELEP GVPAKPFVGV QGFMINAKSP NKVIAMEFLT NFIARKETMY KIYLADPRLP ARKDVLELVK DNPDVVAFTQ SASMGTPMPN VPEMAPVWSA MGDALSIIIN GQASVEDALK EAVEKIKAQI EK
|
| |