Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpet_0954 |
Symbol | |
ID | 5171032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | + |
Start bp | 982252 |
End bp | 983487 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640563472 |
Product | extracellular solute-binding protein |
Protein accession | YP_001244548 |
Protein GI | 148270088 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGT ACTTTGTTCT GTTGCTAGCA GTTCTTCTGG TTGGTGGACT CTTCGCTGTG AAAATCACTA TGACATCTGG AGGGGTCGGA AAGGAACTCG AGGTACTGAA AAAGCAGCTG GAGATGTTCC ACCAGCAGTA CCCAGATATC GAAGTGGAAA TCATTCCGAT GCCGGACAGT TCAACTGAAA GGCACGATCT CTACGTCACG TACTTTGCCG CCGGAGAGAC GGATCCAGAC GTTCTCATGC TCGATGTGAT ATGGCCTGCT GAGTTTGCTC CGTTCCTTGA AGATCTGACA GCAGACAAAG ACTACTTCGA ACTCGGTGAA TTCCTACCCG GAACTGTGAT GTCTGTCACG GTCAATGGAA GAATCGTTGC TGTTCCCTGG TTCACAGATG CAGGTCTCCT TTACTACAGA AAAGACCTCC TCGAGAAATA CGGTTACGAT CACGCTCCGA GAACCTGGGA TGAACTCGTC GAAATGGCAA AGAAGATCTC TCAGGCTGAA GGCATCCACG GATTCGTCTG GCAGGGTGCA AGATACGAAG GCCTTGTCTG TGATTTCCTT GAATACCTCT GGTCTTTCGG TGGGGATGTG CTCGATGAGA GTGGAAAAGT TGTGATCGAT TCTCCAGAAG CTGTTGCGGC TCTTCAGTTC ATGGTCGATC TCATCTACAA GCACAAAGTC ACTCCTGAAG GAGTTACCAC CTACATGGAA GAAGACGCAA GAAGAATCTT CCAGAACGGA GAAGCTGTTT TTATGAGGAA CTGGCCGTAC GCCTGGTCCC TCGTGAACAG CGACGAATCC CCAATCAAAG GAAAGGTTGG AGTTGCTCCT CTTCCAATGG GTCCTGGTGG AAGAAGAGCT GCCACACTCG GTGGGTGGGT CCTCGGTATA AACAAATTCT CGTCACCTGA AGAAAAGGAA GCCGCAAAGA AGCTCATAAA GTTCCTCACA AGTTACGACC AGCAGCTCTA CAAAGCGATC AACGCCGGAC AGAATCCAAC GAGAAAAGCC GTTTACAAAG ATCCAAAACT CAAAGAAGCT GCTCCGTTCA TGGTTGAACT TCTCGGAGTT TTCATCAACG CTCTTCCAAG ACCAAGGGTT GCGAACTACA CAGAAGTTTC CGATGTCATT CAGAGGTACG TGCACGCTGC TCTGACAAGA CAGACAACAC CAGAAGACGC AATAAAGAAC ATTGCAAAAG AGCTCAAATT CCTGCTTGGA CAGTAA
|
Protein sequence | MKKYFVLLLA VLLVGGLFAV KITMTSGGVG KELEVLKKQL EMFHQQYPDI EVEIIPMPDS STERHDLYVT YFAAGETDPD VLMLDVIWPA EFAPFLEDLT ADKDYFELGE FLPGTVMSVT VNGRIVAVPW FTDAGLLYYR KDLLEKYGYD HAPRTWDELV EMAKKISQAE GIHGFVWQGA RYEGLVCDFL EYLWSFGGDV LDESGKVVID SPEAVAALQF MVDLIYKHKV TPEGVTTYME EDARRIFQNG EAVFMRNWPY AWSLVNSDES PIKGKVGVAP LPMGPGGRRA ATLGGWVLGI NKFSSPEEKE AAKKLIKFLT SYDQQLYKAI NAGQNPTRKA VYKDPKLKEA APFMVELLGV FINALPRPRV ANYTEVSDVI QRYVHAALTR QTTPEDAIKN IAKELKFLLG Q
|
| |