Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpet_1552 |
Symbol | |
ID | 5171285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | + |
Start bp | 1543354 |
End bp | 1544529 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640564078 |
Product | extracellular solute-binding protein |
Protein accession | YP_001245135 |
Protein GI | 148270675 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000398035 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT TTCTGGTGAT CGCTTTGCTT GTCGTTTCCC TCGTTGTCCT CGCTCAGCCG AAACTCACCA TCTGGTGCTC TGAGAAGCAG GTCGATATCC TTCAAAAACT CGGAGAGGAG TTCAAGGCAA AGTACGGCGT AGAGGTTGAA GTGCAGTACG TGAACTTCCA AGACATCAAG TCCAAGTTCC TCATAGCAGC TCCTGAAGGA CAGGGTGCGG ATATCATCGT TGGAGCACAC GACTGGGTAG GCGAACTCGC AGTCAACGGT TTGATCGAAC CCATTCCGAA CTTCAGTGAT CTGAAGAACT TCTATGAAAC TGCCCTCAAC GCGTTCTCTT ACGGTGGAAA ACTCTACGGT ATTCCCTACG CCATGGAAGC AATAGCACTC ATCTACAACA AGGACTACGT TCCTGAACCC CCAAAGACCA TGGACGAACT CATAGAGACA GCAAAACAGA TCGATGAAGA ATTTGGAGGA GAAGTGAGAG GTTTCATCAC CTCAGCGGCC GAGTTTTACT ACATTGCTCC TTTCATTTTC GGATACGGTG GATACGTATT CAAACAGACA GAAAAAGGAC TGGACGTCAA CGATATCGGA CTGGCCAACG AAGGAGCCAT CAAGGGTGTG AAACTCCTCA AAAGATTGGT TGATGAGGGA ATACTGGATC CCAGTGACAA TTATCAGATC ATGGATTCCA TGTTCAGGGA AGGCCAGGCG GCGATGATCA TCAACGGACC GTGGGCCATT AAGGCGTACA AGGATGCAGG AATAGACTAT GGTGTAGCCC CAATCCCCGA TCTGGAACCT GGCGTTCCTG CAAGACCTTT CGTTGGGGTC CAGGGCTTCA TGGTGAACGC AAAATCCCCA AACAAACTCC TTGCCATCGA ATTCCTGACC AGTTTCATTG CAAAAAAGGA AACGATGTAC AGAATCTACC TTGGAGATCC AAGACTTCCC TCCAGAAAGG ACGTGCTCGA ACTTGTGAAA GATAACCCAG ACGTAGTTGG CTTCACACTG AGCGCAGCCA ACGGTATTCC AATGCCCAAC GTTCCACAGA TGGCCGCTGT CTGGGCCGCT ATGAACGATG CGCTCAATCT CGTTGTGAAC GGAAAAGCAA CGGTCGAAGA AGCGCTCAAA AACGCCGTTG AAAGAATCAA AGCTCAGATT CAGTAA
|
Protein sequence | MKKFLVIALL VVSLVVLAQP KLTIWCSEKQ VDILQKLGEE FKAKYGVEVE VQYVNFQDIK SKFLIAAPEG QGADIIVGAH DWVGELAVNG LIEPIPNFSD LKNFYETALN AFSYGGKLYG IPYAMEAIAL IYNKDYVPEP PKTMDELIET AKQIDEEFGG EVRGFITSAA EFYYIAPFIF GYGGYVFKQT EKGLDVNDIG LANEGAIKGV KLLKRLVDEG ILDPSDNYQI MDSMFREGQA AMIINGPWAI KAYKDAGIDY GVAPIPDLEP GVPARPFVGV QGFMVNAKSP NKLLAIEFLT SFIAKKETMY RIYLGDPRLP SRKDVLELVK DNPDVVGFTL SAANGIPMPN VPQMAAVWAA MNDALNLVVN GKATVEEALK NAVERIKAQI Q
|
| |