Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpet_1677 |
Symbol | |
ID | 5171299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | + |
Start bp | 1672949 |
End bp | 1674916 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640564203 |
Product | extracellular solute-binding protein |
Protein accession | YP_001245258 |
Protein GI | 148270798 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00224475 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCAGGA GAGTTCTGTG GGGCCTTCTT GTAGTAATCT TCGCAGCCCA GATCCTTGCT ATTGGACTCA ACAAAATCGT TCCCGGTGAG TATTACAATC TCACCGACTA CGAACGTCTG ACAGGCAAGA AGATCACGAA ATTCAACGAA GCACCGATGT TGAAAGAGAT GGTTGAGAAA GGACTGCTTC CACCTGTGGA GGAAAGGCTT CCGAAGAATC CGGTTGTGGT GACACCTTAT GAGGAGATAG GTCAATACGG TGGTACCTGG AGAAGAGTAT GGTTCGGCCT TCCGGATCAG CCCAATGTCG ATAAGATCGC TGTCGAGAAA CTCGTGATGT TCGATAAGAC CGGTGGGGTA ATACTTCCGA ATATCCTCGA AGAGTGGCAG GTTAGCAGTG ATGGTAAAAC GTTTGTCTTC AAGATAAGAG AAGGGCTGAA GTGGTCTGAT GGTGTGCCTG TCACCACAGA GGACGTGAGA TTTTGGTATG AAGACATTTT ACTGGATGAA AATCTGACCC CTACGATTCC TTCCTGGCTG ATCGCTGGAG GTAAACCCTT AAAAGTAGAA ATCGTCGATA AGTGTACATT CAAAGTAAAT TTCGAAGTCC CCTATCCTCT GTTTCTCTAT CAGCTAGCAT ACCGGGGACA GGGCGGTTAC GTTTTCGTTG TCCCATCGCA CTATCTGAAA AACTTCCATC CAAAGTATGT CCCGCTTGAA AAACTGACAC AAATGGCGAA GGAAGAAGGA TACGATTACT GGTGGCAACT TTTCGCTGCG AAAGGTACCA ATACCAATGC GTGGATTACG AATCCTGAGC TTCCCGTACT CTATCCATGG AAATTGAAGA AATTGACTGA TTCACAACTC GTCATCGAAA GGAACCCATA CTATTTCAAG GTGGATCCTG AAGGGAATCA GCTTCCATAC ATAGATGAAA TAGTGTTCTA CAGGATTCAA GACAAACAGA TGGCGCTCAT GAAAGCTATG GCTGGAGAAA TAGATATGCA AACCAGGCAC TTTGGAACGG AACAGTTCAC TATATTACTT GAGAACAGGG AAAAAGGTGG CTATAGAGTT TTGAGATGGG TTTGGGGTGT TGGCAGCATA GTAACGTTCT ATGTGAATCA AAATGTGAAA GATCCCGTTC TTAGAGAACT CTTCCAGAAT CCAAAGTTCA GATACGCCCT TTCACTGGCG ATAAACCGAG AAGAAATAGC TACCCTGGTC TTCCACAACC TTGGTGAGCC ACGTCAAGCA TCACTGATCA CAGGTGTTGC TTTCTACGAT CCTGAATGGG AGAAAGCATA TGCGGAATAT AACCCTGAGA AGGCGAACGC TCTCTTGGAT GAAATAGGCC TGACAAAGCG AGATGCCGAG GGTTATAGAA TAAGATCGGA TGGCAAAAGG TTGGAAATAA TAATAGAGTA CTCCGTAACA GACGCTGTTG TTGACGTACT GGAGATGGTA AAACAGTACT GGGAAAATCT GGGTATCAAG GTGCTCCTGA AACCTGAGGA ACGATCGCTC TACATGACAA GGTGTGAAGC AGGAGAGCCT GAAATAGGTG CGTGGTCATT CGACAGATGT GCAGCCGTAT TGAGCGATCC TGGAAGGTTA CTGGGAACAG TGTGGGATGG CCCATGGGCA CCTCTTTATG CAAGGTGGTA CATTTCCGGT GGAAAAGCTG GCGAGGAACC ACCAGAAGGC TCAGACATTA GAAGAATCTA CGAGCTTTGG GACAAAGTAA AAGTAACCGT CGATGAAGAA GAAAGAGACA GACTTTTCAA GGAGCTCATC AACATTCATA AGAAAAATAT CTTCTTCATA GGAACGGTGG GAGAAGTCCA GATACCTGTC ATCGTGAAGG ACAATTTCAG AAATGTCCCT GATGGATTAA TCTTTGATCA TCCTCTCTTC AGTCCAAAGA ATGCCCGACC GGAACAATTC TTCTTTGAAC TGAAATAA
|
Protein sequence | MFRRVLWGLL VVIFAAQILA IGLNKIVPGE YYNLTDYERL TGKKITKFNE APMLKEMVEK GLLPPVEERL PKNPVVVTPY EEIGQYGGTW RRVWFGLPDQ PNVDKIAVEK LVMFDKTGGV ILPNILEEWQ VSSDGKTFVF KIREGLKWSD GVPVTTEDVR FWYEDILLDE NLTPTIPSWL IAGGKPLKVE IVDKCTFKVN FEVPYPLFLY QLAYRGQGGY VFVVPSHYLK NFHPKYVPLE KLTQMAKEEG YDYWWQLFAA KGTNTNAWIT NPELPVLYPW KLKKLTDSQL VIERNPYYFK VDPEGNQLPY IDEIVFYRIQ DKQMALMKAM AGEIDMQTRH FGTEQFTILL ENREKGGYRV LRWVWGVGSI VTFYVNQNVK DPVLRELFQN PKFRYALSLA INREEIATLV FHNLGEPRQA SLITGVAFYD PEWEKAYAEY NPEKANALLD EIGLTKRDAE GYRIRSDGKR LEIIIEYSVT DAVVDVLEMV KQYWENLGIK VLLKPEERSL YMTRCEAGEP EIGAWSFDRC AAVLSDPGRL LGTVWDGPWA PLYARWYISG GKAGEEPPEG SDIRRIYELW DKVKVTVDEE ERDRLFKELI NIHKKNIFFI GTVGEVQIPV IVKDNFRNVP DGLIFDHPLF SPKNARPEQF FFELK
|
| |