Gene Information |
Locus tag | Tpet_1687 |
Symbol | |
ID | 5170690 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | + |
Start bp | 1686824 |
End bp | 1688818 |
Gene Length | 1995 bp |
Protein Length | 664 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640564213 |
Product | extracellular solute-binding protein |
Protein accession | YP_001245268 |
Protein GI | 148270808 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
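The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS length includes the terminating stop codon; the function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length_bp = gene_length(1686824, 1688818)
print(length_bp)                  # 1995
print(protein_length(length_bp))  # 664
```

Under those assumptions the computed values match the Gene Length (1995 bp) and Protein Length (664 aa) fields of the record.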
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.789812 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGATTAC CTCAAAATTT CCGTGGAGGT GGTAGGATGA AGCGTTTTCT GGTGTTTCTT GTGGTTTTGC TCACTTTAAC AGCGGTTTTT GCAACAGAGT TGCCTCCTCT CACACAGTAC AACCTTTCAG ACTTTGAAAA ACTCACCGGC AAGAAGATCA CTCAGTTCAA CGAAGCTCCG ATTCTGAACG AACAGGTAAA GCAGGGCAAG TTACCCCCTG TAGAAGAACG ACTGCCGGAA GATCCCGTTG TTCTCATTCC GTGGGAAAGC ACAGGAAAGT ACGGAGGAAC ATGGAACAGA GCCTGGACAG GACCTGCCGA CAGACCACAG GCCGACAGGT TCATGCTCGA ATCTGCAATG GTGTTTGATC CCCAGGGTAA GGAGCTCTAT CCGAACATCC TTGAGAAGAT CGAAATGTCT TCCGATGGAA AAGAATTCAT CTGCACTCTC CGAAAAGGTT TGAAATGGTC TGATGGAGTT CCTGTTACCA CAGAAGATGT GAGATTCTGG TACGAAGATG TCCTTCTCAA CGAGGAACTC GTTCCTACAT TCCCAAGAGA TCTCATGGCT GGCGGCAAAC CTATGAAACT CGAAATCATA GACGAATACA CCTACAAGAT CACTTTTGAA GAACCTTATC CTCTCTTCCT GTACCTGTAC GCAACGAGAA AGGGAAGCTG GGGAATCCGT GGAATCATGC TTCCAGCGCA TTACCTGAAA CAATTCCATC CAAAGTACGT GCCGCTGGAA AAAATCCAGA AGATGGCAGA AGAAAATGGG TACGACAACT GGACGAACTA CTTCTGGTCT CTCGGTGATC ACAACGCTCA CATCTCCAAT CCCGATCTTC CCGTGCTAGC TGCTTGGAAG TTGAAAGAAA TCACGGATGC AAAACTCGTA ATAGAAAGGA ACCCGTACTA CTGGAAGATA GATCCCGAAG GGAATCAGCT TCCTTATATC GATGAGATCG TATTCTGGAC CGTCCAGGAT AGACAGATGA TACTTCTGAA AGTCATGGCA GGAGAAATCG ATATGCAAGC AAGACACCTG AGTCTGGAAG ACTACACACT CCTTGCCGCT AACGCACAGA AGGGTGGTTA CAAAATCATC AAGTGGAAAC TTGCACGCGG AAGTGATGTT ACACTCTGGT TGAATCAAAA CGTCAAAGAT CCTGTTCTCA GAGAACTCTT CCAGAACATC AAATTCAGAC AGGCACTCTC ACTTGCCATC AATCGTGAAG AGATAAACTC CCTTGTCTAT TACGGTCTCT GTGAACCGAG ACAGGCATCG TTTGTGAGCG GTGTTAAATT CTACGATCCT GAATGGGAAA CAAGATTCGC CGAATACGAC CCTGAGACTG CAAATAAACT TCTGGATGAA ATAGGCCTTA CAAAACGCAA CGCAGAAGGT TACAGATTGA GACCGGACGG TCAACCACTG ATCCTGACGA TCGAATATCC CACGGGTATC TTCGGTGCAT GGGACAAAAC ACTCGAGATG ATAGCTCAAT ACTTCCAAAA AATCGGGATA AAGGTCAATC TGAAACCAGA GGAGAGATCA CTGTACATAA CAAGATGTAA CGGTGGTGAG CCTGAAATAG GCGTCTGGTT CTTTGACAGA AACAAGTATC CAATGCTCGA TCCCGGAAGG CTTCTTGGAA CGGTAACCGA TGGGCCATGG GCACCACTCT ACGGTCAGTG GTACACTTCG GGTGGAAAGG GTGGTGAAGA ACCACCCGAA GGATCCGACA TCAGAAGAAT ATACGAACTC TGGGAAAAGG TCAAAATGAC AGTCGATGAG 
AAAGAAAGAG ACAAACTCTT CAGAGAAGTC ATAAATGTTC ACAAGAAAAA CATTTTCTTC ATAGGAACAG TTGGAGAACC AATCTGGCCG GTTGTTGTGA AGACTTATTT CAAGAATGTA CCTGATTCAC CAGATTTTGT GTGGGAAAAC GAGGGTGATG GACAACACGC TGAACAGTAC TACATGGACA AATAG
|
Protein sequence | MRLPQNFRGG GRMKRFLVFL VVLLTLTAVF ATELPPLTQY NLSDFEKLTG KKITQFNEAP ILNEQVKQGK LPPVEERLPE DPVVLIPWES TGKYGGTWNR AWTGPADRPQ ADRFMLESAM VFDPQGKELY PNILEKIEMS SDGKEFICTL RKGLKWSDGV PVTTEDVRFW YEDVLLNEEL VPTFPRDLMA GGKPMKLEII DEYTYKITFE EPYPLFLYLY ATRKGSWGIR GIMLPAHYLK QFHPKYVPLE KIQKMAEENG YDNWTNYFWS LGDHNAHISN PDLPVLAAWK LKEITDAKLV IERNPYYWKI DPEGNQLPYI DEIVFWTVQD RQMILLKVMA GEIDMQARHL SLEDYTLLAA NAQKGGYKII KWKLARGSDV TLWLNQNVKD PVLRELFQNI KFRQALSLAI NREEINSLVY YGLCEPRQAS FVSGVKFYDP EWETRFAEYD PETANKLLDE IGLTKRNAEG YRLRPDGQPL ILTIEYPTGI FGAWDKTLEM IAQYFQKIGI KVNLKPEERS LYITRCNGGE PEIGVWFFDR NKYPMLDPGR LLGTVTDGPW APLYGQWYTS GGKGGEEPPE GSDIRRIYEL WEKVKMTVDE KERDKLFREV INVHKKNIFF IGTVGEPIWP VVVKTYFKNV PDSPDFVWEN EGDGQHAEQY YMDK
|
| |
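The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned up and summarized is given below; the helper names are hypothetical, and the GC calculation is the plain fraction of G and C bases, which may not be exactly how the GC content field of the record was derived.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two display blocks of the gene sequence shown above.
prefix = clean_sequence("ATGAGATTAC CTCAAAATTT")
print(prefix)               # ATGAGATTACCTCAAAATTT
print(gc_fraction(prefix))  # 0.25
```

Applied to the full gene sequence field, the same helpers would give a value that can be compared against the GC content reported in the record.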