Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpet_0608 |
Symbol | |
ID | 5171644 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | - |
Start bp | 603439 |
End bp | 605415 |
Gene Length | 1977 bp |
Protein Length | 658 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640563116 |
Product | extracellular solute-binding protein |
Protein accession | YP_001244205 |
Protein GI | 148269745 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0360846 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGAAAG TCTTTGTTTT TCTGCTGGTC TCATTGTTCA TTGTGATTGG TCTTTCCTGG AGTGTCTACG CAACACCTGA AGACTACTAC AAAGCCACGG GTAAAAAGAT TGAAAAGTTC AACGAGGCAC CCATGCTTGC CGAACTTGTG AAACAAGGAA AGCTCCCACC GGTCGAGCAG AGGCTTCCAA AGGAACCTCT TGTTGTAGTT CCTGAGGAAA GTGTAGGACA GTACGGTGGC ACCTGGAGGA GAGTCTGGAA AGGGCCTTCT GACAGGTGGG GTATTCCCAG GATCAACCAA GCGAGTCTTG TGTTCTGGGA CAAGAACGGA GAGAAGTTTG TACCGGGAGT AGCGAAAAGC TGGGACATAC TTGAAAATGG AAAAGTATAT GTCTTTCATC TCAGAGAAGG CATGAAGTGG TCCGATGGTC ATCCCTACAC TTCCGAAGAT ATCCTTTTCT GGGTCGACGA TATACTGGGG AACGATGAAC TCACCCCTGC GAAACCCGCC TGGTACAGAC TTCTCGACAG GGTGGAAGCT CCTGATCCCT ACACGGTGAA ATTCGTGTTC AAACAACCGT ACGCTCTGTT TCTGCTTCAG GTGGCGAACA GAGGATTCAC AGGCTCTCCC AAACATTTCT TGAAGCAATT CCACCCAAAT TACACCCCTA TGGAAGAGAT CGAAAAGAAA ATGGTTGAGG GTGTTCACAA CACATGGGTA GATCTTTTCA ATGACAAAAG TGATTTTCTC GAAAGTCTTG ATCTGCCCGT TTTGACACCC TGGAAACCCA TCACAGATCC AACAGAACAG TTCTACATAC TCGAGAGAAA TCCATACTTC TGGGCGGTTG ACATCGAAGG GAATCAACTG CCTTACATCG ATAGGATCAG ACACGAATAC GTTCAAAGTA GTGAGGTTAT CATGTTGAAA GCAATCTCCG GAGAAATCGA TATGCAGTGG AGGCACATTG GTCTTCTGGG ACCCGGCCCG GGTGTTTTGC CGCTTCTTCT CGAAAACGCC AAGAGCGGTG GTTACAAGGT TTTGAGATGG AAAACCGACA ATGGATCCGT GAGCATGGTG ATGCTGAACA TCTCCGATCC ACCAGATCCT GTACTCGGAG AAGTTTTCAG GGATGTGAGA TTCAGACAGG CGCTTTCACT TGCTATCAAC AGAGAGGAGA TCAACGAGAT TCTCTTCAAC GGACTCGCAG AACCAAGGCA AGCCTCGTTC GTGAGTGGAT CTCCGTACTA CGATCCCGAA TGGGAGAAGG CGTATGTGGA GTATGATCCA GACAGAGCAA ACAAACTCCT CGACGAAATG GGACTGAAGT GGAACAGCAA GCACGAGTAC AGACTTCTTC CCAATGGAAA ACCTCTGAGA TTCACCGTAC AGGTCACTGG TCAGACCCAC GTTGATGTCT GGACGATGGT GAAAGAATAC TGGAAACAGA TAGGTGTGTG GGTCGAAATC GAAAACCTCG AAAGGTCACT CTATGATTCG AGGCTCAGTG CACACGATTT CGATGCACAG GCTTGGGTGA TGGACAGGGC AAGTCAGCCC CTTGTGGATC CCCTGTGGAT CATTCCTGGA AGCACGGAGT ACGCTTCCGC GTGGTACATT GGCTGGGCTG ATTGGGCTGG TTCTTACCTT GAAGGAGAAG AATCTCTGAA GGAATATCTA CAGCAGGAAG ATGCGATCGT TCCACCCGAG GGTATAAAGG AGACTCTCGA AAAGCTACTC GACGTATGGA AGGAGATTCA AAACACTTCC GATCCTGAAA AGATCAAGGA ACTCATGAAA GAAGTAACGA AGATCCACAG GGAAAATCTC TGGATGATAG GAACAGTGGG CGAAGATATA TCTCCTGCCA TCGTTAAGAA CAACTTCAAG AACGTACCGG AGGAACTGGT GACGGCAACG CCGTTCTTCA GTCCATGGAA CGCCATGCCG ATACAATTCT ACATAGAACA GAAATGA
|
Protein sequence | MRKVFVFLLV SLFIVIGLSW SVYATPEDYY KATGKKIEKF NEAPMLAELV KQGKLPPVEQ RLPKEPLVVV PEESVGQYGG TWRRVWKGPS DRWGIPRINQ ASLVFWDKNG EKFVPGVAKS WDILENGKVY VFHLREGMKW SDGHPYTSED ILFWVDDILG NDELTPAKPA WYRLLDRVEA PDPYTVKFVF KQPYALFLLQ VANRGFTGSP KHFLKQFHPN YTPMEEIEKK MVEGVHNTWV DLFNDKSDFL ESLDLPVLTP WKPITDPTEQ FYILERNPYF WAVDIEGNQL PYIDRIRHEY VQSSEVIMLK AISGEIDMQW RHIGLLGPGP GVLPLLLENA KSGGYKVLRW KTDNGSVSMV MLNISDPPDP VLGEVFRDVR FRQALSLAIN REEINEILFN GLAEPRQASF VSGSPYYDPE WEKAYVEYDP DRANKLLDEM GLKWNSKHEY RLLPNGKPLR FTVQVTGQTH VDVWTMVKEY WKQIGVWVEI ENLERSLYDS RLSAHDFDAQ AWVMDRASQP LVDPLWIIPG STEYASAWYI GWADWAGSYL EGEESLKEYL QQEDAIVPPE GIKETLEKLL DVWKEIQNTS DPEKIKELMK EVTKIHRENL WMIGTVGEDI SPAIVKNNFK NVPEELVTAT PFFSPWNAMP IQFYIEQK
|
| |