Gene Information |
Locus tag | Tpet_0607 |
Symbol | |
ID | 5171558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | - |
Start bp | 601377 |
End bp | 603395 |
Gene Length | 2019 bp |
Protein Length | 672 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640563115 |
Product | Beta-galactosidase |
Protein accession | YP_001244204 |
Protein GI | 148269744 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
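The start, end, gene length, and protein length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: `check_cds_lengths` is a hypothetical helper, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length (neither is stated in the record itself).

```python
def check_cds_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the CDS ends with exactly one stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Start bp and End bp as listed for Tpet_0607.
gene_len, protein_len = check_cds_lengths(601377, 603395)
print(gene_len, protein_len)
```

Under these assumptions the result agrees with the listed Gene Length (2019 bp) and Protein Length (672 aa).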
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00254825 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGGTAAATC CGAAACTTCC TGTGATCTGG TACGGAGGAG ACTACTACCC GGAACAGTGG GACGAGGAAA CGTTCGAAAG AGATATCAGG ATGTTCAAAG AAGCAGGTAT CAACGTGGTC ACCCTCGGGG TTTTCTCATG GAGTCTTATC CAACCAGACG AGAACACTTA CGATTTTTCG TTCTTCGAAA AAGTGATGGA TCGTCTGTAC AAAGAAGGTA TCTATGTCTG TCTTGCAACT CCCACATCCG CTCCCCCTCA CTGGATGACG CAGAAATACC CTGAAATCCT TCTCACAGAC GTGAACGGTG TAAAGCGAGA GAAGGGAGGC AGACAGAACT TTTGTCCCAA CAGTGAGAAG TACAGATATT TTGCAAGAAA CATCGCAGAG AAACTCGCAG AACATTTCAA GGATCACCCT GCTCTCGTTC TTTGGCACGT GAACAACGAA TATTTGAACT ACTGCTACTG TGATATCTGC AGGGGGAAGT TTCAGAATTG GTTGAAGGAA AAGTACGGCA CACTCGATGA ATTGAACAGA AGATGGAACA CGCGTTTCTG GTCCCAGACC TTCACCGCCT GGGAAGAGAT TCCTGTTCCA ACAACGAGAA GTGTTCTCTT CTGGAGAAAG GACAGGTATC AATCTGTTCT TCCGGAACTT TATCTGGATT ACAGACGCTT CATGACACAG AGTATGCTCG AGTGTTTCCT GGAGGAATAC AGGGCAATAA AGAAACACAC TCCAGACATA CCCGTCACCA CGAATCTGAT TGCGGCAACT TTCAAGGAGT GGAACTATTT CGAATGGGCG AATCATATGG ACGTTGCAGC ATGGGACAAC TATCCCGGTT ACAAAGAAGA TTTCTCCGTG ATTTCTTTGA GACACTCTCT GATACGCGGT TTGAAAGAAG GAAAACCCTT CGTCCTAATG GAACAGTCTC CTTCGCAGGC GTGCTGGAGA TGGTACAATC CCCAGAAAAG ACCAGGAGAG ATGAGGCTCT GGAGTTATCA CGCTCTCGCT CACGGTGCAG AGACTCTCAT GTTCTTTCAG CTGAGACAAT CAAAGGGAGG AGTAGAAAAA TTCCACGGGG CTGTTATCAC GCATGTCGAC AGTCCGAACA CAAGGGTCTT TCAAGAAGTG AAAAGAGTGG GAGAAGAGAT AAAGTCCTTA TCGGATATTC TCGACACAAG GGTTGTCTCA CAGACTGCTC TTCTTTTCGA CTGGGAAAGC TGGTGGGCGA TGGAAGACAC TCTGACACCG AACATAGACT TCAAGTACCT GAACGAAGCA GAAAAGTATT TCAAGGCACT TGTGAAAACC GGCCTTGGAG TCGATGTTGT TGGTAAAGAT CAGAATTTTG AGGGATACAG GATCATCGTT GCTCCGGCGC TTTACATGGT GGACGAAGAA CTTGCACGGA AGCTGAAAGA TTACGTCTCG AACGGAGGAA TCTTGATCCT CACAACAATG AGCGGTATTG TCGATGAAGA CGATCAGGTT GTTCTGGGAG GTTATCCAGG TTATCTTCGC GATCTTATGG GGGGCTATGT GGAAGAAATA GATGCTTTGC CACCTGAGGA GAAAAACGAA ATCTTGATGT TTGGCAAAAG GTACGAGTGT AGTTTGGTGT TTGACTACAT AAAAACCACC ACAGCGGAAG TGCTCGGAAG ATTTGGAAGG GACTATTACA TGAACGAACC TGCCGTGTTG AGAAACAGAT ACGGTGGTGG ATGGGTTTAC TACGTAGGTT CTTCCGCGTC TTTCGATCTT GTATGGGATC TTGTGAAGTT CATAGAGAGA 
GAACACAATC TCAGAGCAGA GATCACACCG CCAGATGGTG TAGAGGTGAT AAAGAAAGTC AAAGGAGAGA AGACCTTCTA CTTTCTCTTC AATCACTCTC ATGAACCGCA GATCGTTGAC TTGCCGGAAG GAACGTTCAG GGATCTGATC AAAGGAAATT TGTATGAAAA AAAGGCAAGA CTGCAACCGC TCGATGTCTT GATCCTTCTG AAAGAGTGA
Protein sequence | MVNPKLPVIW YGGDYYPEQW DEETFERDIR MFKEAGINVV TLGVFSWSLI QPDENTYDFS FFEKVMDRLY KEGIYVCLAT PTSAPPHWMT QKYPEILLTD VNGVKREKGG RQNFCPNSEK YRYFARNIAE KLAEHFKDHP ALVLWHVNNE YLNYCYCDIC RGKFQNWLKE KYGTLDELNR RWNTRFWSQT FTAWEEIPVP TTRSVLFWRK DRYQSVLPEL YLDYRRFMTQ SMLECFLEEY RAIKKHTPDI PVTTNLIAAT FKEWNYFEWA NHMDVAAWDN YPGYKEDFSV ISLRHSLIRG LKEGKPFVLM EQSPSQACWR WYNPQKRPGE MRLWSYHALA HGAETLMFFQ LRQSKGGVEK FHGAVITHVD SPNTRVFQEV KRVGEEIKSL SDILDTRVVS QTALLFDWES WWAMEDTLTP NIDFKYLNEA EKYFKALVKT GLGVDVVGKD QNFEGYRIIV APALYMVDEE LARKLKDYVS NGGILILTTM SGIVDEDDQV VLGGYPGYLR DLMGGYVEEI DALPPEEKNE ILMFGKRYEC SLVFDYIKTT TAEVLGRFGR DYYMNEPAVL RNRYGGGWVY YVGSSASFDL VWDLVKFIER EHNLRAEITP PDGVEVIKKV KGEKTFYFLF NHSHEPQIVD LPEGTFRDLI KGNLYEKKAR LQPLDVLILL KE
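The sequences above are displayed in space-separated blocks of ten characters. As a rough sketch of how a GC content figure could be derived from text in that layout, the hypothetical helper `gc_percent` below strips the whitespace and counts G and C bases. The sample input is only the first two blocks of the gene sequence, so its result is not expected to match the whole-gene value in the GC content field.

```python
def gc_percent(blocked_sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the spaces between 10-base blocks, line breaks) is ignored.
    """
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence, used here only as sample input.
first_blocks = "GTGGTAAATC CGAAACTTCC"
print(f"{gc_percent(first_blocks):.0f}%")
```

Passing the complete gene sequence text instead of `first_blocks` would give the figure to compare against the record.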