Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpet_1559 |
Symbol | |
ID | 5170333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | + |
Start bp | 1554653 |
End bp | 1557907 |
Gene Length | 3255 bp |
Protein Length | 1084 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640564085 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001245142 |
Protein GI | 148270682 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000116157 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTACG AATGGGAAAA CCCACAGCTT GTGGGTGAAG GAACGGAGAA GCCACACGCT TCTTTCATAC CCTATCTGGA CCCGTTCAGC GGGGAATGGG AGTACCCTGA GGAATTCATC TCTTTGAACG GGAACTGGGG GTTTCTCTTC GCGAAAAATC CCTTCGAGGT GCCGGAGGAT TTCTTTTCAG AGAACTTCGA CGACTCGAAC TGGGATGAGA TAGAAGTTCC AAGCAACTGG GAGATGAAAG GATATGGGAA GCCCATCTAC ACGAACGTGG TTTATCCATT TGAACCGAAC CCTCCTTTTG TTCCAAAAGA CGACAATCCG ACCGGGGTGT ACAGAAGGTG GATCGAGATA CCTGAGGATT GGTTCAAAAG GGAGATCTTT CTGCATTTTG AAGGTGTTCG ATCCTTCTTC TATCTGTGGG TGAACGGGAA GAAGATCGGT TTCAGCAAAG ACAGCTGCAC ACCCGCTGAA TTCAGACTCA CCGATGTTCT AAGGCCAGGG AAGAATCTGA TCACCGTTGA GGTTCTGAAG TGGAGCGATG GAAGCTATCT CGAAGATCAG GACATGTGGT GGTTTGCGGG GATATACAGG GACGTTTATC TGTACTCGCT GCCGAAATTT CACATCAGGG ACGTGTTCGT GAGAACGGAT CTGGATGAAA ATTACAGAGA CGGAAAGATC TTTCTGGACG TAGAGATGAG AAATCTCGGT GAGGAAGAAG AAAAAGACCT TGAAGTAACA CTCATCACAC CGGATGGAGA CGAAAAAACA CTCGTGAAAG AGACAGTAAA GCCGGAGGAC AGAGTCCTTT CCTTTGCCTT TGACGTGAAA GATCCGAGGA AGTGGTCCGC GGAGACACCA CATCTGTACG TTTTGAAACT GAAACTGGGA GAAGACGAAA AGAAAGTCAA CTTCGGATTC AGGAAGATAG AGATAAAAGA CGGAATGCTT CTTTTCAACG GGAAACCTCT CTACATAAAG GGAGTGAACA GACACGAGTT CGATCCTGAC AGGGGTCATG CGGTGACGGT GGAGAGGATG ATTCAGGACA TAAAACTCAT GAAGCAGCAC AACATAAACA CAGTTCGCAC ATCGCACTAT CCGAACCAGA CGAAGTGGTA CGATCTGTGT GACTATTTTG GACTCTACGT GATAGACGAG GCAAACATCG AATCCCACGG TATAGGCTGG GATCCTGAAG TGACACTTGC GAACAGACCG GAATGGGAGA AGGCACATCT CGACAGAATC CAGAGGATGG TCGAGCGTGA CAAGAATCAT CCGTGTGTTA TCTTCTGGTC TCTTGGAAAC GAAGCGGGAG ACGGAGTGAA TTTCGAAAAA GCCGCTCTCT GGATAAAGGA AAGAGACAAC ACGCGGCTTA TCCATTACGA GGGAACAACA AGGAGGGGAG AATCGTACTA CGTGGATGTT TTCTCTCTCA TGTACCCGAA GATAGACGTT CTTCTTGAGT ACGCCTCCAG AAAGAGGGAA AAGCCTTTCA TCATGTGTGA GTATGCCCAC GCGATGGGAA ACAGTGTGGG AAATCTGAAG GACTACTGGG ATGTGATAGA AAGGTATCCG TATCTTCACG GAGGGTGCAT CTGGGACTGG GTGGACCAGG GAATCAGGAA GAAGGATGAA AACGGAAAGG AATTCTGGGC GTACGGTGGA GATTTCGGCG ATGAACCAAA CAACAAGAAT TTCTGCTGCA ACGGAGTGGT CCTTCCCGAC AGAACACCCG AGCCAGAGCT TTACGAGGTG AAGAAAATCT ATCAGAACAT CAAAGTGCGT CAGATCTCAA TAGACACCTA CGAAGTGGAG AACGGGTATC TTTTCACAGA CCTTGAGATG TTCGATGGAA CTTGGAGGAT CAGAAAGGAC GGTGAAGTGA TAGAAGAAGG AAGGTTCAAA CTCTCAGCCG AGCCAGGAGA AAGGAAAATT TTTAAGATAC CACTTCCAGT GATGGAAGAC TCGGAATACT TCCTTGAGAT CTGTTTTGCT CTCTCTGAAG ATACCCTCTG GGCGAAGAAG GGACACGTTG TAGCGTGGGA ACAGTTCCTC CTGAAACCTC CCATCTTTCA AAAAAGCATT GTTCAGGAAA AAGTAGATTT CTCAGAAGAT GGAAGATACC TTCTGGTGAG GACAAAAGAC GCGGAGTTTA TCTTCTCGAA ACTCACCGGC CTTCTGGAGC ACATCGTGTA CAGGGGAAGA AATATCCTGA CAGGATCGAT CGTTCCAAAT TTCTGGAGAG TTCCAACGGA CAACGATATC GGAAACAGAA TGCCGGAAAG ACTCTCCATA TGGAAAAAGG CATCGAGCGA AAGAAAGCTC TTCAAGATGT TCTGGAAGAG AAGGGAAAAC AGCGTTTCCG TTCAGAGTGT CTATCAGGTA CCCGGAAACA GCTGGGTGTA CCTCACCTAC ACCATCTTTG GAAACGGTGA CATCCTCGTG GATCTTTCCC TGATTCCCGC AGAAGGTGTA CCGGAGATTC CAAGGATCGG TCTTCAGTTC ACGGTCCCTG AAGAGTTCGG CACCGTGGAG TGGTACGGAA GGGGACCGCA CGAGACTTAC TGGGACAGAA AAGAAAGTGG CCTTTTCGCA AGGCACAGAA AAGCTGTCGA TGAGATGATA CACAGGTACG TCAGGCCCCA GGAAACGGGG AACAGATCGG ACGTGAGATG GTTTGCGCTT TCCGACGGTG AAACAAAACT CTTTGTGTCA GGCATGCCGC AGATAGACTT CAGCGTCTGG CCCTTTTCCA TGGAGGATCT CGAGAGGGCT CAGCACATAA GTGAACTCCC GGAGAGGGAC TTCGTCACCG TGAACGTGGA CTTCAGACAG ATGGGCCTTG GAGGAGACGA CAGCTGGGGT GCGATGCCTC ATCTGGAGTA CAGGCTTCTA CCAAAGCCGT ATCGTTTTTC TTTCAGAATG AGGATTAGCA AAGAGATTCC ATCCTGGAGG GTTCTTGCGG CGATCCCTGA AACGCTCCAT GTTGAGATGT CCTCAGAAGA CGTGATACGC GAAGGAGACA CCCTGAGAGT GAAATTTTCC CTTCTGAACG ACACTCCACT GAGCAAGGAA GAACAGGTGG TTCTCTTTGT TGATGGAAAC GAATACTCGG TGAGGCGAGT GGTGATTCCA CCCTTCAAGA AGGAAGAGCT GGTGTTCAAA GTAGAAGGAT TGAAGAAGGG AGAACATCTG ATACATACTA ATCTGAACAC GAGAAAAACT ATCTACGTGA GGTGA
|
Protein sequence | MSYEWENPQL VGEGTEKPHA SFIPYLDPFS GEWEYPEEFI SLNGNWGFLF AKNPFEVPED FFSENFDDSN WDEIEVPSNW EMKGYGKPIY TNVVYPFEPN PPFVPKDDNP TGVYRRWIEI PEDWFKREIF LHFEGVRSFF YLWVNGKKIG FSKDSCTPAE FRLTDVLRPG KNLITVEVLK WSDGSYLEDQ DMWWFAGIYR DVYLYSLPKF HIRDVFVRTD LDENYRDGKI FLDVEMRNLG EEEEKDLEVT LITPDGDEKT LVKETVKPED RVLSFAFDVK DPRKWSAETP HLYVLKLKLG EDEKKVNFGF RKIEIKDGML LFNGKPLYIK GVNRHEFDPD RGHAVTVERM IQDIKLMKQH NINTVRTSHY PNQTKWYDLC DYFGLYVIDE ANIESHGIGW DPEVTLANRP EWEKAHLDRI QRMVERDKNH PCVIFWSLGN EAGDGVNFEK AALWIKERDN TRLIHYEGTT RRGESYYVDV FSLMYPKIDV LLEYASRKRE KPFIMCEYAH AMGNSVGNLK DYWDVIERYP YLHGGCIWDW VDQGIRKKDE NGKEFWAYGG DFGDEPNNKN FCCNGVVLPD RTPEPELYEV KKIYQNIKVR QISIDTYEVE NGYLFTDLEM FDGTWRIRKD GEVIEEGRFK LSAEPGERKI FKIPLPVMED SEYFLEICFA LSEDTLWAKK GHVVAWEQFL LKPPIFQKSI VQEKVDFSED GRYLLVRTKD AEFIFSKLTG LLEHIVYRGR NILTGSIVPN FWRVPTDNDI GNRMPERLSI WKKASSERKL FKMFWKRREN SVSVQSVYQV PGNSWVYLTY TIFGNGDILV DLSLIPAEGV PEIPRIGLQF TVPEEFGTVE WYGRGPHETY WDRKESGLFA RHRKAVDEMI HRYVRPQETG NRSDVRWFAL SDGETKLFVS GMPQIDFSVW PFSMEDLERA QHISELPERD FVTVNVDFRQ MGLGGDDSWG AMPHLEYRLL PKPYRFSFRM RISKEIPSWR VLAAIPETLH VEMSSEDVIR EGDTLRVKFS LLNDTPLSKE EQVVLFVDGN EYSVRRVVIP PFKKEELVFK VEGLKKGEHL IHTNLNTRKT IYVR
|
| |