Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_1625 |
Symbol | |
ID | 6093074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 1643294 |
End bp | 1646548 |
Gene Length | 3255 bp |
Protein Length | 1084 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642488826 |
Product | glycoside hydrolase family 42 protein |
Protein accession | YP_001739644 |
Protein GI | 170289406 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00573488 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTACG AATGGGAAAA CCCACAGCTT GTGGGTGAAG GAACGGAGAA GCCACACGCT TCTTTCATAC CCTATCTGGA CCCGTTCAGC GGGGAATGGG AGTACCCTGA GGAATTCATC TCTTTGAACG GGAACTGGGG GTTTCTCTTC GCGAAAAATC CCTTCGAGGT GCCGGAGGAT TTCTTTTCAG AGAACTTCGA CGACTCGAAC TGGGATGAGA TAGAAGTTCC AAGCAACTGG GAGATGAAAG GATATGGGAA GCCCATCTAC ACGAACGTGG TTTATCCATT TGAACCGAAC CCTCCTTTTG TTCCAAAAGA CGACAATCCG ACCGGGGTGT ACAGAAGGTG GATCGAGATA CCTGAGGATT GGTTCAAAAG GGAGATCTTT CTGCATTTTG AAGGTGTTCG ATCCTTCTTC TATCTGTGGG TGAACGGGAA GAAGATCGGT TTCAGCAAAG ACAGCTGCAC ACCCGCTGAA TTCAGACTCA CCGATGTTCT AAGGCCAGGG AAGAATCTGA TCACCGTTGA GGTTCTGAAG TGGAGCGATG GAAGCTATCT CGAAGATCAG GACATGTGGT GGTTTGCGGG GATATACAGG GACGTTTATC TGTACTCGCT GCCGAAATTT CACATCAGGG ACGTGTTCGT GAGAACGGAT CTGGATGAAA ATTACAGAGA CGGAAAGATC TTTCTGGACG TAGAGATGAG AAATCTCGGT GAGGAAGAAG AAAAAGACCT TGAAGTAACA CTCATCACAC CGGATGGAGA CGAAAAAACA CTCGTGAAAG AGACAGTAAA GCCGGAGGAC AGAGTCCTTT CCTTTGCCTT TGACGTGAAA GATCCGAGGA AGTGGTCCGC GGAGACACCA CATCTGTACG TTTTGAAACT GAAACTGGGA GAAGACGAAA AGAAAGTCAA CTTCGGATTC AGGAAGATAG AGATAAAAGA CGGAATGCTT CTTTTCAACG GGAAACCTCT CTACATAAAG GGAGTGAACA GACACGAGTT CGACCCCGAC AGGGGTCATG CGGTGACAGT GGAGAGAATG ATTCAGGACA TAAAGCTAAT GAAGCAGCAC AACATAAACA CAGTTCGCAC ATCACACTAT CCGAACCAGA CGAAGTGGTA CGATCTGTGT GACTATTTTG GACTCTACGT GATAGACGAG GCAAACATCG AATCTCATGG AATCGGTGAA GCTCCTGAAG TGACCCTTGC GAACAGACCG GAATGGGAGA AGGCACATCT CGACAGGATC AAAAGGATGG TCGAGCGCGA CAAGAATCAT CCCAGTATCA TCCTCTGGTC TCTCGGGAAC GAAGCGGGAG ATGGAGTGAA TTTCGAAAAA GCCGCTCTCT GGATAAAGAA AAGGGACAAC ACGCGACTTA TCCATTACGA GGGAACAACA AGGAGGGGAG AATCGTACTA CGTCGATGTT TTCTCCCTCA TGTATCCGAA GATGGACGTT CTTCTTGAGT ACGCTTCAAA GAAGAGGGAA AAGCCCTTCA TCATGTGTGA ATACGCGCAC GCGATGGGAA ACAGTGTGGG AAATCTGAAG GACTACTGGG ACGTGATAGA AAAGTATCCG TACCTTCATG GAGGTTGCAT CTGGGACTGG GTGGATCAGG GAATAAGAAA GAAGTACGAG AACGGAAGAG AATTCTGGGC GTACGGTGGT GACTTCGGTG ACACACCGAA TGACGGAAAC TTCTGTATAA ACGGTGTGGT ACTGCCCGAT AGAACACCTG AGCCGGAACT TTACGAGGTG AAGAAGGTCT ATCAGAACGT CAAAATAAGA CAAGTATCGA AAGACACTTA TGAAGTGGAA AACAGGTATC TATTCACGAA CCTTGAGATG TTCGATGGCG CCTGGAAAAT CAGAAAAGAC GGAGAGGTCA TCGAAGAAAA AACCTTCAAG ATCTTCGCTG AACCTGGAGA AAAGCGTCTC TTGAAGATAC CACTCCCAGA AATGGACGAT TCTGAGTATT TCCTCGAGAT CAGTTTCTCC CTTTCTGAGG ATACCCCCTG GGCGGAAAAA GGCCACGTTG TGGCGTGGGA GCAGTTCCTT CTGAAAGCAC CGGCATTTGA GAAAAAATCC ATTTCAGATG GAGTATCACT CAGAGAAGAG GGGAAACATC TCACCGTTGA AGCAAAAGAC ACGGTGTATG TATTCTCAAA ACTCACAGGC CTTCTGGAGC AGATCCTTCA CAGAGGAAAA AAGATCCTGA AAAGTCCCGT TGTTCCCAAC TTTTGGAGAG CTCCAACCGA CAACGACATC GGAAACAGAA TGCCGCAGAG GCTCGCCATC TGGAAGAGGG CATCGAAAGA GAGAAAACTC TTCAAAATGC ACTGGAAAAA GGAAGAAAAT CGTGTTTCTG TCCATAGTGT TTTTCAGCTT CCGGGAAACA GCTGGGTGTA CACAACATAC ACCGTTTTCG GAAATGGAGA CATCCTAGTG GACCTTTCTC TGATTCTCGC TGAGGATGTA CCGGAGATTC CAAGGATCGG TCTTCAGTTC ACGGTCCCTG AAGAGTTCGG CACCGTGGAG TGGTACGGAA GGGGACCGCA CGAGACTTAC TGGGACAGAA AAGAAAGTGG CCTTTTCGCA AGGCACAGAA AAGCTGTCGA TGAGATGATA CACAGGTACG TCAGGCCCCA GGAAACGGGG AACAGATCGG ACGTGAGATG GTTTGCGCTT TCCGATGGTG AAACAAAACT CTTTGTGTCA GGCATGCCGC AGATAGACTT CAGCGTCTGG CCCTTTTCCA TGGAGGATCT CGAGAGGGCT CAGCACATAA GTGAACTCCC GGAGAGGGAC TTCGTCACCG TGAACGTGGA CTTCAGACAG ATGGGCCTTG GAGGAGACGA CAGCTGGGGT GCGATGCCTC ATCTGGAGTA CAGGCTTCTA CCAAAGCCGT ATCGTTTTTC TTTCAGAATG AGGATTAGCA AAGAGATTCC ATCCTGGAGG GTTCTTGCGG CGATCCCTGA AACGCTCCAT GTTGAGATGT CCTCAGAAGA CGTGATACGC GAAGGAGACA CCCTGAGAGT GAAATTTTCC CTTCTGAACG ACACTCCACT GAGCAAGGAA GAACAGGTGG TTCTCTTTGT TGATGGAAAC GAATACTCGG TGAGGCGAGT GGTGATTCCA CCCTTCAAGA AGGAAGAGCT GGTGTTCAAA GTAGAAGGAT TGAAGAAGGG AGAACATCTG ATACATACTA ATCTGAACAC GAGAAAAACT ATCTACGTGA GGTGA
|
Protein sequence | MSYEWENPQL VGEGTEKPHA SFIPYLDPFS GEWEYPEEFI SLNGNWGFLF AKNPFEVPED FFSENFDDSN WDEIEVPSNW EMKGYGKPIY TNVVYPFEPN PPFVPKDDNP TGVYRRWIEI PEDWFKREIF LHFEGVRSFF YLWVNGKKIG FSKDSCTPAE FRLTDVLRPG KNLITVEVLK WSDGSYLEDQ DMWWFAGIYR DVYLYSLPKF HIRDVFVRTD LDENYRDGKI FLDVEMRNLG EEEEKDLEVT LITPDGDEKT LVKETVKPED RVLSFAFDVK DPRKWSAETP HLYVLKLKLG EDEKKVNFGF RKIEIKDGML LFNGKPLYIK GVNRHEFDPD RGHAVTVERM IQDIKLMKQH NINTVRTSHY PNQTKWYDLC DYFGLYVIDE ANIESHGIGE APEVTLANRP EWEKAHLDRI KRMVERDKNH PSIILWSLGN EAGDGVNFEK AALWIKKRDN TRLIHYEGTT RRGESYYVDV FSLMYPKMDV LLEYASKKRE KPFIMCEYAH AMGNSVGNLK DYWDVIEKYP YLHGGCIWDW VDQGIRKKYE NGREFWAYGG DFGDTPNDGN FCINGVVLPD RTPEPELYEV KKVYQNVKIR QVSKDTYEVE NRYLFTNLEM FDGAWKIRKD GEVIEEKTFK IFAEPGEKRL LKIPLPEMDD SEYFLEISFS LSEDTPWAEK GHVVAWEQFL LKAPAFEKKS ISDGVSLREE GKHLTVEAKD TVYVFSKLTG LLEQILHRGK KILKSPVVPN FWRAPTDNDI GNRMPQRLAI WKRASKERKL FKMHWKKEEN RVSVHSVFQL PGNSWVYTTY TVFGNGDILV DLSLILAEDV PEIPRIGLQF TVPEEFGTVE WYGRGPHETY WDRKESGLFA RHRKAVDEMI HRYVRPQETG NRSDVRWFAL SDGETKLFVS GMPQIDFSVW PFSMEDLERA QHISELPERD FVTVNVDFRQ MGLGGDDSWG AMPHLEYRLL PKPYRFSFRM RISKEIPSWR VLAAIPETLH VEMSSEDVIR EGDTLRVKFS LLNDTPLSKE EQVVLFVDGN EYSVRRVVIP PFKKEELVFK VEGLKKGEHL IHTNLNTRKT IYVR
|
| |