Gene TRQ2_1625 details


Gene Information

Locus tag: TRQ2_1625
Symbol:
ID: 6093074
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga sp. RQ2
Kingdom: Bacteria
Replicon accession: NC_010483
Strand:
Start bp: 1643294
End bp: 1646548
Gene Length: 3255 bp
Protein Length: 1084 aa
Translation table: 11
GC content: 48%
IMG OID: 642488826
Product: glycoside hydrolase family 42 protein
Protein accession: YP_001739644
Protein GI: 170289406
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
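
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as below. This is a minimal illustration, not part of the record: the helper names `gene_length` and `protein_length` are hypothetical, and the sketch assumes that the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming the CDS includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(1643294, 1646548)
print(length_bp)                  # 3255
print(protein_length(length_bp))  # 1084
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields.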


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00573488
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCTACG AATGGGAAAA CCCACAGCTT GTGGGTGAAG GAACGGAGAA GCCACACGCT 
TCTTTCATAC CCTATCTGGA CCCGTTCAGC GGGGAATGGG AGTACCCTGA GGAATTCATC
TCTTTGAACG GGAACTGGGG GTTTCTCTTC GCGAAAAATC CCTTCGAGGT GCCGGAGGAT
TTCTTTTCAG AGAACTTCGA CGACTCGAAC TGGGATGAGA TAGAAGTTCC AAGCAACTGG
GAGATGAAAG GATATGGGAA GCCCATCTAC ACGAACGTGG TTTATCCATT TGAACCGAAC
CCTCCTTTTG TTCCAAAAGA CGACAATCCG ACCGGGGTGT ACAGAAGGTG GATCGAGATA
CCTGAGGATT GGTTCAAAAG GGAGATCTTT CTGCATTTTG AAGGTGTTCG ATCCTTCTTC
TATCTGTGGG TGAACGGGAA GAAGATCGGT TTCAGCAAAG ACAGCTGCAC ACCCGCTGAA
TTCAGACTCA CCGATGTTCT AAGGCCAGGG AAGAATCTGA TCACCGTTGA GGTTCTGAAG
TGGAGCGATG GAAGCTATCT CGAAGATCAG GACATGTGGT GGTTTGCGGG GATATACAGG
GACGTTTATC TGTACTCGCT GCCGAAATTT CACATCAGGG ACGTGTTCGT GAGAACGGAT
CTGGATGAAA ATTACAGAGA CGGAAAGATC TTTCTGGACG TAGAGATGAG AAATCTCGGT
GAGGAAGAAG AAAAAGACCT TGAAGTAACA CTCATCACAC CGGATGGAGA CGAAAAAACA
CTCGTGAAAG AGACAGTAAA GCCGGAGGAC AGAGTCCTTT CCTTTGCCTT TGACGTGAAA
GATCCGAGGA AGTGGTCCGC GGAGACACCA CATCTGTACG TTTTGAAACT GAAACTGGGA
GAAGACGAAA AGAAAGTCAA CTTCGGATTC AGGAAGATAG AGATAAAAGA CGGAATGCTT
CTTTTCAACG GGAAACCTCT CTACATAAAG GGAGTGAACA GACACGAGTT CGACCCCGAC
AGGGGTCATG CGGTGACAGT GGAGAGAATG ATTCAGGACA TAAAGCTAAT GAAGCAGCAC
AACATAAACA CAGTTCGCAC ATCACACTAT CCGAACCAGA CGAAGTGGTA CGATCTGTGT
GACTATTTTG GACTCTACGT GATAGACGAG GCAAACATCG AATCTCATGG AATCGGTGAA
GCTCCTGAAG TGACCCTTGC GAACAGACCG GAATGGGAGA AGGCACATCT CGACAGGATC
AAAAGGATGG TCGAGCGCGA CAAGAATCAT CCCAGTATCA TCCTCTGGTC TCTCGGGAAC
GAAGCGGGAG ATGGAGTGAA TTTCGAAAAA GCCGCTCTCT GGATAAAGAA AAGGGACAAC
ACGCGACTTA TCCATTACGA GGGAACAACA AGGAGGGGAG AATCGTACTA CGTCGATGTT
TTCTCCCTCA TGTATCCGAA GATGGACGTT CTTCTTGAGT ACGCTTCAAA GAAGAGGGAA
AAGCCCTTCA TCATGTGTGA ATACGCGCAC GCGATGGGAA ACAGTGTGGG AAATCTGAAG
GACTACTGGG ACGTGATAGA AAAGTATCCG TACCTTCATG GAGGTTGCAT CTGGGACTGG
GTGGATCAGG GAATAAGAAA GAAGTACGAG AACGGAAGAG AATTCTGGGC GTACGGTGGT
GACTTCGGTG ACACACCGAA TGACGGAAAC TTCTGTATAA ACGGTGTGGT ACTGCCCGAT
AGAACACCTG AGCCGGAACT TTACGAGGTG AAGAAGGTCT ATCAGAACGT CAAAATAAGA
CAAGTATCGA AAGACACTTA TGAAGTGGAA AACAGGTATC TATTCACGAA CCTTGAGATG
TTCGATGGCG CCTGGAAAAT CAGAAAAGAC GGAGAGGTCA TCGAAGAAAA AACCTTCAAG
ATCTTCGCTG AACCTGGAGA AAAGCGTCTC TTGAAGATAC CACTCCCAGA AATGGACGAT
TCTGAGTATT TCCTCGAGAT CAGTTTCTCC CTTTCTGAGG ATACCCCCTG GGCGGAAAAA
GGCCACGTTG TGGCGTGGGA GCAGTTCCTT CTGAAAGCAC CGGCATTTGA GAAAAAATCC
ATTTCAGATG GAGTATCACT CAGAGAAGAG GGGAAACATC TCACCGTTGA AGCAAAAGAC
ACGGTGTATG TATTCTCAAA ACTCACAGGC CTTCTGGAGC AGATCCTTCA CAGAGGAAAA
AAGATCCTGA AAAGTCCCGT TGTTCCCAAC TTTTGGAGAG CTCCAACCGA CAACGACATC
GGAAACAGAA TGCCGCAGAG GCTCGCCATC TGGAAGAGGG CATCGAAAGA GAGAAAACTC
TTCAAAATGC ACTGGAAAAA GGAAGAAAAT CGTGTTTCTG TCCATAGTGT TTTTCAGCTT
CCGGGAAACA GCTGGGTGTA CACAACATAC ACCGTTTTCG GAAATGGAGA CATCCTAGTG
GACCTTTCTC TGATTCTCGC TGAGGATGTA CCGGAGATTC CAAGGATCGG TCTTCAGTTC
ACGGTCCCTG AAGAGTTCGG CACCGTGGAG TGGTACGGAA GGGGACCGCA CGAGACTTAC
TGGGACAGAA AAGAAAGTGG CCTTTTCGCA AGGCACAGAA AAGCTGTCGA TGAGATGATA
CACAGGTACG TCAGGCCCCA GGAAACGGGG AACAGATCGG ACGTGAGATG GTTTGCGCTT
TCCGATGGTG AAACAAAACT CTTTGTGTCA GGCATGCCGC AGATAGACTT CAGCGTCTGG
CCCTTTTCCA TGGAGGATCT CGAGAGGGCT CAGCACATAA GTGAACTCCC GGAGAGGGAC
TTCGTCACCG TGAACGTGGA CTTCAGACAG ATGGGCCTTG GAGGAGACGA CAGCTGGGGT
GCGATGCCTC ATCTGGAGTA CAGGCTTCTA CCAAAGCCGT ATCGTTTTTC TTTCAGAATG
AGGATTAGCA AAGAGATTCC ATCCTGGAGG GTTCTTGCGG CGATCCCTGA AACGCTCCAT
GTTGAGATGT CCTCAGAAGA CGTGATACGC GAAGGAGACA CCCTGAGAGT GAAATTTTCC
CTTCTGAACG ACACTCCACT GAGCAAGGAA GAACAGGTGG TTCTCTTTGT TGATGGAAAC
GAATACTCGG TGAGGCGAGT GGTGATTCCA CCCTTCAAGA AGGAAGAGCT GGTGTTCAAA
GTAGAAGGAT TGAAGAAGGG AGAACATCTG ATACATACTA ATCTGAACAC GAGAAAAACT
ATCTACGTGA GGTGA
 
Protein sequence
MSYEWENPQL VGEGTEKPHA SFIPYLDPFS GEWEYPEEFI SLNGNWGFLF AKNPFEVPED 
FFSENFDDSN WDEIEVPSNW EMKGYGKPIY TNVVYPFEPN PPFVPKDDNP TGVYRRWIEI
PEDWFKREIF LHFEGVRSFF YLWVNGKKIG FSKDSCTPAE FRLTDVLRPG KNLITVEVLK
WSDGSYLEDQ DMWWFAGIYR DVYLYSLPKF HIRDVFVRTD LDENYRDGKI FLDVEMRNLG
EEEEKDLEVT LITPDGDEKT LVKETVKPED RVLSFAFDVK DPRKWSAETP HLYVLKLKLG
EDEKKVNFGF RKIEIKDGML LFNGKPLYIK GVNRHEFDPD RGHAVTVERM IQDIKLMKQH
NINTVRTSHY PNQTKWYDLC DYFGLYVIDE ANIESHGIGE APEVTLANRP EWEKAHLDRI
KRMVERDKNH PSIILWSLGN EAGDGVNFEK AALWIKKRDN TRLIHYEGTT RRGESYYVDV
FSLMYPKMDV LLEYASKKRE KPFIMCEYAH AMGNSVGNLK DYWDVIEKYP YLHGGCIWDW
VDQGIRKKYE NGREFWAYGG DFGDTPNDGN FCINGVVLPD RTPEPELYEV KKVYQNVKIR
QVSKDTYEVE NRYLFTNLEM FDGAWKIRKD GEVIEEKTFK IFAEPGEKRL LKIPLPEMDD
SEYFLEISFS LSEDTPWAEK GHVVAWEQFL LKAPAFEKKS ISDGVSLREE GKHLTVEAKD
TVYVFSKLTG LLEQILHRGK KILKSPVVPN FWRAPTDNDI GNRMPQRLAI WKRASKERKL
FKMHWKKEEN RVSVHSVFQL PGNSWVYTTY TVFGNGDILV DLSLILAEDV PEIPRIGLQF
TVPEEFGTVE WYGRGPHETY WDRKESGLFA RHRKAVDEMI HRYVRPQETG NRSDVRWFAL
SDGETKLFVS GMPQIDFSVW PFSMEDLERA QHISELPERD FVTVNVDFRQ MGLGGDDSWG
AMPHLEYRLL PKPYRFSFRM RISKEIPSWR VLAAIPETLH VEMSSEDVIR EGDTLRVKFS
LLNDTPLSKE EQVVLFVDGN EYSVRRVVIP PFKKEELVFK VEGLKKGEHL IHTNLNTRKT
IYVR
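
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout and translate the coding sequence so the two listings can be compared. It is an illustration only: the names `clean_sequence` and `translate_cds` are hypothetical, and it assumes that translation table 11 assigns the same amino acids to internal codons as the standard code (the two tables differ in which start codons are permitted). A codon containing a character other than A, C, G or T would raise a `KeyError`.

```python
import itertools

# Amino acids for all 64 codons, with bases ordered T, C, A, G at each
# position (third position varying fastest). Assumption: table 11 shares
# these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = ["".join(c) for c in itertools.product(BASES, repeat=3)]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate an in-frame coding sequence, dropping one trailing stop."""
    dna = clean_sequence(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "ATGTCCTACG AATGGGAAAA CCCACAGCTT GTGGGTGAAG GAACGGAGAA GCCACACGCT"
last_line = "ATCTACGTGA GGTGA"
print(translate_cds(first_line))  # MSYEWENPQLVGEGTEKPHA
print(translate_cds(last_line))   # IYVR
```

The two printed fragments match the start and end of the protein sequence listed above.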