Gene TRQ2_1623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_1623 
Symbol 
ID6093072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp1640344 
End bp1642293 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content48% 
IMG OID642488824 
ProductBeta-galactosidase 
Protein accessionYP_001739642 
Protein GI170289404 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1874] Beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.015305 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGGAG TCTGTTACTA TCCTGAACAC TGGGGAATAG AGAGAGTGGA AGAAGACTTC 
AGAAGGATGA AAGAACTCGG GATAGAGTAC GTGAGGATTG GAGAGTTCGC CTGGAGCAGG
ATAGAGCCTG AGCGTGGAAA GTTCAACTGG GACTGGCTCG ACAAAACACT TGAGCTGGCA
GAGAAGATGG GGCTCAAGAT CGTACTGGGA ACTCCCACAG CTACACCTCC GAAGTGGCTC
ATCGATGAAC ACCCGGAGAT CCTTCCCGTT GATAAAGATG GCAGGGTGAA AAATTTTGGT
TCCAGAAGAC ATTACTGCTT CTCCAGCCCT GTCTATAGAG AAGAAGTGAA AAGAATCGTC
ACCATAATAG CGAAAAGGTA CGGAAAACAC CCGGCAGTCG CAGGCTGGCA GACGGACAAC
GAATACGGCT GTCACGATAC GGTTAAGTGC TACTGTCCGA GGTGCAAAAA AGCCTTCCAA
AAATGGTTGG AAAGAAAGTA CGAGGGAGAC ATAAAGAAAT TGAATGAAGC GTGGGGAACA
GTGTTCTGGA GCCAGGAGTA TCGATCCTTC GACGAAATAG AGCTTCCGAA TCTCACTCCT
GCCGATCCAA ACCCGTCTCA TCTTCTCGAC TATTACAGGT TCGCTTCCGA CCAGGTGGTG
GAATTCAACA AGCTTCAAGT GGAGATCATA AGGGAGTATT CTCCTGGAAG ATTCATCACA
CACAATTTCA TGGGAGGATT TGTAGATTTT GACCATTACA AACTTTCAAG AGACTTGGAT
TTTGCATCCT GGGACAACTA CCCACTCGGA CACACACTCG TTTTTCTGAG AATGAAGGGT
GAGACGAAAA ATCCGTTCGA TAGAGTAGGA CACCCGGACA TCATCTCCTT CTCACACGAC
CTGTACCGAG GAGTGGGCAG GGGAAGATTC TGGGTGATGG AACAGCAAGC AGGACCTGTG
AACTGGGCTC CTTACAATCT CTGGCCTGCC AAAGGAGCAG TGAAACTCTG GACGTGGCAG
GCATTCGCAC ACGGTGCGGA GGTGGTCTCT TACTTCAGAT GGAGGCAGGT TCCTTTCGCG
CAGGAGCAGA TGCACTCCGG ACTTCTGGCA CCGGATTCAG CACCTTCCCC TGGATACCAT
GAGGTAAAGC AGGTTTTCGA GGAGCTCAAA AACATCGATA TCAATGAACC CGTGAAAAGC
GAAGTGGCAC TTGTCTTCGA TTATGAAACA GCATGGATCT TCTCCATACA ACCGCACGGC
GAGGGGGCGA ACTACCTCGA TCTTGTCTTC AGATGCTACA GCGCGCTCAG AAGTCTTGGT
CTGAACGTGG ATATAGTACC CCCTGGATCT TCACTGGACG GATACAAAAT GGTCGTTGTT
CCAAGTCTTG CCATTGTGAA GGAAGAGGTT CTGAACACGT TTAAAAAATA CGACGGTCTT
CTCGTACTTG GCCCAAGAAG CGGAAGCAAA ACAGAGACGT TCCAGATTCC TCCCGAGATG
CCCCCAGGTC TTCTCAAAGA GCTCATACCC GTTGAAGTAA GACAAGTTGA AAGCCTTGGG
TACAACGCTG AAACTCTTAT TTGGAACGGG AAAGAGTATC CCGTCTTGAT CTGGAGGGAG
GACGTGGATC CTACTATCAC GGAAGTGATC GCAAGATTCA AAGATGGTTT TGGAGCCATA
TTCCGGAAAG ACAACGTTTT TTACCTTTCT TTCTGGCCTG ACAGAGAATT TCTTGTAGAT
TTCTTTGAAG CACTTTCAAA AGAATCAGGA ATTGAAACGA AAAGATTACC AGAAGGAATA
CGCATTCAAA GGAGAGGTGA ATATGTTTTT TCTTTCAATT TCACCTCTGA GGAGGTAGAT
TTAGAAATAC CAGCAAAGGT TCAGATAGTT CTAGGAGATC AAAAGATTCC TCCCTACGGA
CTGTTGATAT GGAGGGAAAA CGAACACTGA
 
Protein sequence
MLGVCYYPEH WGIERVEEDF RRMKELGIEY VRIGEFAWSR IEPERGKFNW DWLDKTLELA 
EKMGLKIVLG TPTATPPKWL IDEHPEILPV DKDGRVKNFG SRRHYCFSSP VYREEVKRIV
TIIAKRYGKH PAVAGWQTDN EYGCHDTVKC YCPRCKKAFQ KWLERKYEGD IKKLNEAWGT
VFWSQEYRSF DEIELPNLTP ADPNPSHLLD YYRFASDQVV EFNKLQVEII REYSPGRFIT
HNFMGGFVDF DHYKLSRDLD FASWDNYPLG HTLVFLRMKG ETKNPFDRVG HPDIISFSHD
LYRGVGRGRF WVMEQQAGPV NWAPYNLWPA KGAVKLWTWQ AFAHGAEVVS YFRWRQVPFA
QEQMHSGLLA PDSAPSPGYH EVKQVFEELK NIDINEPVKS EVALVFDYET AWIFSIQPHG
EGANYLDLVF RCYSALRSLG LNVDIVPPGS SLDGYKMVVV PSLAIVKEEV LNTFKKYDGL
LVLGPRSGSK TETFQIPPEM PPGLLKELIP VEVRQVESLG YNAETLIWNG KEYPVLIWRE
DVDPTITEVI ARFKDGFGAI FRKDNVFYLS FWPDREFLVD FFEALSKESG IETKRLPEGI
RIQRRGEYVF SFNFTSEEVD LEIPAKVQIV LGDQKIPPYG LLIWRENEH