Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_1623 |
Symbol | |
ID | 6093072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 1640344 |
End bp | 1642293 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642488824 |
Product | Beta-galactosidase |
Protein accession | YP_001739642 |
Protein GI | 170289404 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.015305 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGGAG TCTGTTACTA TCCTGAACAC TGGGGAATAG AGAGAGTGGA AGAAGACTTC AGAAGGATGA AAGAACTCGG GATAGAGTAC GTGAGGATTG GAGAGTTCGC CTGGAGCAGG ATAGAGCCTG AGCGTGGAAA GTTCAACTGG GACTGGCTCG ACAAAACACT TGAGCTGGCA GAGAAGATGG GGCTCAAGAT CGTACTGGGA ACTCCCACAG CTACACCTCC GAAGTGGCTC ATCGATGAAC ACCCGGAGAT CCTTCCCGTT GATAAAGATG GCAGGGTGAA AAATTTTGGT TCCAGAAGAC ATTACTGCTT CTCCAGCCCT GTCTATAGAG AAGAAGTGAA AAGAATCGTC ACCATAATAG CGAAAAGGTA CGGAAAACAC CCGGCAGTCG CAGGCTGGCA GACGGACAAC GAATACGGCT GTCACGATAC GGTTAAGTGC TACTGTCCGA GGTGCAAAAA AGCCTTCCAA AAATGGTTGG AAAGAAAGTA CGAGGGAGAC ATAAAGAAAT TGAATGAAGC GTGGGGAACA GTGTTCTGGA GCCAGGAGTA TCGATCCTTC GACGAAATAG AGCTTCCGAA TCTCACTCCT GCCGATCCAA ACCCGTCTCA TCTTCTCGAC TATTACAGGT TCGCTTCCGA CCAGGTGGTG GAATTCAACA AGCTTCAAGT GGAGATCATA AGGGAGTATT CTCCTGGAAG ATTCATCACA CACAATTTCA TGGGAGGATT TGTAGATTTT GACCATTACA AACTTTCAAG AGACTTGGAT TTTGCATCCT GGGACAACTA CCCACTCGGA CACACACTCG TTTTTCTGAG AATGAAGGGT GAGACGAAAA ATCCGTTCGA TAGAGTAGGA CACCCGGACA TCATCTCCTT CTCACACGAC CTGTACCGAG GAGTGGGCAG GGGAAGATTC TGGGTGATGG AACAGCAAGC AGGACCTGTG AACTGGGCTC CTTACAATCT CTGGCCTGCC AAAGGAGCAG TGAAACTCTG GACGTGGCAG GCATTCGCAC ACGGTGCGGA GGTGGTCTCT TACTTCAGAT GGAGGCAGGT TCCTTTCGCG CAGGAGCAGA TGCACTCCGG ACTTCTGGCA CCGGATTCAG CACCTTCCCC TGGATACCAT GAGGTAAAGC AGGTTTTCGA GGAGCTCAAA AACATCGATA TCAATGAACC CGTGAAAAGC GAAGTGGCAC TTGTCTTCGA TTATGAAACA GCATGGATCT TCTCCATACA ACCGCACGGC GAGGGGGCGA ACTACCTCGA TCTTGTCTTC AGATGCTACA GCGCGCTCAG AAGTCTTGGT CTGAACGTGG ATATAGTACC CCCTGGATCT TCACTGGACG GATACAAAAT GGTCGTTGTT CCAAGTCTTG CCATTGTGAA GGAAGAGGTT CTGAACACGT TTAAAAAATA CGACGGTCTT CTCGTACTTG GCCCAAGAAG CGGAAGCAAA ACAGAGACGT TCCAGATTCC TCCCGAGATG CCCCCAGGTC TTCTCAAAGA GCTCATACCC GTTGAAGTAA GACAAGTTGA AAGCCTTGGG TACAACGCTG AAACTCTTAT TTGGAACGGG AAAGAGTATC CCGTCTTGAT CTGGAGGGAG GACGTGGATC CTACTATCAC GGAAGTGATC GCAAGATTCA AAGATGGTTT TGGAGCCATA TTCCGGAAAG ACAACGTTTT TTACCTTTCT TTCTGGCCTG ACAGAGAATT TCTTGTAGAT TTCTTTGAAG CACTTTCAAA AGAATCAGGA ATTGAAACGA AAAGATTACC AGAAGGAATA CGCATTCAAA GGAGAGGTGA ATATGTTTTT TCTTTCAATT TCACCTCTGA GGAGGTAGAT TTAGAAATAC CAGCAAAGGT TCAGATAGTT CTAGGAGATC AAAAGATTCC TCCCTACGGA CTGTTGATAT GGAGGGAAAA CGAACACTGA
|
Protein sequence | MLGVCYYPEH WGIERVEEDF RRMKELGIEY VRIGEFAWSR IEPERGKFNW DWLDKTLELA EKMGLKIVLG TPTATPPKWL IDEHPEILPV DKDGRVKNFG SRRHYCFSSP VYREEVKRIV TIIAKRYGKH PAVAGWQTDN EYGCHDTVKC YCPRCKKAFQ KWLERKYEGD IKKLNEAWGT VFWSQEYRSF DEIELPNLTP ADPNPSHLLD YYRFASDQVV EFNKLQVEII REYSPGRFIT HNFMGGFVDF DHYKLSRDLD FASWDNYPLG HTLVFLRMKG ETKNPFDRVG HPDIISFSHD LYRGVGRGRF WVMEQQAGPV NWAPYNLWPA KGAVKLWTWQ AFAHGAEVVS YFRWRQVPFA QEQMHSGLLA PDSAPSPGYH EVKQVFEELK NIDINEPVKS EVALVFDYET AWIFSIQPHG EGANYLDLVF RCYSALRSLG LNVDIVPPGS SLDGYKMVVV PSLAIVKEEV LNTFKKYDGL LVLGPRSGSK TETFQIPPEM PPGLLKELIP VEVRQVESLG YNAETLIWNG KEYPVLIWRE DVDPTITEVI ARFKDGFGAI FRKDNVFYLS FWPDREFLVD FFEALSKESG IETKRLPEGI RIQRRGEYVF SFNFTSEEVD LEIPAKVQIV LGDQKIPPYG LLIWRENEH
|
| |