Gene Information |
Locus tag | TRQ2_0969 |
Symbol | |
ID | 6092399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 1006135 |
End bp | 1007475 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 642488165 |
Product | beta-galactosidase |
Protein accession | YP_001739002 |
Protein GI | 170288764 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR03356] beta-galactosidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00738034 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGTGA AAAAGTTCCC TGAAGGATTC CTCTGGGGTG TTGCAACAGC TTCCTACCAG ATCGAGGGTT CTCCCCTCGC AGACGGAGCT GGTATGTCTA TCTGGCACAC CTTCTCCCAT ACTCCTGGAA ATGTAAAGAA CGGTGACACG GGAGATGTGG CCTGCGACCA CTACAACAGA TGGAAAGAGG ACATTGAAAT CATAGAGAAA CTCGGAGTAA AGGCTTACAG ATTTTCAATC AGCTGGCCAA GAATACTTCC GGAAGGAACA GGAAGGGTGA ATCAGAAAGG ACTGGATTTT TACAACAGGA TCATAGACAC CCTGCTGGAA AAAGGTATCA CACCCTTTGT GACCATCTAT CACTGGGATC TTCCCTTCGC TCTTCAGTTG AAAGGAGGAT GGGCGAACAG AGAAATAGCG GATTGGTTCG CAGAATACTC AAGGGTTCTC TTTGAAAATT TCGGCGACCG TGTGAAGAAC TGGATCACCT TGAACGAACC GTGGGTTGTT GCCATAGTGG GGCATCTGTA CGGAGTCCAC GCTCCTGGAA TGAGAGATAT TTACGTGGCT TTCCGAGCTG TTCACAATCT CTTGAGGGCA CACGCCAAAG CGGTGAAAGT GTTCAGGGAA ACTGTGAAAG ATGGAAAGAT CGGAATAGTT TTCAACAATG GATATTTCGA ACCTGCGAGT GAAAAAGAGG AGGACATCAG AGCGGCGAGA TTCATGCATC AGTTCAACAA CTATCCTCTC TTTCTCAATC CGATCTACAG AGGAGATTAT CCGGAGCTCG TTCTGGAATT TGCCAGAGAG TATCTACCGG AGAATTACAA AGATGACATG TCCGAGATAC AGGAAAAGAT CGACTTTGTT GGATTGAACT ATTACTCCGG TCATTTGGTG AAGTTCGATC CAGATGCACC AGCTAAGGTC TCTTTCGTTG AAAGGGATCT TCCAAAAACA GCCATGGGAT GGGAGATCGT TCCAGAAGGA ATCTACTGGA TCCTGAAGAA GGTGAAAGAA GAATACAACC CACCAGAGGT TTACATCACA GAGAATGGGG CTGCTTTTGA CGACGTAGTT AGTGAAGATG GAAGAGTTCA CGATCAAAAC AGAATCGATT ATTTGAAGGC CCACATTGGT CAGGCATGGA AGGCCATACA GGAGGGAGTG CCGCTTAAAG GTTACTTCGT CTGGTCGCTC CTCGACAATT TCGAATGGGC AGAGGGATAC TCTAAGAGAT TTGGTATTGT GTACGTGGAC TACAGTACTC AAAAACGCAT CATAAAAGAC AGTGGGTACT GGTACTCGAA TGTGGTTAAA AACAACGGTC TGGAAGACTG A
|
Protein sequence | MNVKKFPEGF LWGVATASYQ IEGSPLADGA GMSIWHTFSH TPGNVKNGDT GDVACDHYNR WKEDIEIIEK LGVKAYRFSI SWPRILPEGT GRVNQKGLDF YNRIIDTLLE KGITPFVTIY HWDLPFALQL KGGWANREIA DWFAEYSRVL FENFGDRVKN WITLNEPWVV AIVGHLYGVH APGMRDIYVA FRAVHNLLRA HAKAVKVFRE TVKDGKIGIV FNNGYFEPAS EKEEDIRAAR FMHQFNNYPL FLNPIYRGDY PELVLEFARE YLPENYKDDM SEIQEKIDFV GLNYYSGHLV KFDPDAPAKV SFVERDLPKT AMGWEIVPEG IYWILKKVKE EYNPPEVYIT ENGAAFDDVV SEDGRVHDQN RIDYLKAHIG QAWKAIQEGV PLKGYFVWSL LDNFEWAEGY SKRFGIVYVD YSTQKRIIKD SGYWYSNVVK NNGLED
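
The coordinate, length, and sequence fields in this record can be cross-checked against one another. The following is a minimal Python sketch of that check, not the database's own method. The function names (`gene_length`, `expected_protein_length`, `translate`) are hypothetical. It assumes 1-based inclusive coordinates and a single terminal stop codon, and it uses the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons).

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 uses the same
# amino-acid assignments as the standard code (assumption stated above).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def translate(dna: str) -> str:
    """Translate a plus-strand CDS, stopping at the first stop codon.

    Spaces are removed first, so the 10-base blocks used in the record
    can be pasted in directly.
    """
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    length = gene_length(1006135, 1007475)
    print(length, expected_protein_length(length))
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGAACGTGA AAAAGTTCCC TGAAGGATTC"))
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence field (spaces removed) extends the same check to the whole record.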