Gene Information |
Locus tag | TRQ2_0175 |
Symbol | |
ID | 6091577 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 170307 |
End bp | 171716 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642487356 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001738219 |
Protein GI | 170287981 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
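The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in Protein Length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 170307
END_BP = 171716


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1410 469
```

Under those assumptions the result agrees with the listed Gene Length (1410 bp) and Protein Length (469 aa).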
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.949152 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATCT CCATCATCGG AGCAGGAAGC GTGAGGTTCG CACTTCAGCT TGTGGGAGAT ATCGCTCAGA CCGAAGAGCT CTCAAAAGGA GACACCCATG TCTACCTCAT GGACGTTCAC GAAAAAAGAC TGAATGCATC TTACACTCTC GCGAAAAAGT ACGTGGAAGA GCTGAACTCT CCTGTGAAGA TCGTAAAAAC ATCCAGTCTG GATGAAGCCA TAGATGGAGC AGACTTCATC ATAAACACCG CCTATCCTTA CGATCCGAGG TACCACGACA GCGGCTCTCA GAGATGGGAC GAGGTCACAG ATGTCGGTGA AAGACACGGC TACTACAGAG GCATCGACAG TCAGGAACTG AACATGGTTT CCACCTACAC CTACGTTCTT TCTTCTTATC CCGACATGAA GCTCGCCCTC GAAATCGCAG AGAAGATGAA GAAGATGGCA CCCAGAGCGT ACTTGATGCA GACGGCAAAT CCCGTCTTCG AGATCACACA GGCGGTGAGA AGGTGGACCG GTGCGAACAT AGTGGGGTTC TGCCACGGAG TTGCCGGGGT CTACGAAGTC TTCGAAAGAC TCGGTCTCGA TCCGAAGGAA GTGGACTGGC AGGTGGCAGG TGTGAACCAC GGTATCTGGT TGAACAGGTT CAGATACAGA GGAGAAGATG CCTATCCTCT CCTTGATGAG TGGATAGAGA AAGAGCTACC AAAGTGGGAG CCGAAGAATC CGTGGGACAC ACAGATGTCT CCTGCCGCGA TGGACATGTA CAGATTCTAC GGTATGCTTC CGATAGGTGA CACCGTGAGG AACGGTACCT GGAAATACCA TTACAACCTG GAGACGAAGA AGAAGTGGTT TGGAAAGTTC GGTGGCATAG ACAACGAAGT GGAAAGGCCA AAATTCCACG AACAGCTCAG AAGAGCAAGA GAGCGCCTCA TAAAACTCGC AGAAGAGGTC CATCCAGGCA TGAAACTCAC AGAAGAACAT CCCGAGATCT TCCCGAAAGG GAAACTCAGT GGAGAACAGC ACATTCCTTT CATAAACGCG ATAGCGAACA ACAAACGTGT GAGACTCTTC CTGAACGTAG AGAACCAGGG AACGCTCAAG GACTTTCCCG ATGACCTCGT GATGGAACTT CCCGTCTGGG TGGACTGCTG TGGAATCCAC AGAGAGAAAG TGGAACCCGA TCTCACCCAC CGGATAAAGA TCTTCTATCT GTGGCCCAGG ATTCTGAGAA TGGAGTGGAA CTTAGAAGCA TACATCTCGA GGGACAGAAA GGTGCTCGAA GAGATTCTCA TCAGAGACCC AAGGACAAAA TCCTACGAGC AGATCGTGCA AGTGCTCGAT GAGATCTTCA ACCTGCCGTT CAACGAAGAG TTGAGGAGAT ACTACAAAGA GAAACTCTGA
|
Protein sequence | MKISIIGAGS VRFALQLVGD IAQTEELSKG DTHVYLMDVH EKRLNASYTL AKKYVEELNS PVKIVKTSSL DEAIDGADFI INTAYPYDPR YHDSGSQRWD EVTDVGERHG YYRGIDSQEL NMVSTYTYVL SSYPDMKLAL EIAEKMKKMA PRAYLMQTAN PVFEITQAVR RWTGANIVGF CHGVAGVYEV FERLGLDPKE VDWQVAGVNH GIWLNRFRYR GEDAYPLLDE WIEKELPKWE PKNPWDTQMS PAAMDMYRFY GMLPIGDTVR NGTWKYHYNL ETKKKWFGKF GGIDNEVERP KFHEQLRRAR ERLIKLAEEV HPGMKLTEEH PEIFPKGKLS GEQHIPFINA IANNKRVRLF LNVENQGTLK DFPDDLVMEL PVWVDCCGIH REKVEPDLTH RIKIFYLWPR ILRMEWNLEA YISRDRKVLE EILIRDPRTK SYEQIVQVLD EIFNLPFNEE LRRYYKEKL
|
| |
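Both sequences above are printed in space-separated blocks of 10 characters, so the spaces must be removed before any analysis. The sketch below is a minimal, standard-library-only illustration: it strips the grouping and translates a short excerpt of the gene sequence. The helper names (ungroup, translate) are hypothetical. The codon-to-amino-acid assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs only in which codons may act as start codons).

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def ungroup(sequence: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(sequence.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = ungroup(dna)
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
excerpt = "ATGAAGATCT CCATCATCGG AGCAGGAAGC"
print(translate(excerpt))  # MKISIIGAGS
```

The translated excerpt matches the first block of the listed protein sequence. The same functions can be applied to the full gene sequence to compare against the full protein sequence.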