Gene Information |
Locus tag | TRQ2_1743 |
Symbol | |
ID | 6093194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 1759112 |
End bp | 1762072 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642488942 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001739759 |
Protein GI | 170289521 |
COG category | |
COG ID | |
TIGRFAM ID | |
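The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon; the function names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1759112
end_bp = 1762072
gene_length_bp = 2961
protein_length_aa = 986


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)
print(computed_length, computed_protein)  # prints "2961 986"
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.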
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAACCTGG AAGAACTGAA AAATCCGGGT GTGTGGTACA GACCCGCTCC CTTCTGGAGC TGGAACGATA AGTTGTGCGA GGAGGAACTT CTAAGGCAGA TAGACGAAAT GTATGAGAAG GGCTACGGAG GCTTCTTCAT GCACTCGAGA GTGGGGCTTG TAACGGAATA CCTCTCTGAG GAGTGGATGA GACTCGTAAG AAGCTGTGCA GAACACGCCA GAAAACTCGG GATGCTCGCC TGGCTCTACG ATGAGGACAA ATGGCCTTCC GGATTCGCTG GAGGAATCGT GCCACTCGAA AAACCGGAAC ACAGGCACAA ATACCTCACA CTCTTGAAAA AGGATCAGAT CAAACCAGAA GATGAGATCC TGAAAAAGAT AGAAAGAGAC GGCGAAGAAT TCTACGTGGT AAAACGCGTT ATGAAGCTTG GTGATCCGTG GTTCAACGGA ACCTGCTACG TCGATCTGCT CTCAAGAGAG ACCACAGAGG CGTTTCTCAG ATCGACACAC GAGAGATACA AAAAATCGTG CGGTGATCTT TTCAGAGTCT CCATCCCCGG GATCTTCACC GATGAACCCA CCTATCTGAG AGTGCATCAC CCTAAAGAAC CCACACTCCC CTGGACAGAA CGATTTCCAG AGGAGTTTTT GAGGCGAAAG GGATACGACA TCAGAGACCA CTTAGAAGAA CTCTTCTTCA ACGTGAAGGA CTACATGAAG GTGAGATACG ATTTCTTCGA TGTTGCGACG AGTCTGTTCA TCGAAAACTT CACGATCCCG TACGCGAAGT GGTGTGAAGA AAACGGCATC TTCATGACGG GACACTACAT GGCAGAAGAT ACACTCAGAG GACAGGTGGA GTGGATAGGT GCCGCGATGC CACACTACGA ATACATGCAG ATACCGGGAA TAGACAAACT CGCCAGACAC CTGGAACAGG TGGTCACGAT AAAGCAGGTT TCATCGGCGG CAGAGCAGTT AGGGAAGAAA TGGGTGCTCT GCGAAACGTT CGGTACAACG GGTCAGCACG TGAGTTTTCT ACACAGAAAA TGGATAGCGG ACTGGCAGGC GGTTCTCGGT GTCACATACA TAAACCCGCA TCTCAGTCTC TACTCGATGA GAGGAGAGAG AAAGAGGGAC TACCCACCGA ACCTCTTCTA TCAACAACCC TGGTGGAAGA ATGAAAGATT CCTGTCGGAT TACTTTGCAA GACTGAACCA CATCGTCACA CAGGGAAAGA GAGAGGTCAA AGTGCTCATG ATACATCCCA TCTCCTCGGC GTGGTGTGTG TATTCTAAGT TCGATGACGA GATCGATAAG CTCAACGAGC TCTTCGATAC GATCACAAAG GAACTCGTTG CGAACAAGAT AGACTTCCAC TTCGGCGACG AGATGATCCT CTCAAAACAT GGAAGAGTGA AAGAGGCAAA ACTGAGAGTA GGAGAGTACG AATACGAAGT CGTGGTGCTG CCACCGCTCC TGAATCTGAA AAGTTCTACT GTGGAACTTT TGAACTCACT GGCAGAAAAC GGTGGTAAGG TCTTTGTTTT GAAGGATTTC AGGTACGGCA GGTTCTTCCC GGAAAGAGTG GAAGGAAAGA AGGGAAGAAT CGAATTCCTC AAAAAGGCTC GTGTTTTTGA AACGCTCGAA GATCTCATCG AAGAACTCAA ACCCTTCTCT TCTGTGGATG TTCTGGATAC GAAAACGAAA GAGAATGCAA AAGCGGTGAT CGCGCAGAAG AGAGTGCTGG AAGACGGATC TTATCTTCTC TTTCTTGCGA ACACGGACAT CGACCGGGAA 
GTGCACTGCC ATCTGGAACT GAAGGAAAAG AGAAAACACA CGTACGCGAT AGACCTGTTC AATTTCAAAT TGGTGGAGCT GAAGGAAAAC GAGTTTGTCA TGTTCCCAGC GTCGAGTGTT TGTATCTGGG TAACGGACGA GGAGGTTCCT GCAGAGGATG AGAAAGTGGT TTCAACAGGA GTTCTTCTGG AGAAAGAATT CGATTTCGAG ACGGCACTGA ACGACTTTGA AGTGAAGATG AACTCTTTCA ACGTTCTTCC AGTGGACAGG GTGGAGTATT TTGAAGCGGG TGGAAGGGTT TTCAGGAACG AGTTCGTCTC GAAGATCTGG TACGAGTTCT ACAGATTACC GGATGGTACA CCGTTCAGGG TGGAGTACTC TTTCGAGGTC AGGAAAAAGC CTCAAAAACT CTTTCTCGTC GTTGAGTGCG CGGAAAATCT TGACAGAATC ACGGTGAACG GTCGAGAGGT GAGGTACGAA AGAAAAAGCT GCATTTTCAA CGAGGAACAG AATTTCCTGG ATGTGAACTT TGGAAAGATG GAGATCACAG ATCTCGTCAG AGAAGGAAAG AACACGGTTG TTCTGGAGGG AAGAAAAGAA AACAACATCA CAGGCCCGGG GTGTCACACG AGGGTGAAAG ATCCAGAGAA TCACAGGCCA ACAGAGGTGG AGACGATATA TCTTGTGGGA GATTTTTCAC TTGTGAACGT GGACGAGACG AGGTACGTGA TCGATGCACC CAAGATACCG GATCACAGGG ACATCACACG GGATGGGTAC CCGTTCTATG TGGGATCCTT CACGCTCAAA AAGATATTTG AGTGTAAAAA AGACCCGGGG AAAAGATACT TCCTCAAGCT GAACGGTGTG GAGGCGGCCT CGGTCGAGGT GATCCTGAAC GGGAAGTTCC TGGGCGTTCT TTTCTGGAGA CCTTTCATGA TAGATATTAC AGATGCTCTG AGAAACGGAA AAAACGAACT CCAACTCGTC CTCACAAACA CACTGTTCAA CCTCATAGAG GCGAACCACA AGGCAGACGT TCTCGAGGAG ACCTTCAGAA GACCGAAGAG CTTCATAGAT TTCGAGCACC ACACGGACAG ATACATCCTG CTGCCCTTCG GCCTGGAAAA CGTCGCCGTT CTCAGCTCTT CTTCACGATG A
|
Protein sequence | MNLEELKNPG VWYRPAPFWS WNDKLCEEEL LRQIDEMYEK GYGGFFMHSR VGLVTEYLSE EWMRLVRSCA EHARKLGMLA WLYDEDKWPS GFAGGIVPLE KPEHRHKYLT LLKKDQIKPE DEILKKIERD GEEFYVVKRV MKLGDPWFNG TCYVDLLSRE TTEAFLRSTH ERYKKSCGDL FRVSIPGIFT DEPTYLRVHH PKEPTLPWTE RFPEEFLRRK GYDIRDHLEE LFFNVKDYMK VRYDFFDVAT SLFIENFTIP YAKWCEENGI FMTGHYMAED TLRGQVEWIG AAMPHYEYMQ IPGIDKLARH LEQVVTIKQV SSAAEQLGKK WVLCETFGTT GQHVSFLHRK WIADWQAVLG VTYINPHLSL YSMRGERKRD YPPNLFYQQP WWKNERFLSD YFARLNHIVT QGKREVKVLM IHPISSAWCV YSKFDDEIDK LNELFDTITK ELVANKIDFH FGDEMILSKH GRVKEAKLRV GEYEYEVVVL PPLLNLKSST VELLNSLAEN GGKVFVLKDF RYGRFFPERV EGKKGRIEFL KKARVFETLE DLIEELKPFS SVDVLDTKTK ENAKAVIAQK RVLEDGSYLL FLANTDIDRE VHCHLELKEK RKHTYAIDLF NFKLVELKEN EFVMFPASSV CIWVTDEEVP AEDEKVVSTG VLLEKEFDFE TALNDFEVKM NSFNVLPVDR VEYFEAGGRV FRNEFVSKIW YEFYRLPDGT PFRVEYSFEV RKKPQKLFLV VECAENLDRI TVNGREVRYE RKSCIFNEEQ NFLDVNFGKM EITDLVREGK NTVVLEGRKE NNITGPGCHT RVKDPENHRP TEVETIYLVG DFSLVNVDET RYVIDAPKIP DHRDITRDGY PFYVGSFTLK KIFECKKDPG KRYFLKLNGV EAASVEVILN GKFLGVLFWR PFMIDITDAL RNGKNELQLV LTNTLFNLIE ANHKADVLEE TFRRPKSFID FEHHTDRYIL LPFGLENVAV LSSSSR
|
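The relationship between the two sequence fields can be illustrated by translating short excerpts of the gene sequence. This is a hedged sketch, not the database's own method: it assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (first base varies slowest, third fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in this record.
head = "ATGAACCTGG AAGAACTGAA AAATCCGGGT"
tail = "CTCAGCTCTT CTTCACGATG A"

print(translate(head))  # prints "MNLEELKNPG"
print(translate(tail))  # prints "LSSSSR"
```

The two outputs match the first ten and last six residues of the protein sequence field. The tail excerpt starts at base 2941, which is in frame because 2940 is divisible by 3.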