Gene Information |
Locus tag | TRQ2_1617 |
Symbol | |
ID | 6093066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 1632491 |
End bp | 1634311 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 642488818 |
Product | arabinogalactan endo-1,4-beta-galactosidase |
Protein accession | YP_001739636 |
Protein GI | 170289398 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3867] Arabinogalactan endo-1,4-beta-galactosidase |
TIGRFAM ID | |
|
|
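The coordinate and length rows above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information rows above.
start_bp = 1632491
end_bp = 1634311
gene_length_bp = 1821
protein_length_aa = 606

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions, all three checks agree with the listed Gene Length and Protein Length.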
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0588913 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGGGG TGCTTTTCGT GCTGATGATT TCTTCCATGG CCTTTGGTTT GATAGTCAAC CCGGTGAAAA ACCTGTGCGA GGATTTCATC TTTGGAATGG ATGTTTCTAT GCTCTACGAG ATCGAGCAAC TGGGTGGGAA ATATTTCGAG AATGGTGTGG AAAAAGATTG TCTTGAAATA CTGAAGAATC ATGGAATAAA CTGGATCAGG TTGAGGGTGT GGAATGATCC GAGAGACGAG AATGGAAATC CTCTCGGAGG AGGAAACTGC GATTACCTGA AGATGACAGA AATCGCTAAA AGGGCAAAGA AACTCGGAAT GAAAGTGCTT CTTGACTTCC ATTACAGCGA CTGGTGGGCG GATCCTGGAA AGCAGAACAA ACCAAAAGAG TGGGAATATC TTCATGGAGA ACTTCTGGAG AGGGCGGTTT ACTCCTACAC GAAACTTGTA CTGAACCACA TGCGAAGAAA CGGGGCACTA CCAGATATGG TCCAGGTGGG GAATGAGGTG AACAACGGTT TTCTCTGGCC TGACGGCAAG ATTTCTGGAG AAGGTGCAGG TGGTTTCGAC GGATTCACAA GACTTTTGAA AGCTGCCATC AAGGCCGTTA GAGAGGTTGA TCCGGATATA AAGATCGTTA TTCATCTGGC GGAAGGTGGA AACAACTCTC TCTTCAGATG GTTCTTCGAC GAGATCACAA GAAGAAACGT GGACTTCGAT GTAATAGGTG TATCTTACTA CCCGTACTGG CACGGAACCC TCGAGGATCT GAAAAACAAC CTCTACGACA TAGCCACAAG ATATAACAAG GATGTGCTCG TTGTTGAAAC AGCTTACGCC TGGACACTCG AGGATGGAGA TGGTTATCCC AACATCTTCA ATGGTGAAGA AATGGAACTA ACAGGTGGCT ACAAGGCAAC CGTTCAGGGA CAGGCAACAT TTCTGAGAGA TCTCATGGAA GTGGTAAACA GCGTTCCCAA CGGCCATGGA CTCGGGATTT TCTACTGGGA AGGAGATTGG ATCCCTGTGA GGGGGGCTGG ATGGAAAACC GGAGAAGGAA ACCCCTGGGA CAACCAAGCT ATGTTCGATT TCAGTGGGAA CGCTCTCCCA TCACTGAATG TTTTCAAACT GGTGAAAACA TCATCGCCAG TGGAGATTGC GATAAAAGAG ATCCTTCCTG TGGAGGTTAC AACCAACCTG GGAGAGGTTC CAAAATTTCC AGATGCTGTG AAAGTTCTGT TCAGCGACGA TTCCATCAGA TCTTTACCTG TCGAATGGAA CTTTGATTCT GCCCTTGTTG AAGAATCCGG TGTTTACAAA GTGGAAGGCT ACATTAAAGA CATTGACCGG AAAATTTTCG CGACACTCAC CGTGAAGGGT AGCAGAAACT ATCTGAAAAA TCCGGGCTTC GAAACAGGAG AATTTTCGCC TTGGCAGGTC TCGGGAGACA AAAAAGCGGT GAAAGTTGTA AAAGTCAATC CTTCAAGCAA TGCGCACCAG GGAGAGTACG CAGTGAATTT CTGGCTCGAT GAATCCTTCA GTTTCGAACT GTCACAAGAA GTGGAACTTC CAGCAGGTGT GTACAGAGTA GGGTTCTGGA CCCATGGAGA AAAAGGTGTG AAGATTGCTC TGAAGGTAAG TGATTACGGA GGAGATGAAC GATCTGTAGA AGTTGAAACA ACGGGCTGGC TCGAATGGAA GAACCCGGAG ATAAGGAACA TAAAAGTTGA AACAGGAAGA ATAAAGATTA CCGTTTCTGT CGAGGGAAGG GCAGGTGACT GGGGGTTCAT TGATGATTTC 
TATCTTTTCA GAGAAGAGTA A
|
Protein sequence | MRGVLFVLMI SSMAFGLIVN PVKNLCEDFI FGMDVSMLYE IEQLGGKYFE NGVEKDCLEI LKNHGINWIR LRVWNDPRDE NGNPLGGGNC DYLKMTEIAK RAKKLGMKVL LDFHYSDWWA DPGKQNKPKE WEYLHGELLE RAVYSYTKLV LNHMRRNGAL PDMVQVGNEV NNGFLWPDGK ISGEGAGGFD GFTRLLKAAI KAVREVDPDI KIVIHLAEGG NNSLFRWFFD EITRRNVDFD VIGVSYYPYW HGTLEDLKNN LYDIATRYNK DVLVVETAYA WTLEDGDGYP NIFNGEEMEL TGGYKATVQG QATFLRDLME VVNSVPNGHG LGIFYWEGDW IPVRGAGWKT GEGNPWDNQA MFDFSGNALP SLNVFKLVKT SSPVEIAIKE ILPVEVTTNL GEVPKFPDAV KVLFSDDSIR SLPVEWNFDS ALVEESGVYK VEGYIKDIDR KIFATLTVKG SRNYLKNPGF ETGEFSPWQV SGDKKAVKVV KVNPSSNAHQ GEYAVNFWLD ESFSFELSQE VELPAGVYRV GFWTHGEKGV KIALKVSDYG GDERSVEVET TGWLEWKNPE IRNIKVETGR IKITVSVEGR AGDWGFIDDF YLFREE
|
| |
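The sequence rows above are printed in space-separated blocks of 10 bases, so they need cleaning before any check such as GC content. The following is a minimal sketch, with hypothetical helper names `clean` and `gc_fraction`; it only inspects the first and last blocks of the gene sequence as printed, and demonstrates the GC calculation on a toy string rather than the full record. The stop codon set is the one used by translation table 11.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks that split a sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


# First and last blocks of the gene sequence, copied from the record above.
first_block = "ATGAGAGGGG"
last_blocks = "GAGAAGAGTA A"

# Stop codons in translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}

starts_with_atg = clean(first_block).startswith("ATG")
ends_with_stop = clean(last_blocks)[-3:] in STOP_CODONS

# Toy input (not from the record) to show how the blocks are handled.
toy_gc = gc_fraction("ATGC GGCC AT")

print(starts_with_atg, ends_with_stop, toy_gc)
```

To compare against the GC content row, pass the full gene sequence string to `gc_fraction` and multiply by 100.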