Gene TRQ2_1617 details

Gene Information

Locus tag: TRQ2_1617
Symbol:
ID: 6093066
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga sp. RQ2
Kingdom: Bacteria
Replicon accession: NC_010483
Strand:
Start bp: 1632491
End bp: 1634311
Gene Length: 1821 bp
Protein Length: 606 aa
Translation table: 11
GC content: 46%
IMG OID: 642488818
Product: arabinogalactan endo-1,4-beta-galactosidase
Protein accession: YP_001739636
Protein GI: 170289398
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3867] Arabinogalactan endo-1,4-beta-galactosidase
TIGRFAM ID:
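
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the unspliced CDS ends in a single stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1632491, 1634311)
print(length)                           # 1821
print(expected_protein_length(length))  # 606
```

Both results agree with the Gene Length and Protein Length fields of the record.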


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0588913
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAGGGG TGCTTTTCGT GCTGATGATT TCTTCCATGG CCTTTGGTTT GATAGTCAAC 
CCGGTGAAAA ACCTGTGCGA GGATTTCATC TTTGGAATGG ATGTTTCTAT GCTCTACGAG
ATCGAGCAAC TGGGTGGGAA ATATTTCGAG AATGGTGTGG AAAAAGATTG TCTTGAAATA
CTGAAGAATC ATGGAATAAA CTGGATCAGG TTGAGGGTGT GGAATGATCC GAGAGACGAG
AATGGAAATC CTCTCGGAGG AGGAAACTGC GATTACCTGA AGATGACAGA AATCGCTAAA
AGGGCAAAGA AACTCGGAAT GAAAGTGCTT CTTGACTTCC ATTACAGCGA CTGGTGGGCG
GATCCTGGAA AGCAGAACAA ACCAAAAGAG TGGGAATATC TTCATGGAGA ACTTCTGGAG
AGGGCGGTTT ACTCCTACAC GAAACTTGTA CTGAACCACA TGCGAAGAAA CGGGGCACTA
CCAGATATGG TCCAGGTGGG GAATGAGGTG AACAACGGTT TTCTCTGGCC TGACGGCAAG
ATTTCTGGAG AAGGTGCAGG TGGTTTCGAC GGATTCACAA GACTTTTGAA AGCTGCCATC
AAGGCCGTTA GAGAGGTTGA TCCGGATATA AAGATCGTTA TTCATCTGGC GGAAGGTGGA
AACAACTCTC TCTTCAGATG GTTCTTCGAC GAGATCACAA GAAGAAACGT GGACTTCGAT
GTAATAGGTG TATCTTACTA CCCGTACTGG CACGGAACCC TCGAGGATCT GAAAAACAAC
CTCTACGACA TAGCCACAAG ATATAACAAG GATGTGCTCG TTGTTGAAAC AGCTTACGCC
TGGACACTCG AGGATGGAGA TGGTTATCCC AACATCTTCA ATGGTGAAGA AATGGAACTA
ACAGGTGGCT ACAAGGCAAC CGTTCAGGGA CAGGCAACAT TTCTGAGAGA TCTCATGGAA
GTGGTAAACA GCGTTCCCAA CGGCCATGGA CTCGGGATTT TCTACTGGGA AGGAGATTGG
ATCCCTGTGA GGGGGGCTGG ATGGAAAACC GGAGAAGGAA ACCCCTGGGA CAACCAAGCT
ATGTTCGATT TCAGTGGGAA CGCTCTCCCA TCACTGAATG TTTTCAAACT GGTGAAAACA
TCATCGCCAG TGGAGATTGC GATAAAAGAG ATCCTTCCTG TGGAGGTTAC AACCAACCTG
GGAGAGGTTC CAAAATTTCC AGATGCTGTG AAAGTTCTGT TCAGCGACGA TTCCATCAGA
TCTTTACCTG TCGAATGGAA CTTTGATTCT GCCCTTGTTG AAGAATCCGG TGTTTACAAA
GTGGAAGGCT ACATTAAAGA CATTGACCGG AAAATTTTCG CGACACTCAC CGTGAAGGGT
AGCAGAAACT ATCTGAAAAA TCCGGGCTTC GAAACAGGAG AATTTTCGCC TTGGCAGGTC
TCGGGAGACA AAAAAGCGGT GAAAGTTGTA AAAGTCAATC CTTCAAGCAA TGCGCACCAG
GGAGAGTACG CAGTGAATTT CTGGCTCGAT GAATCCTTCA GTTTCGAACT GTCACAAGAA
GTGGAACTTC CAGCAGGTGT GTACAGAGTA GGGTTCTGGA CCCATGGAGA AAAAGGTGTG
AAGATTGCTC TGAAGGTAAG TGATTACGGA GGAGATGAAC GATCTGTAGA AGTTGAAACA
ACGGGCTGGC TCGAATGGAA GAACCCGGAG ATAAGGAACA TAAAAGTTGA AACAGGAAGA
ATAAAGATTA CCGTTTCTGT CGAGGGAAGG GCAGGTGACT GGGGGTTCAT TGATGATTTC
TATCTTTTCA GAGAAGAGTA A
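
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed; the helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as sample input.
first_line = "ATGAGAGGGG TGCTTTTCGT GCTGATGATT TCTTCCATGG CCTTTGGTTT GATAGTCAAC"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(f"GC fraction of this line: {gc_fraction(seq):.2f}")
```

Passing the full sequence text to `join_blocks` and `gc_fraction` in the same way gives a value that can be compared with the GC content field of the record.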
 
Protein sequence
MRGVLFVLMI SSMAFGLIVN PVKNLCEDFI FGMDVSMLYE IEQLGGKYFE NGVEKDCLEI 
LKNHGINWIR LRVWNDPRDE NGNPLGGGNC DYLKMTEIAK RAKKLGMKVL LDFHYSDWWA
DPGKQNKPKE WEYLHGELLE RAVYSYTKLV LNHMRRNGAL PDMVQVGNEV NNGFLWPDGK
ISGEGAGGFD GFTRLLKAAI KAVREVDPDI KIVIHLAEGG NNSLFRWFFD EITRRNVDFD
VIGVSYYPYW HGTLEDLKNN LYDIATRYNK DVLVVETAYA WTLEDGDGYP NIFNGEEMEL
TGGYKATVQG QATFLRDLME VVNSVPNGHG LGIFYWEGDW IPVRGAGWKT GEGNPWDNQA
MFDFSGNALP SLNVFKLVKT SSPVEIAIKE ILPVEVTTNL GEVPKFPDAV KVLFSDDSIR
SLPVEWNFDS ALVEESGVYK VEGYIKDIDR KIFATLTVKG SRNYLKNPGF ETGEFSPWQV
SGDKKAVKVV KVNPSSNAHQ GEYAVNFWLD ESFSFELSQE VELPAGVYRV GFWTHGEKGV
KIALKVSDYG GDERSVEVET TGWLEWKNPE IRNIKVETGR IKITVSVEGR AGDWGFIDDF
YLFREE
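
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in which codons may act as start codons). The `translate` helper is illustrative and is not part of any library.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence above.
print(translate("ATGAGAGGGG TGCTTTTCGT"))  # MRGVLF
```

The output matches the first six residues of the protein sequence listed above.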