Gene TRQ2_0664 details

Gene Information

Locus tag: TRQ2_0664
Symbol:
ID: 6092081
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga sp. RQ2
Kingdom: Bacteria
Replicon accession: NC_010483
Strand:
Start bp: 678826
End bp: 680256
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 48%
IMG OID: 642487850
Product: glycoside hydrolase family protein
Protein accession: YP_001738700
Protein GI: 170288462
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3507] Beta-xylosidase
TIGRFAM ID:
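
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the protein length excludes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming a complete CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(678826, 680256)
print(length)                  # 1431
print(protein_length(length))  # 476
```

Both values match the Gene Length and Protein Length fields of this record.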


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGGCAT TTTCACTGCT GGCCATTTTG CTTTTCAGCT TACTGAGCGG ATGCCTTACT 
TTGGGAAGCG AAGACCATGT ACCAAGTTTC AGATGGGCCA CAGTCCACGA TCCATCCGTT
ACGAAGGTTG GCGATACTTT CTATGTCTTT GGCTCACACC TTCAAATTGC CAAATCGAGC
GATCTGATGC ACTGGACACA AGTGAATGTA GGGGTTTATA ACAACAACCC TATAATCCCA
AATATATTCA CCGAGTTGAA AGAAACTTTC GAATGGGCTG AAACAAACAC TCTCTGGGCA
CCCCATGTGA TTCAGCTTCA AGATGGTAGG TACTACTTCT ACTACTGCGC GTGCAGAGGT
GACTCACCCC GATCTGCTAT GGGAATAGCA GTCTCTGATA GCATCGAAGG CCCTTACAGA
AACCTCGGGA TAATTCTGAG GTCTGGATAT CGCCCCGGAG AAGGAATGTG TGAAGAAGGA
GTACCATACG ATGCGAGGAT CCATCCAAAT GTTGTGGATC CGCATGTTTT TTACGACAAA
GAAGGTAACC TGTGGATGGT TTACGGGTCC TATTCCGGTG GCATCTACAT TCTAAAGCTC
GATCCAGAAA CCGGTTTTCC TCTTCCGGAA CAGGGATACG GAAAGAAACT CACAGGAGGA
AATCACAGCA GGATTGAGGG TCCTTTCATT TTCTACAGTC CTGATACAGG TTATTACTAT
CTCTTTCTGA GCTTTGGAGG GCTCGACTAC AGAGGAGGAT ACAACATTAG GGTCGCAAGG
TCCAAAAACC CTGACGGTCC TTACTATGAC GCGGAAGGCC ACAACATGAC AGATTGTTAC
GGCCCATCGT TCCTGGAAGG AAACGATCCC TACATAGCAC CGTTCGGTGT GAAACTGGTG
GGTAACTTCA CCTTGAGCGA AGGAAGTATC ATAGATTTTC GCGCGTTCGG ATACGTATCT
CCGGGGCACA ACTCCGCCTA TTACGATCCG GAGACGGGGA AGTATTTCAT CTTCTTCCAC
ACGAGGTTCC CCGGCAGGGG AGAGACGTAC CAGATCAGGG TCCACCAGCT CTTCCTCAAC
GAAGACGGTT GGTTCGTCAT GGCTCCCTTC CCCTACGCGG GTGAAACTAT TGAAGAACTG
CCTCTGCAGG AGGTGGTTGG GGAATATCAG CTGGTAATAC ACGACAAAGA GATGACGAAC
GAGATAAGGA AACCCGTGAG AATCGCTCTG AATCCGGACG GAACTGTTAC TGGAGCTCAA
ACTGGTGAAT GGGAAAAGAA GGGACATTAT ATAACTCTGA AACTCGATGG AGAGATCTAC
AAAGGGGTGG CTTTGAAGCA GTGGCACTAC TCCGAGAAAA AATGGGTGAC AGTGTTCTCT
GCTCTATCAC AGAAGGGAGT ATCAGTGTGG GGTATAAAAA CTTCTGAGTA G
 
Protein sequence
MRAFSLLAIL LFSLLSGCLT LGSEDHVPSF RWATVHDPSV TKVGDTFYVF GSHLQIAKSS 
DLMHWTQVNV GVYNNNPIIP NIFTELKETF EWAETNTLWA PHVIQLQDGR YYFYYCACRG
DSPRSAMGIA VSDSIEGPYR NLGIILRSGY RPGEGMCEEG VPYDARIHPN VVDPHVFYDK
EGNLWMVYGS YSGGIYILKL DPETGFPLPE QGYGKKLTGG NHSRIEGPFI FYSPDTGYYY
LFLSFGGLDY RGGYNIRVAR SKNPDGPYYD AEGHNMTDCY GPSFLEGNDP YIAPFGVKLV
GNFTLSEGSI IDFRAFGYVS PGHNSAYYDP ETGKYFIFFH TRFPGRGETY QIRVHQLFLN
EDGWFVMAPF PYAGETIEEL PLQEVVGEYQ LVIHDKEMTN EIRKPVRIAL NPDGTVTGAQ
TGEWEKKGHY ITLKLDGEIY KGVALKQWHY SEKKWVTVFS ALSQKGVSVW GIKTSE
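
The sequences above are printed in space-separated blocks of ten residues. The following is a minimal sketch of how that layout could be stripped and the GC content of a nucleotide sequence computed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the short input string is an illustrative example rather than data from this record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(block: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(block)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Illustrative input only (not taken from the record above).
example = "ATGC GGCC"
print(clean_sequence(example))  # ATGCGGCC
print(gc_percent(example))      # 75.0
```

Applying `gc_percent` to the full gene sequence gives a value that can be compared with the GC content field of the record.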