Gene TRQ2_0969 details

Gene Information

Locus tag: TRQ2_0969
Symbol:
ID: 6092399
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga sp. RQ2
Kingdom: Bacteria
Replicon accession: NC_010483
Strand:
Start bp: 1006135
End bp: 1007475
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 46%
IMG OID: 642488165
Product: beta-galactosidase
Protein accession: YP_001739002
Protein GI: 170288764
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID: [TIGR03356] beta-galactosidase
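
The coordinate and length fields above agree with each other, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 1006135
END_BP = 1007475


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1341
print(protein_length_aa(length))  # 446
```

Both results match the Gene Length and Protein Length fields reported in the record.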


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00738034
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGTGA AAAAGTTCCC TGAAGGATTC CTCTGGGGTG TTGCAACAGC TTCCTACCAG 
ATCGAGGGTT CTCCCCTCGC AGACGGAGCT GGTATGTCTA TCTGGCACAC CTTCTCCCAT
ACTCCTGGAA ATGTAAAGAA CGGTGACACG GGAGATGTGG CCTGCGACCA CTACAACAGA
TGGAAAGAGG ACATTGAAAT CATAGAGAAA CTCGGAGTAA AGGCTTACAG ATTTTCAATC
AGCTGGCCAA GAATACTTCC GGAAGGAACA GGAAGGGTGA ATCAGAAAGG ACTGGATTTT
TACAACAGGA TCATAGACAC CCTGCTGGAA AAAGGTATCA CACCCTTTGT GACCATCTAT
CACTGGGATC TTCCCTTCGC TCTTCAGTTG AAAGGAGGAT GGGCGAACAG AGAAATAGCG
GATTGGTTCG CAGAATACTC AAGGGTTCTC TTTGAAAATT TCGGCGACCG TGTGAAGAAC
TGGATCACCT TGAACGAACC GTGGGTTGTT GCCATAGTGG GGCATCTGTA CGGAGTCCAC
GCTCCTGGAA TGAGAGATAT TTACGTGGCT TTCCGAGCTG TTCACAATCT CTTGAGGGCA
CACGCCAAAG CGGTGAAAGT GTTCAGGGAA ACTGTGAAAG ATGGAAAGAT CGGAATAGTT
TTCAACAATG GATATTTCGA ACCTGCGAGT GAAAAAGAGG AGGACATCAG AGCGGCGAGA
TTCATGCATC AGTTCAACAA CTATCCTCTC TTTCTCAATC CGATCTACAG AGGAGATTAT
CCGGAGCTCG TTCTGGAATT TGCCAGAGAG TATCTACCGG AGAATTACAA AGATGACATG
TCCGAGATAC AGGAAAAGAT CGACTTTGTT GGATTGAACT ATTACTCCGG TCATTTGGTG
AAGTTCGATC CAGATGCACC AGCTAAGGTC TCTTTCGTTG AAAGGGATCT TCCAAAAACA
GCCATGGGAT GGGAGATCGT TCCAGAAGGA ATCTACTGGA TCCTGAAGAA GGTGAAAGAA
GAATACAACC CACCAGAGGT TTACATCACA GAGAATGGGG CTGCTTTTGA CGACGTAGTT
AGTGAAGATG GAAGAGTTCA CGATCAAAAC AGAATCGATT ATTTGAAGGC CCACATTGGT
CAGGCATGGA AGGCCATACA GGAGGGAGTG CCGCTTAAAG GTTACTTCGT CTGGTCGCTC
CTCGACAATT TCGAATGGGC AGAGGGATAC TCTAAGAGAT TTGGTATTGT GTACGTGGAC
TACAGTACTC AAAAACGCAT CATAAAAGAC AGTGGGTACT GGTACTCGAA TGTGGTTAAA
AACAACGGTC TGGAAGACTG A
 
Protein sequence
MNVKKFPEGF LWGVATASYQ IEGSPLADGA GMSIWHTFSH TPGNVKNGDT GDVACDHYNR 
WKEDIEIIEK LGVKAYRFSI SWPRILPEGT GRVNQKGLDF YNRIIDTLLE KGITPFVTIY
HWDLPFALQL KGGWANREIA DWFAEYSRVL FENFGDRVKN WITLNEPWVV AIVGHLYGVH
APGMRDIYVA FRAVHNLLRA HAKAVKVFRE TVKDGKIGIV FNNGYFEPAS EKEEDIRAAR
FMHQFNNYPL FLNPIYRGDY PELVLEFARE YLPENYKDDM SEIQEKIDFV GLNYYSGHLV
KFDPDAPAKV SFVERDLPKT AMGWEIVPEG IYWILKKVKE EYNPPEVYIT ENGAAFDDVV
SEDGRVHDQN RIDYLKAHIG QAWKAIQEGV PLKGYFVWSL LDNFEWAEGY SKRFGIVYVD
YSTQKRIIKD SGYWYSNVVK NNGLED
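
The relationship between the gene sequence and the protein sequence can be spot-checked with a minimal sketch. It uses the standard codon assignments, which translation table 11 shares (table 11 differs from the standard table only in which start codons it permits). The helper names (clean, translate, gc_percent) are hypothetical, and only the first and last blocks of the gene sequence are used here; the full sequence can be passed in the same way.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to display the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three and last three blocks of the gene sequence above.
first_blocks = "ATGAACGTGA AAAAGTTCCC TGAAGGATTC"
last_blocks = "AACAACGGTC TGGAAGACTG A"

print(translate(first_blocks))  # MNVKKFPEGF
print(translate(last_blocks))   # NNGLED
```

The two printed fragments match the start and end of the protein sequence above. Calling gc_percent on the full gene sequence gives a value to compare against the GC content field.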