Gene TRQ2_0159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_0159 
Symbol 
ID6091561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp152318 
End bp154231 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content48% 
IMG OID642487340 
Producthypothetical protein 
Protein accessionYP_001738203 
Protein GI170287965 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00185609 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTTTTGA GAGAGATAAA CCGATACTGC AAAGAAAAAG CCACCGGAAA GAGAATCTAC 
GCAGTTCCAA AGCTGTGGAT ACCGAGTTTC TTCAAAAAGT TCGACGAAAA ATCCGGCAGG
TGCTTCGTCG ATCCTTACGA ACTCGGAGCC GAGATCACCG ACTGGATTTT GAATCAGTCC
AGAGAGCAGG ATTATTCCCA GCCTATTTCA TTTCTTAAGG GCGAGAAAAC ACCGGACTGG
ATTAAGCGTT CCGTTGTTTA TGGATCCCTC CCCAGGACCA CCACGGCGTA CAATCACAAA
GGCTCTGGAT ACTACGAAGA GAACGACGTT CTTGGTTTCA GAGAAGCGGG AACGTTCTTC
AAGATGATGC TGCTTCTTCC GTTCATCAAA AGTCTCGGTG CGGACGCTAT CTATTTACTT
CCCGTGAGTA GAATGAGCGA TCTCTTCAAG AAGGGAGACG CTCCATCACC GTACTCCGTG
AAGAATCCAA TGGAGCTCGA TGAGAGGTAC CACGATCCGC TTCTCGAACC TTTCAAGGTG
GATGAAGAGT TCAAGGCCTT TGTGGAAGCG TGTCACATCC TCGGAATCAG AGTGATTCTC
GATTTCATTC CAAGAACGGC TTCCAGAGAC TCTGATCTCA TAAGAGAACA TCCGGACTGG
TTCTACTGGA TAAAGGTGGA GGAACTTGCA GATTACACTC CTCCAAGGGC CGAGGAACTT
CCGTTCAAGG TGCCGGATGA GGATGAACTC GAGATCATAT ACAGCAAAGA AAATGTGAAA
AGACACCTCA AAAAGTTCAC ACTTCCTCCG AATCTGATCG ACCCTCAAAA GTGGGAGAAA
ATAAAAAGAG AAGAGGGGAA CATTCTGGAG TTGATTGTGA AAGAATTTGG AATCATCACT
CCTCCAGGAT TTTCCGATTT GATCAACGAT CCACAACCTA CATGGGATGA TGTCACGTTT
TTGAGGTTGT ACTTGGATCA CCCGGAGGCT TCGAAAAGAT TTCTCGAGCC GAACCAGCCT
CCCTACGTTC TCTACGACGT AATAAAGGCG AGCAAATTTC CTGGAAAAGA GCCGAACAGA
GAGCTCTGGG AGTACCTCGC GGGCGTGATA CCACATTACC AGAAAAAATA CGGAATAGAC
GGTGCAAGAC TCGATATGGG GCACGCACTT CCCAAAGAAC TTCTTGACCT CATAATAAAG
AACGTGAAGG AGTACGATCC CGCATTTGCG ATGATCGCAG AGGAGCTGGA CATGGGGAAG
GACAAAGTAT CGAAGGAAGC GGGATATGAC GTGATCCTGG GAAGTAGCTG GTACTTTGCG
GGAAGAGTGG AGGAAATAGG AAAACTCCCT GAAATCGCCG AAAAGCTCGT TCTTCCTTTC
CTCGCCTCCG TTGAGACTCC CGACACACCG CGCATTGCCA CAAGAAAGTA CGCTTCCAAG
ATGAAAAAAC TGGCACCGTT TGTAACCTAC TTTCTACCGA ACTCTATTCC CTATGTGAAC
ACGGGACAGG AGATTGGAGA GAAACAGCCC ATGAACCTGG GGCTGGACAC GGATCCAAAC
CTGAGAAAAG TCCTCTCCCC AACCGACGAG TTTTTCGGGA AACTCGCATT TTTCGACCAC
TACGTTCTCC ACTGGGACAG CCCGGACAGA GGAATCTTGA GCTTCATCAA AAAACTGATA
AAGGTGCGCC AGCAGTTCCT CGATTTTGTC CTCAACGGAA AGTTTGAAAA CCTCACAACG
GAAGATCTCG TCATGTACTC TTACGAGAGA AACGGACAAA AGATCATCGT CGCCGCAAAT
GTTGGAAAAG AGCCAAAAGA GATCACCGGC GGAAGGGTTT GGAACGGAAA GTGGAGTGAT
GAAGAGAAGG TAGTCCTCAA ACCCCTTGAT TTTGTTCTTG TTGTACAGGA GTGA
 
Protein sequence
MLLREINRYC KEKATGKRIY AVPKLWIPSF FKKFDEKSGR CFVDPYELGA EITDWILNQS 
REQDYSQPIS FLKGEKTPDW IKRSVVYGSL PRTTTAYNHK GSGYYEENDV LGFREAGTFF
KMMLLLPFIK SLGADAIYLL PVSRMSDLFK KGDAPSPYSV KNPMELDERY HDPLLEPFKV
DEEFKAFVEA CHILGIRVIL DFIPRTASRD SDLIREHPDW FYWIKVEELA DYTPPRAEEL
PFKVPDEDEL EIIYSKENVK RHLKKFTLPP NLIDPQKWEK IKREEGNILE LIVKEFGIIT
PPGFSDLIND PQPTWDDVTF LRLYLDHPEA SKRFLEPNQP PYVLYDVIKA SKFPGKEPNR
ELWEYLAGVI PHYQKKYGID GARLDMGHAL PKELLDLIIK NVKEYDPAFA MIAEELDMGK
DKVSKEAGYD VILGSSWYFA GRVEEIGKLP EIAEKLVLPF LASVETPDTP RIATRKYASK
MKKLAPFVTY FLPNSIPYVN TGQEIGEKQP MNLGLDTDPN LRKVLSPTDE FFGKLAFFDH
YVLHWDSPDR GILSFIKKLI KVRQQFLDFV LNGKFENLTT EDLVMYSYER NGQKIIVAAN
VGKEPKEITG GRVWNGKWSD EEKVVLKPLD FVLVVQE