Gene TRQ2_0386 details

Gene Information

Locus tag: TRQ2_0386
Symbol:
ID: 6091791
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga sp. RQ2
Kingdom: Bacteria
Replicon accession: NC_010483
Strand:
Start bp: 374006
End bp: 375670
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 52%
IMG OID: 642487564
Product: dihydroxy-acid dehydratase
Protein accession: YP_001738425
Protein GI: 170288187
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
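
The coordinates and lengths in this record are mutually consistent, and a short check makes the arithmetic explicit. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions about this record's conventions, and the variable names are illustrative only.

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
start_bp = 374006
end_bp = 375670

# Inclusive span: both the first and the last base are counted.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be the stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp,", protein_length, "aa")
```

Under these assumptions the span is 1665 bp, that is 555 codons, of which 554 encode amino acids, matching the Gene Length and Protein Length fields.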


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGAGTG ATGTGATAAA AAAGGGCCTC GAAAGGGCTC CTCATAGATC ACTTTTGAAG 
GCACTCGGAA TAACGGACGA CGAAATGCGA AGGCCTTTCA TCGGCATAGT GTCCTCGTGG
AACGAGATCA TTCCCGGCCA TGTCCACCTT GACAAGGTTG TCGAGGCGGT GAAAGCCGGT
GTGAGAATGG CCGGAGGAGT TCCTTTCGTC TTTCCAACGA TCGGGATCTG TGACGGAATA
GCCATGGATC ACAGGGGAAT GAAGTTTTCC TTGCCCTCGA GGGAACTCAT AGCGGACTCC
ATAGAGATCG TTGCAAGCGG TTTCCCCTTC GATGGTTTGG TCTTCGTCCC CAACTGCGAC
AAGATCACAC CCGGCATGAT GATGGCCATG GGAAGATTGA ACATCCCGTC CGTTCTGATA
TCCGGCGGTC CCATGCTCGC AGGTCGCTAC AACGGCAGAG ACATCGATCT CATCACCGTC
TTCGAAGCGG TTGGTGGATA CAAAGTGGGA AAAGTCGATG AAGAAACGCT CAAAGCGATA
GAAGATCTCG CGTGCCCGGG TGCCGGTTCG TGTGCTGGAT TGTTCACCGC GAACACGATG
AACTCTCTGG CGGAAGCTCT CGGAATCGCA CCGAGGGGGA ATGGGACTGT ACCGGCCGTA
CATGCGAAGA GGTTGAGAAT GGCGAAAGAA GCGGGAATAC TCGTTGTGGA ACTCGTGAAA
AGAGATATAA AACCAAGAGA TATCGTCACT CTGGACTCTT TCATGAACGC TGTCATGGTG
GATCTTGCAA CGGGAGGATC CACGAACACA GTTCTGCATT TGAAGGCGAT AGCCGAGAGT
TTTGGAATAG ATTTCGATAT AAAGCTCTTT GACGAACTCA GCAGGAAGAT TCCTCACATC
TGCAACATCT CTCCCGTTGG TCCGTACCAC ATCCAGGATC TCGACGATGC TGGTGGTATC
TACGCTGTGA TGAAACGTCT CCAGGAAAAT GGTCTTTTGA AGGAAGACGT CATGACCATC
TATTTGAGAA AGATTGGAGA TCTCGTCAGA GAAGCTAAGA TCCTGAATGA AGATGTGATC
AGGCCCTTCG ATAATCCGTA CCACAAAGAG GGCGGGCTCG GTATCCTCTT CGGGAACCTC
GCTCCGGAAG GAGCGGTTGC CAAACTCTCC GGTGTTCCCG AGAAGATGAT GCACCACGTT
GGCCCGGCCG TCGTCTTTGA AGACGGAGAA GAGGCGACAA AAGCCATTCT ATCTGGAAAG
ATCAAAAAAG GAGACGTGGT TGTGATTCGC TACGAAGGAC CGAAGGGTGG TCCCGGGATG
AGGGAGATGC TCTCACCCAC CTCCGCTATC GTGGGGATGG GCCTTGCGGA GGACGTGGCT
CTCATCACAG ACGGTAGGTT CTCGGGTGGA TCGCACGGTG CCGTGATAGG TCACGTTTCT
CCAGAAGCGG CAGAAGGCGG TCCTATAGGT ATCGTGAAAG ACGGGGACCT CATCGAGATA
GATTTTGAAA AGAGAACCCT GAATCTCTTG ATCTCAGACG AAGAGTTCGA AAGAAGAATG
AAAGAGTTCA CGCCTCTGGT GAAAGAAGTG GACAGCGATT ACCTGAGAAG GTACGCGTTC
TTCGTGCAGT CGGCGAGCAA GGGGGCAACC TTCAGGAAGC CCTGA
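
A GC content figure such as the one reported for this gene is the share of G and C bases in the sequence. The sketch below shows one way to compute it from a sequence laid out in space-separated blocks like the one above. The function name `gc_percent` is hypothetical, and how the database rounds the value is not stated in the record. Only the first line of the gene sequence is used here as sample input, so the printed value describes that fragment, not the whole gene.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)

# First line of the gene sequence, with its block spacing left intact.
fragment = "ATGAGGAGTG ATGTGATAAA AAAGGGCCTC GAAAGGGCTC CTCATAGATC ACTTTTGAAG"
print(f"{gc_percent(fragment):.1f}% GC in the first 60 bases")
```

Passing the full 1665 bp sequence instead of the fragment gives the gene-wide value to compare against the GC content field.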
 
Protein sequence
MRSDVIKKGL ERAPHRSLLK ALGITDDEMR RPFIGIVSSW NEIIPGHVHL DKVVEAVKAG 
VRMAGGVPFV FPTIGICDGI AMDHRGMKFS LPSRELIADS IEIVASGFPF DGLVFVPNCD
KITPGMMMAM GRLNIPSVLI SGGPMLAGRY NGRDIDLITV FEAVGGYKVG KVDEETLKAI
EDLACPGAGS CAGLFTANTM NSLAEALGIA PRGNGTVPAV HAKRLRMAKE AGILVVELVK
RDIKPRDIVT LDSFMNAVMV DLATGGSTNT VLHLKAIAES FGIDFDIKLF DELSRKIPHI
CNISPVGPYH IQDLDDAGGI YAVMKRLQEN GLLKEDVMTI YLRKIGDLVR EAKILNEDVI
RPFDNPYHKE GGLGILFGNL APEGAVAKLS GVPEKMMHHV GPAVVFEDGE EATKAILSGK
IKKGDVVVIR YEGPKGGPGM REMLSPTSAI VGMGLAEDVA LITDGRFSGG SHGAVIGHVS
PEAAEGGPIG IVKDGDLIEI DFEKRTLNLL ISDEEFERRM KEFTPLVKEV DSDYLRRYAF
FVQSASKGAT FRKP
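
The protein sequence can be checked against the gene sequence by translating the latter. The sketch below is a minimal illustration and not the database's own pipeline. It builds a codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, not in codon meanings), and translates the first 30 and last 15 bases of the gene. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid
# assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

head = translate("ATGAGGAGTG ATGTGATAAA AAAGGGCCTC")  # first 30 bases
tail = translate("TTCAGGAAGC CCTGA")                  # last 15 bases
print(head, tail)
```

The first ten residues come out as MRSDVIKKGL and the last five codons as FRKP followed by a stop, in agreement with the start and end of the protein sequence listed above.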