Gene TRQ2_0139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_0139 
Symbol 
ID6091541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp134832 
End bp136106 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content52% 
IMG OID642487320 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001738183 
Protein GI170287945 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCAGA TGGAAATGGC CAGAAAGGGT GTTGTTTCCG ACGAGATGAA AAAGGTGGCG 
GAGTACGAGG GAGTGGATGT CGAGATCGTC AGGCAAAAAC TTGCGGAAGG CAGAGCGGTT
CTTCCAAAGA ACAAACTCCA CAGGATAGAA AGGCCAATGA TCGTTGGAGA AGGTTTCAGT
GTGAAGGTGA ACGCGAACAT AGGAACCTCC CAGGGATTTT CTTCGCTCGA AGAGGAAAAG
GAAAAGGCAA GGGTAGCGAT AGAATACGGT GCTGACTCCC TCATGGTTCT TTCCACGTGG
GGAGACCTGA GGGAGATCAG AAGGGCCATC GTGGAGATGT CGCCCGTTCC AGTTGGTTCG
GTGCCCATAT ACGATTCCGC CGTGAGGAGT TACCAGATGA AAAAGAACGT GGTGGATTTT
TCGGAGAAGG ACTTCTTCGA TATGGTCATA GCACACGCGG AAGATGGCAT AGACTTTATG
ACGATCCACG TCGGTGTGAC GAGAAGGGTG CTTGATAGGA TAAAAAGTTC AAGGCGGGTT
TTGAAGATCG TGAGCAGAGG AGGAGCGATC ATCGCGGGAT GGATGATAAA GAACAACAGG
GAAAATCCGT TCTACGAACA CTTCGATGAA CTCTTGGACA TTGCAAAAGA CTACGATATC
ACTCTGAGTC TTGGCGACGG CATGAGACCC GGAGCTGTGG TGGATGCGAG CGACGCCCAG
CAGTTCGAAG AGCTGTTCGT GATGGGGGAA CTCGTGGAGA AAGCGAGGGA AAAAGGGGTC
CAGGTGATGC TGGAAGGGCC GGGGCACGTT CCACTGAACG AGGTGGAGAT GAACGTGAGG
CTCATGAAAA AGATCGGAAA AGGAGCCCCC ATCTTCCTTC TGGGACCTCT TCCAACGGAC
AGAGCCATGG GCTACGATCA CATAGCCTGC GCGATAGGTG GTGCGCTGGC TGGCTACTAC
GGAGCCGATT TCCTCTGTTA TGTAACTCCT TCAGAGCACA TCTCGCTTCC GGATGTTGAA
GACGTGAGAG AAGGTGTGAT AGCCTCTAAG ATAGCGGCTA TTGTCGCGGA TGTGGCGCGC
GGAAACAAAA AAGCCTGGGA GCTTGAGAAA AAGATGGCCC TCGCAAGAAA GAACTTCGAC
TGGGAGACGA TGTTCAGCCT TTCGCTGGGA AAGGACGTTG CGAAGAAGAA ATACGAGGAA
AGACCGTACC CCGACAAAGG CTGTTCTATG TGTGGACCAT TCTGTGCGAT AAAGATAGCG
GAGGAGTTCT CTTGA
 
Protein sequence
MTQMEMARKG VVSDEMKKVA EYEGVDVEIV RQKLAEGRAV LPKNKLHRIE RPMIVGEGFS 
VKVNANIGTS QGFSSLEEEK EKARVAIEYG ADSLMVLSTW GDLREIRRAI VEMSPVPVGS
VPIYDSAVRS YQMKKNVVDF SEKDFFDMVI AHAEDGIDFM TIHVGVTRRV LDRIKSSRRV
LKIVSRGGAI IAGWMIKNNR ENPFYEHFDE LLDIAKDYDI TLSLGDGMRP GAVVDASDAQ
QFEELFVMGE LVEKAREKGV QVMLEGPGHV PLNEVEMNVR LMKKIGKGAP IFLLGPLPTD
RAMGYDHIAC AIGGALAGYY GADFLCYVTP SEHISLPDVE DVREGVIASK IAAIVADVAR
GNKKAWELEK KMALARKNFD WETMFSLSLG KDVAKKKYEE RPYPDKGCSM CGPFCAIKIA
EEFS