Gene TRQ2_0120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_0120 
Symbol 
ID6091522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp115326 
End bp118523 
Gene Length3198 bp 
Protein Length1065 aa 
Translation table11 
GC content44% 
IMG OID642487302 
ProductO-antigen polymerase 
Protein accessionYP_001738165 
Protein GI170287927 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID[TIGR01131] ATP synthase subunit 6 (eukaryotes),also subunit A (prokaryotes) 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGATTC TGTTTTACAT CATGTACGTT GTCGTTGCCC TTTTCTCGTC CAGGAAACTC 
ACGTACGAGT TCAGCGTTCC AAAGTACGCT TTGATCACGG TGTTTCTGAG CGTCATGTTC
TTCATGATCC TGATGAAGAT TCTCAGGAAG GAGAAACTGG AAATCAGATT CAACATGGCC
CACGTGGCTT TTTTTGCCTT CGCTATCTCC GCTCTTTTAT CTACGATAAA TGTCTATCGC
GATAATCCTG TCTATTTCAG ATATTCTTTT GACATAGCGA TCTACGTTCT TCTCATGTTT
TTCACATCGA TCTTCATCTC GAACTTCTTT GTCACAAAAG AAAGAATAAA AAGGTTTCTC
ACGGTCAGTG TGGCTCTTGC GGGATTCATA GCTTTCGACG CGCTTTTGAA CTTCTACGGC
GGTGTCGATG TTTTTCTTGG AAGTGTGGGG AGTCCCTTTT CGAGGGCAAC CGTGAAGGCG
ACCGTAGGAA ACGTGAACTT CGTTTCGAAC TTTCTCTCCC TGAACCTTCC GCTCGCTATC
TATCTCATAG CGTCAGCCGA TTTCAAAAGA AAAGAAGCGT CTGTGATCAA GGTGATAGCC
TCAGTTTCTG CGCTTCTCAT AGTTTCGGGA ATACTGGTGT CTCAGACGAG ATCACTGTAC
GTTGCGAACA TCATCTCCCT CTGTATTTTC TCTGTGTTCT ACATGATCTT CAGAAAAAAG
AAGGTTTCGA AAGAGACAGA CAGAGAAGTT CTGAGTATGA GTAAAGCTCT CACGACTTTC
GTTCTGATCT CTTCGATCGT CCTCGTTGTT CTCTACAATC TTCCCACTCC TCTGAACGGT
TACGGAATGG TATCTCCCGC AGGAAGAATT CAGGCGGTTG CCGAGGTGAG TTCCTGGCAC
GAGAGGCTTC TGTCCTGGTT CTCATCGATA TACCAGTGGA GAACACACAA GATTCTCGGT
ACCGGAATCG GAACGTATCA GATTCTCACG ATAAACTACA TGGGAAATGT TATAGAAGAC
CACCCGATTC TGATGTACGG CTGGAACAAC TTCAAGAGAA CGCACAACGA TTACTTTCAA
GTTCTTGGAG AAATGGGAAT TTTAGGATTT GCTTCTGTTG TTTTTCTCGC TCTGTCGCTT
GGGATACTGT TCTTCAAAAT AATTCGAAGG ATCACGGTGC GGGACGATCT CTTGCTCTTT
CTGGCACTTG CCAGTTCTTT CATCACGTTC ATGATGCACA GCGCGTTCAG CTTTCCGGCG
CATCTACTTC CAAACGGTTT TCTTGTGATG ACAATTGCTT CCATAGCGGT TGGTGGGTAC
TTCTACGGCG GAAAGAAAGC GGAGATTCAA CGAAAGAAGG CCGTTGTGTT TGGAACGATC
GTTCTTCTGG TTGGCGTCGT TTCCGCTTAC TTGAAATGGA ACTACTTCAT TTCTGAAGTT
TATTTCAAAT GGGGAAATTC TGCCTATCTT TCTATAAGGA AAGTGGAAGA AGACATGGCC
AAACTGGACA ACTACGAACA GCAGGTAAAG ACAGCCATGG AAGAGCTGAG CTCTCTGAGT
GGACGTTACA GTTACCTGAA ACCCGACGAA TTCAAGAAGT TCGTTGAAAG TCAGAACCTT
TCGATGAAGC CTTCAAGCTC AGAGGTAGAA AGATTGAGGC TCGAGACCAT TCAAAAAGAA
AGGCAGAAGT TGCAGAACAT GCTTCAGCAG ATAGCATCTT ATAGGGATCA GCTCACAAAT
CAAAGAGTAG AGCTTTATAA TAAAGCGAAA GAGTACTTCT TGAAATCCGT TCAAGTGAAC
AAAACGTACG GAAGATCTTA TTTCTACCTT GCATCGCTTG CAACAAGCGA GTACCGAATC
GACGAATTGA AAGCGAAGCT GAAGACAAAA GAGGATTACA AAGCGTTTTT TGAACAGAAC
TTTGATGATT ACCAGAAGGT GATCTTTCCG GATGTGAAGA AGACCGATCT GACGTTTCTT
GAGAACGCGA GTCTTTCCAC AATCAACGAT CTGGGAGAGG ACAATCTCAT CACCGCTCAG
GTCCTCCTCG ATTCCGTTTC TCTGTACCTT TCCTCTCTGA AGTCTTTCAA CGAGAGGAAC
ACATACAGAG GCCTTGCAAC AAGGTATGTG GGGCTTCACC AGATCATGAA GATTTTGTTC
TCGAAAGCCC AGGACGATCT GATACAGAAA GCCTTCGCTG AGTTGACATC AAGGTATTTC
GATAGTTTCA CGAATTACGC GAAGCTCACG GTGAAGATTC TTCCGGGTGC GTGGAACAGA
TTTCCTGACT GGAAAAACTA CGATCTCAGG AAAGCCGTTG CGGGGCAGGA TATCTACAGA
TATTTCGCTA CGAAAGCTGC GGAAGCTCAA CCGCTCACCG TTCAGAAGAA CAGAGAGTTT
CTATTCTACC TTGCCGAAAA GGAAATCTGG GCAGTTGAAA GCATGAGCGA AGCGGGTGTC
TGGGGAGTTC CAGATGGTGT TCTCGATTTT CTACATGCGA TGCCTTTCGA ATACGCCTCT
TCGAACAATC GTCAGGAGGC CTTGTTCGTA TCCGAAGACG TCCTGAAGAT TTACAATGAG
AGCTACAGAA ACACAAAAGA ATCCATATCG ATGTACGAAA GTAGAGCAGA AAAGACATTT
GAAAGCGTTC TGGAGGGGCT GAAGGAATAC ATCGAAAGTA ATCTGGGAAA AGAGTACGCG
AACCAATTCG AAAAACTCTT CAAGGACCTT TTTGAAAGCT TCAAGAATCT GAACTGGCTG
TCCGTCAACG TTCAGGAAAT GAACAAATTC ATATCAGAGA AGAACTACAC GTACAAGTTG
AATCCATGGG CAGAACTTCT CGTAGAAAGG ATGAAAAACT TCGAAAGCTA TATGAAGCAA
AGAAATGTTG AAGCTTCCAG AATCAGCGAA GTGCTTTCCA GGATCTACAA CGATCTGTAC
GAACTCAGAG ATGTTCTTGT TTTCGAAAGA TACATAAGGT TCCTCGAACA CTACAGGCTC
ATTCTGAACG ATGTAAAGAA TTATCTGAAC ACTCTGAAAA GAGCCTACAG CGTGGCTTCG
GACGATGAGT GGAAGATGAT ACTGGAAGAC TGGAGTGTGA ACCTCTGGAA CGATGAAAAG
TTCGAAACGA AAGATCAAGT GATGGAACGA TTGAAGAAAT CAGAAGAATT TCTGAAAACG
ATAGAAGAAA GCTTGTAG
 
Protein sequence
MEILFYIMYV VVALFSSRKL TYEFSVPKYA LITVFLSVMF FMILMKILRK EKLEIRFNMA 
HVAFFAFAIS ALLSTINVYR DNPVYFRYSF DIAIYVLLMF FTSIFISNFF VTKERIKRFL
TVSVALAGFI AFDALLNFYG GVDVFLGSVG SPFSRATVKA TVGNVNFVSN FLSLNLPLAI
YLIASADFKR KEASVIKVIA SVSALLIVSG ILVSQTRSLY VANIISLCIF SVFYMIFRKK
KVSKETDREV LSMSKALTTF VLISSIVLVV LYNLPTPLNG YGMVSPAGRI QAVAEVSSWH
ERLLSWFSSI YQWRTHKILG TGIGTYQILT INYMGNVIED HPILMYGWNN FKRTHNDYFQ
VLGEMGILGF ASVVFLALSL GILFFKIIRR ITVRDDLLLF LALASSFITF MMHSAFSFPA
HLLPNGFLVM TIASIAVGGY FYGGKKAEIQ RKKAVVFGTI VLLVGVVSAY LKWNYFISEV
YFKWGNSAYL SIRKVEEDMA KLDNYEQQVK TAMEELSSLS GRYSYLKPDE FKKFVESQNL
SMKPSSSEVE RLRLETIQKE RQKLQNMLQQ IASYRDQLTN QRVELYNKAK EYFLKSVQVN
KTYGRSYFYL ASLATSEYRI DELKAKLKTK EDYKAFFEQN FDDYQKVIFP DVKKTDLTFL
ENASLSTIND LGEDNLITAQ VLLDSVSLYL SSLKSFNERN TYRGLATRYV GLHQIMKILF
SKAQDDLIQK AFAELTSRYF DSFTNYAKLT VKILPGAWNR FPDWKNYDLR KAVAGQDIYR
YFATKAAEAQ PLTVQKNREF LFYLAEKEIW AVESMSEAGV WGVPDGVLDF LHAMPFEYAS
SNNRQEALFV SEDVLKIYNE SYRNTKESIS MYESRAEKTF ESVLEGLKEY IESNLGKEYA
NQFEKLFKDL FESFKNLNWL SVNVQEMNKF ISEKNYTYKL NPWAELLVER MKNFESYMKQ
RNVEASRISE VLSRIYNDLY ELRDVLVFER YIRFLEHYRL ILNDVKNYLN TLKRAYSVAS
DDEWKMILED WSVNLWNDEK FETKDQVMER LKKSEEFLKT IEESL