Gene TRQ2_1619 details


Gene Information

Locus tag: TRQ2_1619
Symbol:
ID: 6093068
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga sp. RQ2
Kingdom: Bacteria
Replicon accession: NC_010483
Strand:
Start bp: 1635482
End bp: 1637383
Gene Length: 1902 bp
Protein Length: 633 aa
Translation table: 11
GC content: 47%
IMG OID: 642488820
Product: extracellular solute-binding protein
Protein accession: YP_001739638
Protein GI: 170289400
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
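
The length fields above agree with the coordinates if Start bp and End bp are read as 1-based inclusive positions and the CDS is taken to include its stop codon. Both readings are assumptions, since the record does not state them. A minimal sketch of that arithmetic, with illustrative function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1635482, 1637383)
print(length)                  # 1902
print(protein_length(length))  # 633
```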


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00276535
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGG TTCTTGTGTT CGTGTTTTTA GTCCTCAGCG CTATTTCAGC GGTAATGGCT 
CAGATGCTAC CACCTGGTAT CCCTAGGGAA AAGACGTTGA TACTGCCTTT CCTCTTTGCA
CCACTTCCCG CACCAGGTAA CTGGAACCTC TGGGCAGGAT GGAGAGCTCA AAACTGCGGT
CTTCACCAGT TCGTCACCGA ACCTCTCTGG ACCATCAACC CCAACCCTGA GGAAGGCGGG
ATCATCAACG CACTCGCTGC GGAGCCTCCT ATCTACAACG AAGACTTCAC AAAGCTTACG
ATCAAACTCA GAAAAGGAAT TTACTGGAGC GACGGGGTTG AATTCACAGC GGACGACTTT
GTGTTCACGA TTAAAACGGT GAAAGACACA CCCGGTCTGG ATTATCACGG CCCGATGCAA
GATGTGAAAG ATGTCTACGC CCTTGACAAG TACACGGTCG TTGTGGAACT TGAGAGACCA
AACAGTAGGT TCCATGCCTA CTTTGTTGAA AGATGGAATG CATTAAGACC TATGCCAAAA
CACATCTTCG AAAAGGTAAA AGATGTGGTA TCCTTCGACT TCAACCCACC TGTAAGCTTA
GGACCATACG TTTTGAAAGA TTATGACCCC GCAGGATACT GGGTGCTCTG GGAGAAGAGA
AAAGACTGGC AGAGAACAGT CACTGGTCAA CTCTTCGGTG AACCTGTTCC TGAGTACGTC
CTCTTCATAA ACTACGGTAC TCCTGAGAAG AACACGATGG CTATGCTGAG GCACGAACTG
GACGTTCTTC AGGGATCGGC AGAACAATTG ATTACACTTC TGAGAATGAG CAAAAATACC
AGAAGTTACA GAAAAACATG GCCATACATA GATCCAAGAG ATATTTCCAC GAGAGGACCT
GGTTTCAACT TCATGGTGTA TCCGTACAAC ATCAAAGACG TGAGATGGGC ACTAGCTTTG
TCCATAGACA TAGTAAAACT CGCCATTTCA ACGTACGACG GAATGGTCGC TATGACTCCA
GGACTTCCTC TGGTTGTCAA CAAAAACTTC TACGAATGGT ATTTCAAGAG ACTGGAACCG
TGGCTGGGGG AACTAGCACT GGATCTTGGA AACGGTGAAA CCTTCAAACC GTGGGATCCA
CAGGCTCCAT GGAAACTCCT GGAATGGGCT CAGAAAATGT ACAAAGTGGA TATCGATCCA
AATAATGAAG AGGAAGTGCG TCTGACTCTG GGTTACGGCT GGTGGAAATA CGCTCCAGAT
GCAGCGGAGA AACTGTTGAA AAAACATGGT TTCTACCGCG ATGAAAACGG AAAATGGCAT
CTACCAAATG GTGACCTGTG GAAGATAACC ATACTCAGAG GCCCAGATCC TACAGATATG
GCTAACATCA TCATAGAGGG AATCGCCGAA CAGTGGAAAG AATTCGGTAT AGATGTCGTC
TTCAATGTCT CCTCTGCCGC TGCGACGCTC GCAGGTGAAG GACGGTTTGA GGTGGTCAAC
ACAGCACACG GTGGTTTTGC TGGTGAACCA TGGGGATTCC ATCCGGATCT TTACAGATGT
TTCAATGCGT TCAGAAGCGA TTTTGTAAAA CCCATTGGTG AACTGACACT TGGTAGTGCT
CTTAGGTGGA GTGATCCCAG AATGGACAAA ATCATAGAGG AACTCGAAAA AACAGACTGG
AACGATTACG AAAAAGTTAT AGATCTTGGA GTCGAAGGAT TGAAGCTCGA AATTGAAGAG
ATGGTAGCAA TACCGGTGTT CAACTGTCCT ATAACGATCG TCTTCGATGA GTATTACTGG
ACCAACTTCC CAAGTCCAGA AAACGATTAT GCGAGATGTG ACAACTTCAC CACCTGGCCC
CAGCTGAAGT ACCTGCTCCA CATGGTCAAA CCTGCTAAAT GA
 
Protein sequence
MKKVLVFVFL VLSAISAVMA QMLPPGIPRE KTLILPFLFA PLPAPGNWNL WAGWRAQNCG 
LHQFVTEPLW TINPNPEEGG IINALAAEPP IYNEDFTKLT IKLRKGIYWS DGVEFTADDF
VFTIKTVKDT PGLDYHGPMQ DVKDVYALDK YTVVVELERP NSRFHAYFVE RWNALRPMPK
HIFEKVKDVV SFDFNPPVSL GPYVLKDYDP AGYWVLWEKR KDWQRTVTGQ LFGEPVPEYV
LFINYGTPEK NTMAMLRHEL DVLQGSAEQL ITLLRMSKNT RSYRKTWPYI DPRDISTRGP
GFNFMVYPYN IKDVRWALAL SIDIVKLAIS TYDGMVAMTP GLPLVVNKNF YEWYFKRLEP
WLGELALDLG NGETFKPWDP QAPWKLLEWA QKMYKVDIDP NNEEEVRLTL GYGWWKYAPD
AAEKLLKKHG FYRDENGKWH LPNGDLWKIT ILRGPDPTDM ANIIIEGIAE QWKEFGIDVV
FNVSSAAATL AGEGRFEVVN TAHGGFAGEP WGFHPDLYRC FNAFRSDFVK PIGELTLGSA
LRWSDPRMDK IIEELEKTDW NDYEKVIDLG VEGLKLEIEE MVAIPVFNCP ITIVFDEYYW
TNFPSPENDY ARCDNFTTWP QLKYLLHMVK PAK
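
The relationship between the two sequences above can be checked by translating the gene sequence and by recomputing its GC content. The sketch below is one way to do that and is not the database's own procedure. It assumes the space-separated 10-base blocks are purely cosmetic. It uses the standard codon assignments, which translation table 11 shares for internal codons. The names `translate` and `gc_percent` are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace between blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGAAAAAGG TTCTTGTGTT CGTGTTTTTA"))  # MKKVLVFVFL
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence, or passing it to `gc_percent` and comparing with the reported GC content, gives a quick consistency check on the record.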