Gene TRQ2_1614 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_1614 
Symbol 
ID6093063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp1627037 
End bp1628212 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content48% 
IMG OID642488815 
Productextracellular solute-binding protein 
Protein accessionYP_001739633 
Protein GI170289395 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000159886 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT TTCTGGTGAT CGCTTTGCTT GTCGTTTCTC TAGTTGTCCT CGCCCAACCG 
AAACTCACCA TCTGGTGCTC TGAGAAGCAG GTTGACATCC TTCAGAAACT CGGAGAGGAG
TTCAAGGCGA AGTACGGCGT AGAGGTTGAA GTGCAGTACG TGAACTTCCA AGACATCAAG
TCTAAGTTCC TAACAGCAGC TCCTGAGGGA CAGGGTGCAG ATATCATCGT TGGAGCACAC
GACTGGGTAG GCGAACTCGC AGTCAACGGT TTGATCGAAC CCATTCCAAA CTTCAGTGAC
CTGAAAAACT TCTATGAAAC CGCTCTCAAC GCGTTCTCTT ACGGTGGAAA ACTCTACGGT
ATTCCTTACG CCATGGAAGC GATCGCACTC ATCTACAACA AGGACTATGT TCCTGAACCC
CCAAAGACCA TGGACGAGCT TATAGAAATA GCAAAACAGA TCGATGAAGA ATTTGGAGGA
GAAGTGAGAG GTTTCATCAC CTCAGCGGCC GAGTTTTACT ACATTGCTCC TTTCATTTTC
GGATACGGTG GATACGTATT CAAACAGACA GAAAAAGGAC TGGACGTCAA CGATATCGGA
CTGGCCAACG AAGGAGCCAT CAAGGGTGTG AAACTCCTCA AAAGATTGGT TGATGAGGGA
ATACTGGATC CCAGTGACAA TTATCAGATC ATGGATTCCA TGTTCAGGGA AGGCCAGGCG
GCGATGATCA TCAACGGACC GTGGGCCATT AAGGCGTACA AGGATGCAGG AATAGACTAT
GGTGTAGCCC CAATCCCCGA TCTGGAACCT GGCGTTCCTG CAAGACCTTT CGTTGGGGTC
CAGGGCTTCA TGGTGAACGC AAAATCCCCA AACAAACTCC TTGCCATCGA ATTCCTGACC
AGTTTCATTG CAAAAAAGGA AACGATGTAC AGAATCTACC TTGGAGATCC AAGACTTCCC
TCCAGAAAGG ACGTGCTCGA ACTTGTGAAA GATAACCCAG ACGTAGTTGG CTTCACACTG
AGCGCAGCCA ACGGTATTCC AATGCCCAAC GTTCCACAGA TGGCCGCTGT CTGGGCCGCT
ATGAACGATG CGCTCAATCT CGTTGTGAAC GGAAAAGCAA CGGTCGAAGA AGCGCTCAAA
AACGCCGTTG AAAGAATCAA AGCTCAGATT CAGTAA
 
Protein sequence
MKKFLVIALL VVSLVVLAQP KLTIWCSEKQ VDILQKLGEE FKAKYGVEVE VQYVNFQDIK 
SKFLTAAPEG QGADIIVGAH DWVGELAVNG LIEPIPNFSD LKNFYETALN AFSYGGKLYG
IPYAMEAIAL IYNKDYVPEP PKTMDELIEI AKQIDEEFGG EVRGFITSAA EFYYIAPFIF
GYGGYVFKQT EKGLDVNDIG LANEGAIKGV KLLKRLVDEG ILDPSDNYQI MDSMFREGQA
AMIINGPWAI KAYKDAGIDY GVAPIPDLEP GVPARPFVGV QGFMVNAKSP NKLLAIEFLT
SFIAKKETMY RIYLGDPRLP SRKDVLELVK DNPDVVGFTL SAANGIPMPN VPQMAAVWAA
MNDALNLVVN GKATVEEALK NAVERIKAQI Q