Gene TRQ2_0661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_0661 
Symbol 
ID6092078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp675118 
End bp676365 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content45% 
IMG OID642487847 
Productextracellular solute-binding protein 
Protein accessionYP_001738697 
Protein GI170288459 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000689748 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAAGT TACTGGTATT TCTGGTAGTT CTTGTTTTAG CTCTTCCACT CATAGCCAAG 
ATTCAAATTA CGTTCATGAC GCCACTCTCC GGTGCTGATG GAGCGTATAT GGACCAGATC
ATTCAGAAGT TCAACGAAAC ACATCCTGAT ATTGAGATTG TTCATCTTGT CGTAGGAAGT
TCCCTGGAAT ACAAGCAGAA GCTTGCCACG GGTATTTCCA CGAAATCTGC TCCCCAGGTT
CTGTTTATTA GGAAGCATGA CATGCCGCTG TTTCTTGATC ACTTCAGAAC CTTCACAAAA
GAAGAGCTCC AAAAGTTGGG TATCGATATC GATGATATTT ATCCCTCTGT CCTCGAAGGA
CTTGTAACAA AAGACGGTAA GTACTATGGA ATACCAATTG ACGTATGGAT TTTCTACATG
GCCTACAGGA AAGACAATTT CAAAAAGGCT GATCTTGATC CAGACCTTCC ATTGAAGGAA
GGGCCACTCA ACAGAGAACA GTTTGTGAAC GTTCTAAGGG CTCTCAGAAA AGTCACACCA
GAAGGTTCAT TCCCGTGGTG TGAGTCTCCA AGCTGGGATT GGGAATTTGT ACATTTGCTG
TGGCAATTTG GTGGAGATAT TCTGACACCT GACTTCAAGC ACCCTGCATT CAAAGAAGCT
GGTATAAAAG TTCTCAAATT CCTCCAGGAA CTTCAAAAAG AAGGATTGTA TCCTGATCAA
CCTATCGATG CAGGGCCAAC CTTTGAGTCT GGAGCGGGTT CTGTGTTGAT AACCGGTATC
TGGACGATCA ATCCATGGCT TGATCTGCTT GGAGATGACT TTGGCTACGC ACCAGCTCCT
CAGCTTGGAA CAACAAAATC TGTGTTTGGT GGTTCACATG TGATCGCAAT TCCAAAGGTC
ATGGTGGAAG ACGAAAAGAC CTTCAACGCC GTGATGACCT GGGTTAAGTA TCTGTGGGAT
CACGCAATCG AATGGTATGC GGCTGGTCAG ACACCCGCCA GGAAATCCAT AGCTGAGAGC
GAAGAATTTA AAGAAAAGTT CCCACATCTG TACGTTGCGG CTCAGCAGGT ATCTTATGTT
AAAACCTTCC AGATGTTCCC GTACATAGCC GAGATCCTTG CCGAGATAGT GCCATATATT
GAAGAAGTGC TTATCAACAA GAGCATGACG CCTGAGGAAG CAATGGAGGA AGCCGAAATG
GTTGCTCAGG AAATAATTGA CGATTACTGG GCAACAGTTG GAGAATGA
 
Protein sequence
MRKLLVFLVV LVLALPLIAK IQITFMTPLS GADGAYMDQI IQKFNETHPD IEIVHLVVGS 
SLEYKQKLAT GISTKSAPQV LFIRKHDMPL FLDHFRTFTK EELQKLGIDI DDIYPSVLEG
LVTKDGKYYG IPIDVWIFYM AYRKDNFKKA DLDPDLPLKE GPLNREQFVN VLRALRKVTP
EGSFPWCESP SWDWEFVHLL WQFGGDILTP DFKHPAFKEA GIKVLKFLQE LQKEGLYPDQ
PIDAGPTFES GAGSVLITGI WTINPWLDLL GDDFGYAPAP QLGTTKSVFG GSHVIAIPKV
MVEDEKTFNA VMTWVKYLWD HAIEWYAAGQ TPARKSIAES EEFKEKFPHL YVAAQQVSYV
KTFQMFPYIA EILAEIVPYI EEVLINKSMT PEEAMEEAEM VAQEIIDDYW ATVGE