Gene Information |
Locus tag | TRQ2_0661 |
Symbol | |
ID | 6092078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 675118 |
End bp | 676365 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 642487847 |
Product | extracellular solute-binding protein |
Protein accession | YP_001738697 |
Protein GI | 170288459 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
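The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon. Both are assumptions, although the arithmetic agrees with the values in this record. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 675118, 676365
length = gene_length_bp(start_bp, end_bp)
print(length)                              # 1248, matches "Gene Length"
print(expected_protein_length_aa(length))  # 415, matches "Protein Length"
```

The strand does not affect the length calculation. It only determines which end of the interval holds the start codon.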
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000689748 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGAAGT TACTGGTATT TCTGGTAGTT CTTGTTTTAG CTCTTCCACT CATAGCCAAG ATTCAAATTA CGTTCATGAC GCCACTCTCC GGTGCTGATG GAGCGTATAT GGACCAGATC ATTCAGAAGT TCAACGAAAC ACATCCTGAT ATTGAGATTG TTCATCTTGT CGTAGGAAGT TCCCTGGAAT ACAAGCAGAA GCTTGCCACG GGTATTTCCA CGAAATCTGC TCCCCAGGTT CTGTTTATTA GGAAGCATGA CATGCCGCTG TTTCTTGATC ACTTCAGAAC CTTCACAAAA GAAGAGCTCC AAAAGTTGGG TATCGATATC GATGATATTT ATCCCTCTGT CCTCGAAGGA CTTGTAACAA AAGACGGTAA GTACTATGGA ATACCAATTG ACGTATGGAT TTTCTACATG GCCTACAGGA AAGACAATTT CAAAAAGGCT GATCTTGATC CAGACCTTCC ATTGAAGGAA GGGCCACTCA ACAGAGAACA GTTTGTGAAC GTTCTAAGGG CTCTCAGAAA AGTCACACCA GAAGGTTCAT TCCCGTGGTG TGAGTCTCCA AGCTGGGATT GGGAATTTGT ACATTTGCTG TGGCAATTTG GTGGAGATAT TCTGACACCT GACTTCAAGC ACCCTGCATT CAAAGAAGCT GGTATAAAAG TTCTCAAATT CCTCCAGGAA CTTCAAAAAG AAGGATTGTA TCCTGATCAA CCTATCGATG CAGGGCCAAC CTTTGAGTCT GGAGCGGGTT CTGTGTTGAT AACCGGTATC TGGACGATCA ATCCATGGCT TGATCTGCTT GGAGATGACT TTGGCTACGC ACCAGCTCCT CAGCTTGGAA CAACAAAATC TGTGTTTGGT GGTTCACATG TGATCGCAAT TCCAAAGGTC ATGGTGGAAG ACGAAAAGAC CTTCAACGCC GTGATGACCT GGGTTAAGTA TCTGTGGGAT CACGCAATCG AATGGTATGC GGCTGGTCAG ACACCCGCCA GGAAATCCAT AGCTGAGAGC GAAGAATTTA AAGAAAAGTT CCCACATCTG TACGTTGCGG CTCAGCAGGT ATCTTATGTT AAAACCTTCC AGATGTTCCC GTACATAGCC GAGATCCTTG CCGAGATAGT GCCATATATT GAAGAAGTGC TTATCAACAA GAGCATGACG CCTGAGGAAG CAATGGAGGA AGCCGAAATG GTTGCTCAGG AAATAATTGA CGATTACTGG GCAACAGTTG GAGAATGA
|
Protein sequence | MRKLLVFLVV LVLALPLIAK IQITFMTPLS GADGAYMDQI IQKFNETHPD IEIVHLVVGS SLEYKQKLAT GISTKSAPQV LFIRKHDMPL FLDHFRTFTK EELQKLGIDI DDIYPSVLEG LVTKDGKYYG IPIDVWIFYM AYRKDNFKKA DLDPDLPLKE GPLNREQFVN VLRALRKVTP EGSFPWCESP SWDWEFVHLL WQFGGDILTP DFKHPAFKEA GIKVLKFLQE LQKEGLYPDQ PIDAGPTFES GAGSVLITGI WTINPWLDLL GDDFGYAPAP QLGTTKSVFG GSHVIAIPKV MVEDEKTFNA VMTWVKYLWD HAIEWYAAGQ TPARKSIAES EEFKEKFPHL YVAAQQVSYV KTFQMFPYIA EILAEIVPYI EEVLINKSMT PEEAMEEAEM VAQEIIDDYW ATVGE
|
| |
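The sequences above are shown in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and run a basic check on the coding sequence. It is an illustration under stated assumptions and not the database's own procedure: the helper names are hypothetical, and the stop codons used (TAA, TAG, TGA) are the standard stops of translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of translation table 11


def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Illustration using only the first two blocks of the gene sequence above;
# paste the full "Gene sequence" field to check the whole record.
fragment = clean_sequence("ATGAGGAAGT TACTGGTATT")
print(len(fragment))
print(round(gc_fraction(fragment) * 100))
```

Applying `gc_fraction` to the complete gene sequence allows a comparison with the "GC content" field, and `ends_with_stop` confirms that the displayed sequence is already in coding orientation even though the gene lies on the minus strand.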