Gene Information |
Locus tag | TRQ2_1797 |
Symbol | |
ID | 6093248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 1814259 |
End bp | 1815470 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 642488994 |
Product | major facilitator transporter |
Protein accession | YP_001739811 |
Protein GI | 170289573 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
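The length fields in this table follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon (the function names are illustrative, not part of the record):

```python
# Values copied from the Gene Information table.
START_BP = 1814259
END_BP = 1815470
STOP_CODON_BP = 3  # assumption: the CDS ends with exactly one stop codon


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (gene_len_bp - STOP_CODON_BP) // 3


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1212
print(protein_length(length_bp))  # 403
```

Under these assumptions the results agree with the Gene Length (1212 bp) and Protein Length (403 aa) fields.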
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000972386 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAGAA CGGGAATTCT TCTTGGAATA TGTCTTGGCC TCACCAGTTT TTCAATACTT CAGGGATCGG TGTTCGGAGC GGTTCTTCCC TCCATTGTGG AAGAATTCGG CGTGGATTGG AGTATCATAG GAGTTGCTAT GAGTGTCTGG ACGGTCATTT CTGCTCTCTC ACCCATGTTA TTTGGAAGAT TTGTTCATAG ATTATATCCA ATGAACTCCA TGGCTCTGGT CATGATGATG CTCTCTATTC CAACAATTCT TGTTGCTTTC GTGAAAGACT TTTTCTCTTT AAACGTTGTG AAGATAGTGG GGAGCCTGGC TGTTCCCTTC TCTTATCCTC TTGCTGCAAA AGTGGTGGAG ATGTATGTGG ACTCCAGAAA AAGGGGAATC GCAACTGCCA TATACAACAC TGGTTCTATG ATCGGACTTG CACTCGGATA CGCTGTTGTT GCGTTAGCAG GTGGTTATTG GAAAAGATCC ATGATCACTG GAGGATTTCT CGGTGTTATT TATGTTCCTG TTGCATACAT TCTGTGGAAA AGCTTGCTGG AGTCAAAGGT ACAGAGAAAG CCGGAGTGGA ACGATTCTCA AAAGAGATCA CATGTTTCTT TCAAACGAGT GTTCTCCATC ATACTGTGGC TTTCCTTCGG TCATTTTTCT GCTGTTTACA CCTGGAATCT CATGTTCAAT TGGCTTTCTA CTTTCCTTGT TCGTGAGATC CAGCTGGGTT ATAGTTTCAT AGCCCTTGTG CTTGGAATCA TGGCTGTTGT ATCGAGCGTA ATGGAGGTTT TCGTTGGATT GTGGTCTGAC CGGGTGAGAG GAATGCGTGG AAGGTTAATT CCCCTGTATA CCGGTTTATT TCCGTCGGCT TTTCTTTTAA TACTTTCCAC TCTTTCAACC AATCCTCTTC TGACATCCAT TCTGGTGGGG TTCTCCATCC TCTTCTGGAG ACTTTCAACC CCTTCTTTCT GGGCAATATT TGGAGATCTC ATTCCGCAGG AACACTTCGA AAAAGCGAGT AGTATCTACG TGGGAGCTGT CCTTCTTTCT GGTATTGCTT CTTCTATTAT GAACGGTTAC ATAGTCTCGT TGACAGGTTC GATGAAGTAC GCCATACTCC TTTCGGCTTT TATACTGATT CTTTCTCCGA TTTTCTTCAC GGTAGCGGGA AAAGTTGGTA CGAGAATTTC AGGAGCATGG ATCCATCTAT AG
|
Protein sequence | MERTGILLGI CLGLTSFSIL QGSVFGAVLP SIVEEFGVDW SIIGVAMSVW TVISALSPML FGRFVHRLYP MNSMALVMMM LSIPTILVAF VKDFFSLNVV KIVGSLAVPF SYPLAAKVVE MYVDSRKRGI ATAIYNTGSM IGLALGYAVV ALAGGYWKRS MITGGFLGVI YVPVAYILWK SLLESKVQRK PEWNDSQKRS HVSFKRVFSI ILWLSFGHFS AVYTWNLMFN WLSTFLVREI QLGYSFIALV LGIMAVVSSV MEVFVGLWSD RVRGMRGRLI PLYTGLFPSA FLLILSTLST NPLLTSILVG FSILFWRLST PSFWAIFGDL IPQEHFEKAS SIYVGAVLLS GIASSIMNGY IVSLTGSMKY AILLSAFILI LSPIFFTVAG KVGTRISGAW IHL
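The two sequences are shown in space-separated blocks of 10 characters. The sketch below illustrates how the gene sequence relates to the protein sequence and to the GC content field. It assumes the input contains only A, C, G and T, and it uses the standard codon assignments, which translation table 11 shares. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. Although the gene lies on the minus strand, the sequence as listed already begins with ATG, so no reverse complement is applied here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest, third fastest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and last two blocks of the gene sequence in this record.
print(translate("ATGGAAAGAA CGGGAATTCT TCTTGGAATA"))  # MERTGILLGI
print(translate("ATCCATCTAT AG"))                     # IHL
```

The two outputs match the first block and the final three residues of the protein sequence listed in this record. Passing the full gene sequence to `gc_percent` would give the value to compare against the GC content field.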