Gene TRQ2_1797 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_1797 
Symbol 
ID6093248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp1814259 
End bp1815470 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content44% 
IMG OID642488994 
Productmajor facilitator transporter 
Protein accessionYP_001739811 
Protein GI170289573 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000972386 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAGAA CGGGAATTCT TCTTGGAATA TGTCTTGGCC TCACCAGTTT TTCAATACTT 
CAGGGATCGG TGTTCGGAGC GGTTCTTCCC TCCATTGTGG AAGAATTCGG CGTGGATTGG
AGTATCATAG GAGTTGCTAT GAGTGTCTGG ACGGTCATTT CTGCTCTCTC ACCCATGTTA
TTTGGAAGAT TTGTTCATAG ATTATATCCA ATGAACTCCA TGGCTCTGGT CATGATGATG
CTCTCTATTC CAACAATTCT TGTTGCTTTC GTGAAAGACT TTTTCTCTTT AAACGTTGTG
AAGATAGTGG GGAGCCTGGC TGTTCCCTTC TCTTATCCTC TTGCTGCAAA AGTGGTGGAG
ATGTATGTGG ACTCCAGAAA AAGGGGAATC GCAACTGCCA TATACAACAC TGGTTCTATG
ATCGGACTTG CACTCGGATA CGCTGTTGTT GCGTTAGCAG GTGGTTATTG GAAAAGATCC
ATGATCACTG GAGGATTTCT CGGTGTTATT TATGTTCCTG TTGCATACAT TCTGTGGAAA
AGCTTGCTGG AGTCAAAGGT ACAGAGAAAG CCGGAGTGGA ACGATTCTCA AAAGAGATCA
CATGTTTCTT TCAAACGAGT GTTCTCCATC ATACTGTGGC TTTCCTTCGG TCATTTTTCT
GCTGTTTACA CCTGGAATCT CATGTTCAAT TGGCTTTCTA CTTTCCTTGT TCGTGAGATC
CAGCTGGGTT ATAGTTTCAT AGCCCTTGTG CTTGGAATCA TGGCTGTTGT ATCGAGCGTA
ATGGAGGTTT TCGTTGGATT GTGGTCTGAC CGGGTGAGAG GAATGCGTGG AAGGTTAATT
CCCCTGTATA CCGGTTTATT TCCGTCGGCT TTTCTTTTAA TACTTTCCAC TCTTTCAACC
AATCCTCTTC TGACATCCAT TCTGGTGGGG TTCTCCATCC TCTTCTGGAG ACTTTCAACC
CCTTCTTTCT GGGCAATATT TGGAGATCTC ATTCCGCAGG AACACTTCGA AAAAGCGAGT
AGTATCTACG TGGGAGCTGT CCTTCTTTCT GGTATTGCTT CTTCTATTAT GAACGGTTAC
ATAGTCTCGT TGACAGGTTC GATGAAGTAC GCCATACTCC TTTCGGCTTT TATACTGATT
CTTTCTCCGA TTTTCTTCAC GGTAGCGGGA AAAGTTGGTA CGAGAATTTC AGGAGCATGG
ATCCATCTAT AG
 
Protein sequence
MERTGILLGI CLGLTSFSIL QGSVFGAVLP SIVEEFGVDW SIIGVAMSVW TVISALSPML 
FGRFVHRLYP MNSMALVMMM LSIPTILVAF VKDFFSLNVV KIVGSLAVPF SYPLAAKVVE
MYVDSRKRGI ATAIYNTGSM IGLALGYAVV ALAGGYWKRS MITGGFLGVI YVPVAYILWK
SLLESKVQRK PEWNDSQKRS HVSFKRVFSI ILWLSFGHFS AVYTWNLMFN WLSTFLVREI
QLGYSFIALV LGIMAVVSSV MEVFVGLWSD RVRGMRGRLI PLYTGLFPSA FLLILSTLST
NPLLTSILVG FSILFWRLST PSFWAIFGDL IPQEHFEKAS SIYVGAVLLS GIASSIMNGY
IVSLTGSMKY AILLSAFILI LSPIFFTVAG KVGTRISGAW IHL