Gene TRQ2_1595 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_1595 
Symbol 
ID6093044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp1609646 
End bp1611319 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content50% 
IMG OID642488796 
Productextracellular solute-binding protein 
Protein accessionYP_001739614 
Protein GI170289376 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0133917 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGGT TTTTAGTCGT TCTCGTTCTG GTCCTGGCAC TGGTTTCGGT TTTCGGACAG 
ACTTTTGAGA GAAACAAAAC GCTCTACTGG GGTGGAGCGC TGTGGTCTCC TCCATCCAAC
TGGAACCCGT TCACACCATG GAACGCGGTT GCGGGAACCA TCGGTCTTGT CTATGAACCT
CTGTTCCTCT ACGATCCTCT GAACGACAAG TTCGAGCCGT GGCTTGCAGA AAAAGGAGAA
TGGGTCAGCA ACAACGAATA CGTACTCACG CTCAGAAAGG GTCTCAGATG GCAGGACGGA
GTTCCTCTCA CGGCAGACGA CGTGGTTTTC ACCTTTGAAA TCGCCAAGAA GTACACTGGT
ATCAGCTACA GTCCTGTGTG GAACTGGCTC GACAGGATCG AAAGGGTCGA TGAACGAACG
CTGAAGTTCG TCTTCTCCGA CCCGAGGTAC CAGGAATGGA AACAGATGCT CATCAACACA
CCGATCGTAC CAAAACACAT CTGGGAAAAC AAAACAGAGG AAGAAGTTCT TCAGGCGGCC
AATGAAAATC CAGTTGGATC CGGTCCGTAC TACGTTGAAA GCTGGGCAGA CGACAGATGT
GTATTCAAGA AGAACGGGAA CTGGTGGGGC ATCAGAGAAC TCGGTTACGA TCCCAAACCT
GAAAGGATCG TGGAACTGAG AGTGCTCAGC AACAATGTCG CAGTAGGAAT GCTCATGAAA
GGAGAACTCG ACTGGAGCAA CTTCTTCCTG CCGGGTGTTC CGGTTTTGAA GAAAGCATAC
GGAATCGTCA CCTGGTATGA AAACGCTCCT TACATGCTCC CGGCCAACAC CGCAGGAATC
TACATCAACG TGAGCAAGTA TCCTCTCAGC ATACCTGAGT TCAGAAGAGC AATGGCTTAC
GCTATCAATC CCGAGAAGAT CGTTACCAGA GCTTACGAGA ACATGGTGAC GGCTGCCAAT
CCCGCTGGAA TCCTGCCGCT TCCCGGTTAC ATGAAGTACT ATCCGAAAGA AGTCGTCGAT
AAGTACGGAT TCAAGTACGA TCCGGAGATG GCAAAGAAGA TCCTCGACGA GCTTGGATTC
AAAGATGTGA ACAAGGATGG ATTCAGAGAA GATCCGAACG GAAAGCCGTT CAAGCTCACG
ATTGAGTGTC CGTACGGATG GACCGACTGG ATGGTTTCTA TCCAGTCCAT TGCAGAAGAT
CTCGTGAAAG TCGGAATCAA CGTCGAACCT AAATACCCCG ACTACTCCAA ATACGCAGAC
GACCTCTACG GTGGAAAGTT CGATCTCATA CTCAACAACT TTACAACCGG TGTTTCCGCT
ACCATCTGGT CCTATTTCAA CGGTGTGTTC TATCCGGATG CAGTAGAATC CGAGTACTCC
TACTCCGGAA ACTTTGGAAA GTACGCCAAT CCTGAAGTTG AGACTCTTCT CGACGAACTC
AACAGAAGCA ATGATGATGC TAAAATTAAA GAAGTAGTAG CCAAGCTGTC AGAGATACTG
CTCAAGGATC TGCCGTTCAT TCCTCTGTGG TACAACGGTG CATGGTTCCA GGCTTCTGAA
GCTGTGTGGA CCAACTGGCC AACGGAGAAG AATCCGTACG CTGTCCCGAT AGGCTGGAAC
GGCTGGTGGC AGCTCACAGG AATCAAGACG CTCTTTGGTA TTGAAGCAAA GTAA
 
Protein sequence
MKRFLVVLVL VLALVSVFGQ TFERNKTLYW GGALWSPPSN WNPFTPWNAV AGTIGLVYEP 
LFLYDPLNDK FEPWLAEKGE WVSNNEYVLT LRKGLRWQDG VPLTADDVVF TFEIAKKYTG
ISYSPVWNWL DRIERVDERT LKFVFSDPRY QEWKQMLINT PIVPKHIWEN KTEEEVLQAA
NENPVGSGPY YVESWADDRC VFKKNGNWWG IRELGYDPKP ERIVELRVLS NNVAVGMLMK
GELDWSNFFL PGVPVLKKAY GIVTWYENAP YMLPANTAGI YINVSKYPLS IPEFRRAMAY
AINPEKIVTR AYENMVTAAN PAGILPLPGY MKYYPKEVVD KYGFKYDPEM AKKILDELGF
KDVNKDGFRE DPNGKPFKLT IECPYGWTDW MVSIQSIAED LVKVGINVEP KYPDYSKYAD
DLYGGKFDLI LNNFTTGVSA TIWSYFNGVF YPDAVESEYS YSGNFGKYAN PEVETLLDEL
NRSNDDAKIK EVVAKLSEIL LKDLPFIPLW YNGAWFQASE AVWTNWPTEK NPYAVPIGWN
GWWQLTGIKT LFGIEAK