Gene TRQ2_1697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_1697 
Symbol 
ID6093147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp1718939 
End bp1719988 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content43% 
IMG OID642488897 
ProductSARP family transcriptional regulator 
Protein accessionYP_001739714 
Protein GI170289476 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000546308 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTATTT TCGTGAAAAC CTTTGGCGGG ACAAGAGTTA TCAAGAACAA TGATATCGTG 
AACGCAAGAG ACTGGCCATC GCAAAAAGCG TTTGCCCTGT TCAGGTATCT CATTTTCAGA
AGGAACGAAG AAGTGTCCGT CGAAGAGATC TACAACCTCT TTTGGGAAGA CATGGACGAC
ACATTCGCGA AATCAAATCT GAACACCACA CTCCATATTA TAAGGAGAAC CACCGGAATA
ACGAGCGAAG AACTCTTTGT GAAAGGAGAT CTCTGCTGTT TCTTCCCGGG AGACAAAATC
ACCATAGACG CAGATATTTT CGAAGAGTGC CACAGAAATC TGATGAAAGC CACATCGGAT
ATTGAACATG AAAAACTTCT TAAGAGGATG TTCGAGATTT ATGCAGGACC GTTTCTAATC
GAAGACATTT TCGCAGAATG GGTGCAGGAA ATCAGAGAAA TCTACGAATC GTGGTACTCA
GATGTTTTAA AAGAGCTCTT CAAATTGTAT CTGGCAAAAA AAGATTACGA CGCCGCCCTC
GAGATGGTAA ACGCTTATTT TCAGAGAGAG CCTTACGACG AGGATATGTA CTACAAAGCA
ATAGAGGTCC TTCTGAAAAA GGGTGATATC ACAAGGGCAA AACGCGTATA CGACAAGCTC
TCGAGTCATC TTATGGAGAT AGGGATCAAA CCTCGATTGA AATTCGATGA TTTTCTCTCC
AAAAGAGGCT CAGAATTCAT GCTGAACGGT AACAAGGCAG TGGTGGTTGA TGAAAAGCTT
TTTGAAAGTT TCCTCTTTCT GGAGAGTCGA AGGAGAGAAA AGTCTTTTGT CCTCGTCGAG
GTGAAACTGA TGAATAAGAG TATCAGCACT GAAGATGTCT CCCAAAGGGT AGCATCTCAT
CTTCGAAAGG GAGACGTGAT GACCTTCTCA GGTGAAACTA TCCGAATTCT CTTCCACTGT
CCCGAACAGC GTCGTCCAAC AATGGAAAAA CGTGTAGCAG ACGTCCTCGA GAAAGTTGGA
GTGAAGAAAG GTCAGTACGA AATTTCCTGA
 
Protein sequence
MSIFVKTFGG TRVIKNNDIV NARDWPSQKA FALFRYLIFR RNEEVSVEEI YNLFWEDMDD 
TFAKSNLNTT LHIIRRTTGI TSEELFVKGD LCCFFPGDKI TIDADIFEEC HRNLMKATSD
IEHEKLLKRM FEIYAGPFLI EDIFAEWVQE IREIYESWYS DVLKELFKLY LAKKDYDAAL
EMVNAYFQRE PYDEDMYYKA IEVLLKKGDI TRAKRVYDKL SSHLMEIGIK PRLKFDDFLS
KRGSEFMLNG NKAVVVDEKL FESFLFLESR RREKSFVLVE VKLMNKSIST EDVSQRVASH
LRKGDVMTFS GETIRILFHC PEQRRPTMEK RVADVLEKVG VKKGQYEIS