Gene TRQ2_1021 details

Gene Information

Locus tag: TRQ2_1021
Symbol:
ID: 6092451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga sp. RQ2
Kingdom: Bacteria
Replicon accession: NC_010483
Strand:
Start bp: 1067395
End bp: 1069560
Gene Length: 2166 bp
Protein Length: 721 aa
Translation table: 11
GC content: 42%
IMG OID: 642488217
Product: CRISPR-associated HD domain-containing protein
Protein accession: YP_001739054
Protein GI: 170288816
COG category: [R] General function prediction only
COG ID: [COG1203] Predicted helicases
TIGRFAM ID: [TIGR01587] CRISPR-associated helicase Cas3
            [TIGR01596] CRISPR-associated endonuclease Cas3-HD
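
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1067395
END_BP = 1069560


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2166
print(protein_length(length_bp))  # 721
```

Both results match the Gene Length and Protein Length fields listed above.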


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGATGTAG TGAAGTCTCA TCCTGACAGA AGACTGATCG ACCATCTCGA AGGTGTTAAA 
AAACGCTCTA TGGAGAAATT CAGATCACTT GATCTTTCCT GGGAAGAACT TTTCGGATAC
GATGAAAGAA TTCTGGAAGA ATTCGTTTCT CTACTCTCTG AATCGCACGA TATTGGAAAG
AGTACAACCT ACTTTCAAAA GCATCTGAAC GGTGAGAGAG TTGAAAGATC TTTGAGTGCG
CACGCTTTCT CCTCTGCGGT TGCTTTCTTC CACAGATCAA AAAACTTGCC GGACAAACTC
AGAATATTCG GTTTTGAGAT AATAAGAAGA CACCACGGAG ATTTCAGGAA CTTTCTGGAT
ATAGAAATCG ACGAAAAAGT TCTGGAAAAA CATTTCTCAG CCATTCCAGA AGATTTCCTG
AGAAGATATG ACCTTCATGA TTTGAAACTG TCAGAAACGA TGAAAGAGAT GAAAAAATTG
ATCAGTAAGT TTTCTATCCT TGGAAAAAAA GAGCTTTCTG ATTACTTTTT GATCCATCTT
TTTATGTCCG TACTTGTTTC TTCGGATAGG GAGGATGTGG TTTTCAAAGA CGAGCCGCTT
CCTTCTCTTC CTTCTTATGA AAAGGAGAAG ATACTCTCCT ACAGAGAGAA TCTCGGCAGA
AAGAATCAGA TCGACTCTCT GAGATGGAGA TTTCAGGACG AGATCTTGAA CTTCAAACCC
GAAAGAGGAA GGATATACTC CATAACTGCA CCGACGGGAA TAGGAAAAAC TCTTGCAAAT
CTCCTTTTTG CTGGACATCT CGCTGATGAA GACACGATCA TAATATACGC ACTTCCCTTC
ATAAACATAA TTGAGCAGAC CGTCGATAAG ATAAAGGAAA TATTTGAGAC AGAGGATCCG
TTCTTTGTCC TTCCTTTTCA TCACCTCGCA AATCCAGTTT ACGAAGAAGC TGACAAATAT
GAAGATCTTC TAATGAATCT CTGGCACTCC AGGGTGATAG TCACAACATT CGTGTCGCTT
CTGGAGTCCC TCATCACTTT CAGAAAGATT CCGTTCTTCT ACAAATTTCC TAAAGCCGTC
CTCATACTCG ATGAGGTTCA GGCAATACCT CACGAATACT GGACCCCTGT TGAAAAAACC
GTTGAATTTC TTTCGAAAAT GGGCACCACC GTTTTACTCT CTACAGCGAC AAAGCCCGCT
CTTTTGAAAG AAGCTCAGGA GGTGGTGTCT AACAAGAATG TTTATTTCAC AGCCTTGAAC
AGAACTGTTC TGAAAGTGGA AAAAGAGATG AGCTTTGAAG AATACAAAGA ATTCGTAATA
GAGACCTTGA AAGATGGTAA AAGAACGCTC ATCATAACAA ATACGATAAG AGAAGCCGAG
GAGATATACG ACGCTGTTGA AAGCATGGGA AAAACGTGTT TTCTTTCCTC CAGGGTGATA
CCAAAGCACA GACTGGAGAT TGTTTCGAAG ATAAATGAGT ACGATCTGTG CGTCTCCACT
CAGGTGGTGG AAGCAGGTGT TGATATCTCG TTTGAAAGGG TGATAAGAGA CATCGCACCT
GTTGACAGTA TCGTTCAGGC AGCGGGAAGG TGCAACAGAC ACTTCGAGTT GGAAAAGGGA
GAAGTGATAG TTGTACCTGT CAAGGATGAG AGAAAAAACA CCCTCTTTTC TTCTTACGTT
TACGGAAGTT TCCTCACAGA GACTTCCATG AACGTATTGA AGAACTACAA AGTTCTGGAA
GAAAGTGAAT TTTTCACGCT CGTGGAGGAT TTCTTCGACT ATGTGAAAAT GTACGGCAAT
CCGGATAAGA AAGGAATCGG CAAAGCGCTC GAGAATTTGA ATTTCAAAAA AATAGGAGAA
TTCAGTCTTA TTGAACCAGA ACCAACGGTA CCGTTCATCG TTCTTATTGA TGAAGAGGCT
CAAAGAGTAT TCGAAGAATT TGCAGAGATC TTCGATGGGA AAAAGTCAAG AGAAAACTTC
TCTCTCGTGA AGAGGTTGTT CAGAGAACTT TCTCCCTACA TCGTGAGCGC GAGGATAAAG
AAGGATCTTT CATTCCCACA CACGATCGCC GGCATGGTGG TTATCTACAG AAACGTTCTC
GACAAGTGGT ACCATCCTGT GAAGGGTTTG AGAGTGGAAG GATCCGATGA GGTGATCATC
ATATGA
 
Protein sequence
MDVVKSHPDR RLIDHLEGVK KRSMEKFRSL DLSWEELFGY DERILEEFVS LLSESHDIGK 
STTYFQKHLN GERVERSLSA HAFSSAVAFF HRSKNLPDKL RIFGFEIIRR HHGDFRNFLD
IEIDEKVLEK HFSAIPEDFL RRYDLHDLKL SETMKEMKKL ISKFSILGKK ELSDYFLIHL
FMSVLVSSDR EDVVFKDEPL PSLPSYEKEK ILSYRENLGR KNQIDSLRWR FQDEILNFKP
ERGRIYSITA PTGIGKTLAN LLFAGHLADE DTIIIYALPF INIIEQTVDK IKEIFETEDP
FFVLPFHHLA NPVYEEADKY EDLLMNLWHS RVIVTTFVSL LESLITFRKI PFFYKFPKAV
LILDEVQAIP HEYWTPVEKT VEFLSKMGTT VLLSTATKPA LLKEAQEVVS NKNVYFTALN
RTVLKVEKEM SFEEYKEFVI ETLKDGKRTL IITNTIREAE EIYDAVESMG KTCFLSSRVI
PKHRLEIVSK INEYDLCVST QVVEAGVDIS FERVIRDIAP VDSIVQAAGR CNRHFELEKG
EVIVVPVKDE RKNTLFSSYV YGSFLTETSM NVLKNYKVLE ESEFFTLVED FFDYVKMYGN
PDKKGIGKAL ENLNFKKIGE FSLIEPEPTV PFIVLIDEEA QRVFEEFAEI FDGKKSRENF
SLVKRLFREL SPYIVSARIK KDLSFPHTIA GMVVIYRNVL DKWYHPVKGL RVEGSDEVII
I
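
The protein sequence starts with M even though the gene sequence starts with GTG, which ordinarily encodes valine. Translation table 11 (bacterial, archaeal and plant plastid) allows GTG as an alternative initiation codon, and an initiation codon is read as methionine. A minimal sketch of that translation follows. It assumes the input is a complete CDS beginning at its initiation codon, it does not check that the first codon is a valid start, and the name `translate_cds` is hypothetical.

```python
# Codon table built from the conventional TCAG ordering. Table 11 uses the
# same codon-to-amino-acid assignments as the standard code and differs only
# in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; whitespace between 10-nt blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if not codons:
        return ""
    # Assumption: the first codon is the initiation codon, so it is read as M.
    residues = ["M"] + [CODON_TABLE[codon] for codon in codons[1:]]
    # Drop the terminal stop codon(s) from the reported protein.
    return "".join(residues).rstrip("*")


# First 30 nucleotides of the gene sequence above.
print(translate_cds("GTGGATGTAG TGAAGTCTCA TCCTGACAGA"))  # MDVVKSHPDR
```

Passing the full gene sequence, with or without the spaces between blocks, should reproduce the protein sequence listed above under the same assumptions.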