Gene TRQ2_1009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_1009 
Symbol 
ID6092439 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp1053866 
End bp1056019 
Gene Length2154 bp 
Protein Length717 aa 
Translation table11 
GC content43% 
IMG OID642488205 
ProductCRISPR-associated Csm1 family protein 
Protein accessionYP_001739042 
Protein GI170288804 
COG category[R] General function prediction only 
COG ID[COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) 
TIGRFAM ID[TIGR02578] CRISPR-associated protein, Csm1 family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAGACA GAGAAGAGCT TGTTGTTGGG GCTCTCCTGC ACGACATAGG AAAGGTCGTA 
AGAAGGGCGG GGGATGACAG AAGGCATCAG ATTGCCGGAT ACGATTTTAC AAACAAAGTG
AAGAAATTCG CTGTAATTCA GGACTACATT CACTACCATC ATGAAAAGGA TCTTCTGAAA
AAAAGTTTGG AAAACGAAAA GGTTTGGTAC GTGTGCTTCG CAGACAACCT CTCGAGCAAA
GAAAGAATGA CTGAAGGTCA GAAGTTTGAA GAACTTCGAA GAATGGACAA CCTCCTCTCA
AAGATCCCGG AGGGGGAATC TTCCAGAAAC GTCACTTACT TTCCTGCGAA ACCAGCCAAC
GAAGTGGTGG AAGCGGTAAA GGACATGAAA GAAGATCAGA AAACCTACGA GGATCTCTAC
AGGAGATTTG TCGAAGACGC TCAGAAGATT TCTCCCACAC CAGATGATGT GAATTTTCTC
ACTTACAAGT ACTTCTCGTT CATTCCTCAG GAAACAAGAG TGGAAGGAGA CATGGATATA
TCGCTCTACG ACCATCTGAA GGTCACGGCT ATGCTTGCTC TCTCGCTCTA TGACTACGCA
AAGGAAAACG ACCTCAAATT TGAATCCTAC CAGGAGATGA AATCACACTT TGAGAATTCC
AACGTGAAAC CCTTCCTACT TGTTGGGGGA GATGTCTCCG GGATACAGAA TTTTATTGCC
AACGTCTCTT CAAAAGGAGC TCTCAGATCC TACAGAGGAA GGAGTTTCTT CATAGAAATT
CTCCAGGAAG TCGTTGTGGA TGAGATTCTT GATAAAACAG GCTTTTACAG GACAAACGTT
CACTTCATAG GAGGAGGACA TTTCTACCTT GTCCTTTCCA ACACAGAGAA GGTGAAGAAA
GCCCTCGAAG AGATCAGAAA CGAGCTGAAC GAATGGTTCA GAAATAGGGG TCTTTCACTC
CACCTTGTGA TCGAATCTGT TGAATTTTCG GTGAAAGACG TGGAAGACAT GTCCAAGGTT
TTCAAGAAGA TCGGTGAAAA GTTGAACGAA AGAAAATACA GAATGTACAC AGAAAAAGAC
CTCGAAGCGA TCTTTCCCGA TGATTTGAAT CTGATCCAGG AGAAGGGAAA CCACACCTGC
AAAATCTGTG GAAACAGAGT AGACAGGCTC TTTTCCATTC GAGAAGGAGA AGAAGAAATC
GCCTGTGACT TCTGCAAGGA AATGTACGAG CTTGGAAGAG AACTTCTTGA AGAGTCTCAC
GTGTATCTTG CTGAAAGGAA GAATGGAAAG TTCGAGATTT TTAAGAGAAA ATTCGATTTC
TCGAGAGAAC CGGGAGAGGG TTTCAGCTAC AAGTTGAGAA GGATATACGA ATTCTCGGAA
AAAGAAAAGA ACGTCAGAAG AATACAGGTG GTGACGTATT TCAAAGAACA GGAGTTCGAG
AAAATCGCAG AAAAAGCACC TGGCAAAAAA ATAGCGAGCC TCCTTGTTGA CGTTGACAAC
CTTGGAAAGA TCTTTCTCAA AGGTTTAAAG AAAAAGACTC TTTCCAGATA CAGCACCCTC
TCAAGGCTCA TGAGCTTTTT CTTCAAAGAA AGAGTAGAGA GTATTGTTGA AGGAAAGAAC
GTTATGGTGA TTTATTCCGG CGGAGACGAT CTCTATCTGG TTGGCGGATG GAACGATGTT
CTGGATGTGG CAAAAGAGTT GAGAGAGGCG TTTGGAAGAT TCACAACGAA CGACTTCATG
ACGTTCTCCG CGGGATACGT GATCACCGAT GAAAAGACAA GCATGAGCCT AATAAGAGAA
ATGTCTGAAA GAGCCGAAAG CGCTGCCAAG AAATCCGGGA AGAACAGCAT AGCATTTTCG
AACAGAAACT ACTATGCGGT AAAGTGGAAC ACCTTCTTCG AAATGTACAA CTTTTATCAA
GAGTTGAAGG AAATAGCAGA CAAAGTGGAC AGAAGTGTCA TTAGAAAGGC TTTGAATCTC
ACAAGGGAAG AGTCTCCTCT GAACAAAGCC TTCCTCGCCT ACATAGAAGC AAGGGAGAAC
AAAGACGAAG ACAAAAGAGT GGCAAATCTC ATGAGAGAGA ACATAGATCA CCTCGGTGAA
AACGCCTTGA ATGTAATCCT CCAGTTTGTG GATCTTCTCT CAAGAAAAAG CTGA
 
Protein sequence
MKDREELVVG ALLHDIGKVV RRAGDDRRHQ IAGYDFTNKV KKFAVIQDYI HYHHEKDLLK 
KSLENEKVWY VCFADNLSSK ERMTEGQKFE ELRRMDNLLS KIPEGESSRN VTYFPAKPAN
EVVEAVKDMK EDQKTYEDLY RRFVEDAQKI SPTPDDVNFL TYKYFSFIPQ ETRVEGDMDI
SLYDHLKVTA MLALSLYDYA KENDLKFESY QEMKSHFENS NVKPFLLVGG DVSGIQNFIA
NVSSKGALRS YRGRSFFIEI LQEVVVDEIL DKTGFYRTNV HFIGGGHFYL VLSNTEKVKK
ALEEIRNELN EWFRNRGLSL HLVIESVEFS VKDVEDMSKV FKKIGEKLNE RKYRMYTEKD
LEAIFPDDLN LIQEKGNHTC KICGNRVDRL FSIREGEEEI ACDFCKEMYE LGRELLEESH
VYLAERKNGK FEIFKRKFDF SREPGEGFSY KLRRIYEFSE KEKNVRRIQV VTYFKEQEFE
KIAEKAPGKK IASLLVDVDN LGKIFLKGLK KKTLSRYSTL SRLMSFFFKE RVESIVEGKN
VMVIYSGGDD LYLVGGWNDV LDVAKELREA FGRFTTNDFM TFSAGYVITD EKTSMSLIRE
MSERAESAAK KSGKNSIAFS NRNYYAVKWN TFFEMYNFYQ ELKEIADKVD RSVIRKALNL
TREESPLNKA FLAYIEAREN KDEDKRVANL MRENIDHLGE NALNVILQFV DLLSRKS