Gene TRQ2_1018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_1018 
Symbol 
ID6092448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp1064228 
End bp1065865 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content42% 
IMG OID642488214 
ProductCRISPR-associated Csh1 family protein 
Protein accessionYP_001739051 
Protein GI170288813 
COG category 
COG ID 
TIGRFAM ID[TIGR02556] CRISPR-associated protein, TM1802 family
[TIGR02591] CRISPR-associated protein, Csh1 family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTGGAGA AGATATACAA TCTTGGGAAA ATCGGAATCG AATCTCTCTC AGATTTCGCC 
GAAGAACTTC CTCTTGAAGA CGGACTGGCT ATTTTTCTCG AAAGTGGTCC AAATGGTTTG
ATTTTCAAAG ATATTGGTCC CGTTAAGAAA AGCCCTGGTG GAAAGAGTTC GAAGGTTTTC
CTCTACAGAA AACAGCGCGG GAATTTCTCT TCTTCTGTTT CCCCCACCAT GAAATTCCTC
GACGAATCTA GTGTTTCGAG AGACCCTCTG GAAAGGACCT TTGATGTCTT CTTAGGCTTT
TTCGACCATC CGAAGATATC TCACATCAAA GATGTCCTTG AGAGAAATAA AGAAAAGATT
ATTGAAAAGC TAAGAAATTA CGATTTAAAA AACAGATTTC TCACGATATC CATCGATGGA
AAATTTCCCG CAGAAGTTCC AGAAATTCTC CGGGCGTTTG AAGATAAAGT CACACAGAAG
AAAACTAAGG GAGAGGCGAA ATGTTTCCTG TGTGGAAAAG AAGCAAATTC CAGACTGAGT
GATGTGTTCA AATTTGCCAC CTTTGACAAG CCTGGCTTCA CACCTTTTCT CTCCAGGAAA
CACCCGATCC AGATTTGTGG TGAGTGCAGG AGTGTTTTGG AAAAAGCCAG AAGAGTAATT
GATGAAAAGC TTTCTTTCTC CTTCTTCAAC AACAGAATTC TCTGGATAAT TCCCTCCGTG
CCCAATCAGG ATATTCTGGA ATCAGTGATT GAAAAAATAT CTGAAATAAA AGACACCGAC
AAAAAATCAA AGCTTCGCAG TTTTGCTAGA CTCGAAAGAA AAATAGAAGA TGTTCTCTCC
GAAAATGAGA AAGCTGTCTA CGATTTCATA CTGATAGAAA AGGAACAGCA GGCGGAAAGA
ATAGTTCTGC ACATAGAGGA GGTTTCACCT ACAAGAGTGA GACAAATCCT CGATGAGTCA
GATAAAACAG AATCTCAACT CAAGGAAGAT GGCTTCGAAA TTTCTGTGAA CTTCTCGGTG
ATACACAAAT TCTTCGAGGA TCTGGACAGA TACTTCAACT CCCTGTTCAG TGCTGTGTTC
TCTGAGGGGA CGTTTGATAA AAAACTGCTG CTGACTCTCT TTCTTTCGAA AATAAGAAGC
GATTTCTTTG GAAATGATAC TTTACTTTCT GCACGTGAGG CTTTCGCCAC GTACGTTTAT
CTGAGACGTT TGAACGTCTT AAAGGGGGGT GCTTCCGTTT TGAAGGGTGA AGATTTCTTC
TCAAGGTATC CGGAGTTTTT CGATGAACCA TGGAAGAAGG CAGTCTTCTT AGAAGGAGTT
CTGGCAAATT ACCTTCTTTA CCTGCAGTAC GTTAAAAGAA ACTCGAAGGC TTTCACGAAG
AAACTGAAAG GACTCAGACT GACAAAGAGA GATGTGGAGG GGCTTCTCCC AGAAATCAGA
GCCAAGATCG AAGCCTACGG TGGAATGAGT GAAAGTGTGG CAGAGCTTTT CAGGGAGGCT
ACTGAAGCCT TTCTCGAAGC TGGAAACTGG TCAGCATCGC CCGATGAGAT CAGTTTTGTC
TTTGTTTCGG GACTTTCCCT CGGGAAAACT TTCTTCAAGG AGGTAGAGGT TGATGAATCC
GGTGAAAAAC AGGAGTGA
 
Protein sequence
MLEKIYNLGK IGIESLSDFA EELPLEDGLA IFLESGPNGL IFKDIGPVKK SPGGKSSKVF 
LYRKQRGNFS SSVSPTMKFL DESSVSRDPL ERTFDVFLGF FDHPKISHIK DVLERNKEKI
IEKLRNYDLK NRFLTISIDG KFPAEVPEIL RAFEDKVTQK KTKGEAKCFL CGKEANSRLS
DVFKFATFDK PGFTPFLSRK HPIQICGECR SVLEKARRVI DEKLSFSFFN NRILWIIPSV
PNQDILESVI EKISEIKDTD KKSKLRSFAR LERKIEDVLS ENEKAVYDFI LIEKEQQAER
IVLHIEEVSP TRVRQILDES DKTESQLKED GFEISVNFSV IHKFFEDLDR YFNSLFSAVF
SEGTFDKKLL LTLFLSKIRS DFFGNDTLLS AREAFATYVY LRRLNVLKGG ASVLKGEDFF
SRYPEFFDEP WKKAVFLEGV LANYLLYLQY VKRNSKAFTK KLKGLRLTKR DVEGLLPEIR
AKIEAYGGMS ESVAELFREA TEAFLEAGNW SASPDEISFV FVSGLSLGKT FFKEVEVDES
GEKQE