Gene Tpet_1084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_1084 
Symbol 
ID5171159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp1112028 
End bp1114319 
Gene Length2292 bp 
Protein Length763 aa 
Translation table11 
GC content41% 
IMG OID640563601 
ProductCRISPR-associated helicase Cas3 
Protein accessionYP_001244674 
Protein GI148270214 
COG category[R] General function prediction only 
COG ID[COG1203] Predicted helicases 
TIGRFAM ID[TIGR01587] CRISPR-associated helicase Cas3
[TIGR01596] CRISPR-associated endonuclease Cas3-HD 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAAAA TTCTTGCCAA GAGCAACGGT GTTACTCTCA GAGAGCACGT ACTGGATTTA 
TTGGAAGTAC TGGAGAATCT TCAGATAGAT TCTGGAGTAA GAGAACTTGC TGGGAAAGCT
ATCCTTTATC ATGATTTGGG TAAAGTAATA TACGCGTTTC AAAAAAAAGT TGGAGGAGAT
GTCCCGGAAG ATGGTATTCC CGACATTCCG CACAGCTTTC TGAGTATAGC ATTTATCCCG
GAGAATACTC TCAGAGAACT CGGTGAAGGA CTTTCTCGCA TTTTTCTTTC AGCGGTTCTT
TACCATCATT GGCGAGAAAC CTATCTTGAC TATCTGTTTG GAAGAAAAAA AGAAAGTGTT
TGTAAAGCTT ACGAAAGACT TCTTGAAGTA GGTGATGAGA TAGTTTGGCT TCTCAGAGAA
GAAATGGGAG ATCTCTGTGA AATTGAATTG AACAAACCGC TGTGTGAATA CTTATCACAT
AACTCGATAA TTGACAGTGG CTTAGTATTC CCTCCTCATT TGATTTCTCT GCTTCCGAGT
ATCATCCTTA AAGAACTGAA TATAAGAGAT GAAAATTACA AGCGTTACAT AATCACCTCT
GGGACAGTCA TGCGAGCCGA TAGATTCGCA TCTTACGTGG AAACGGCTGG AAATAAGGAG
CTTCTCAAGA AAGCAGATAA AAGACTGGAA AGAGATCAGC TTGAGACTGT TCGAAGATAT
CTGGAGAGAA TATCCAACAA AGTATGGCAA CTGGATCTTC TGAAAGATTG TCGAGGAGAA
AATATTGTAC TTGTGGCGCC GACCGGAGCC GGAAAGACAG AATTTGCCCT GATGTGGAGC
AAAGGAAAAA CCTTGTTCAC TCTTCCTCTT CAGAGCGCAA CGAACATGAT GTACGAAAGA
GTTAAGAATT ATTTTGGTGA AGAAAACGTC GGTCTTCTAC ATTCCGATGC TGCTGTTTAT
CTGTTTTTCT CCAGTTTTCT GAAAAACAAC TTTGAGGACA GAGAAGGAGA AGTGCTTCAG
ATAGTGGAAC AATCAAGATT CTTTTCTCAT CCTTTCGTAA TATCGACGGG GGATCAAGTC
TTTCCTTCTG CTTTGAAATA TCCGGGATAC GAGATGATCT ATTCAATCCT GGCTAATTCC
TATCTTGTGA TAGACGAAAT ACAGGCCTAT TCTCCCGAAG CTGCAGCGAT AATTGTCAAG
ACAGCTGAGG ATGTGAAACA ACTCGGAGGA CATTTTTTGA TAATGACAGC TACACTGCCT
GGATTCATCA GGGATGAGAT CTTAAAAAGA GCCGAGCTAG AGGAAAAGAA TATAAAAGAT
GTTTACGAAG ACATTCCTCA GGGAAGATTA CTACGGAATT TGGTTAAAGT TGAAAAAAGC
TTGGATCCCG TGGAAAAAGC TGTTGAATTT TTCAAAAAAG GATGTAGGGT TTTGATAGTA
AGAAACACAG TGCGGAATGC GATTGAAACT TATCGGCAAC TTGTAGAGAA ACTGGGAAAA
GAGGATGTTC TACTGATACA TTCCCGGATG ACTCTGGAGG ATCGAAGAAG AATAGAGGAG
ATTTTGGAAA GTTACCGCCC AGGAGCGAAG GGAAGAAACA TTATTCTCGT ATCAACACAG
GTTGTGGAGG CTTCAATGGA CATTGATTTT GATATTCTGC TTACAGATAT AGCACCAGCG
GATTCTCTGG TGCAGAGGAT GGGAAGAATC TTCAGAAAAA GGGAGTGGAC TGAAGAATTT
CCGAATACTT TCATATACGC TGATAGCAAT GAGAAAAAGC GAAAGGAGCT TATCGCCGGG
GTTTACAGTG AAGATGTTGT ATCTGCAACT TTGAACGCGT TAGAAGGCAT GTTTGACATA
GAAAAACCAT TTAAGCTGGA TGAGCTATCA AAAAGAAAAT GGGTGGAAGA GACCTATCAG
GCACTTTCTC AAGGAAGCAA CTATTTAAAG AAATTTCGGG AGACTCTTGA TGTTCTCGAT
AGCGGTTATT CCTCAGAAAA GAAGCATGAA GCCCACAGGA TTTTCAGAAG GGTAGTATCC
ATGAGCATTG TTCCGGAAAA TTTGAAAGAA GATCTTAAGC GGGAGCTGAA GAGTGTATCG
GATTACTTTG ACTTCAGGCG AGTTACTGCA AAATATCTTG TTGATGTTCC GCTCCACCAC
ATAAAAAGTG GAGCTTTAGA GCCTTTGGAG GTAGAATCCG AGAATGAACA AGTATTGAAA
TGGGCAAGCG CGCTTTATGT GTTGAAGGGT AGCAGGTACG AAAAAGGTCT GGGAATTTTC
CTGGAGGAAT GA
 
Protein sequence
MSKILAKSNG VTLREHVLDL LEVLENLQID SGVRELAGKA ILYHDLGKVI YAFQKKVGGD 
VPEDGIPDIP HSFLSIAFIP ENTLRELGEG LSRIFLSAVL YHHWRETYLD YLFGRKKESV
CKAYERLLEV GDEIVWLLRE EMGDLCEIEL NKPLCEYLSH NSIIDSGLVF PPHLISLLPS
IILKELNIRD ENYKRYIITS GTVMRADRFA SYVETAGNKE LLKKADKRLE RDQLETVRRY
LERISNKVWQ LDLLKDCRGE NIVLVAPTGA GKTEFALMWS KGKTLFTLPL QSATNMMYER
VKNYFGEENV GLLHSDAAVY LFFSSFLKNN FEDREGEVLQ IVEQSRFFSH PFVISTGDQV
FPSALKYPGY EMIYSILANS YLVIDEIQAY SPEAAAIIVK TAEDVKQLGG HFLIMTATLP
GFIRDEILKR AELEEKNIKD VYEDIPQGRL LRNLVKVEKS LDPVEKAVEF FKKGCRVLIV
RNTVRNAIET YRQLVEKLGK EDVLLIHSRM TLEDRRRIEE ILESYRPGAK GRNIILVSTQ
VVEASMDIDF DILLTDIAPA DSLVQRMGRI FRKREWTEEF PNTFIYADSN EKKRKELIAG
VYSEDVVSAT LNALEGMFDI EKPFKLDELS KRKWVEETYQ ALSQGSNYLK KFRETLDVLD
SGYSSEKKHE AHRIFRRVVS MSIVPENLKE DLKRELKSVS DYFDFRRVTA KYLVDVPLHH
IKSGALEPLE VESENEQVLK WASALYVLKG SRYEKGLGIF LEE