Gene Tpet_1098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_1098 
Symbol 
ID5170121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp1126847 
End bp1128493 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content37% 
IMG OID640563615 
Producthypothetical protein 
Protein accessionYP_001244688 
Protein GI148270228 
COG category 
COG ID 
TIGRFAM ID[TIGR02221] CRISPR-associated protein, TM1812 family
[TIGR02549] CRISPR-associated DxTHG motif protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0288139 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGAAG TTGTAAAACT GGTTACATTT CTTGGCCTGG GTAAGTATGA AGAAAGCGAA 
ATTAAAATTA ATGGGAATCT TTTAGATGGG TGTGTAAAAA ATGATGGCTC CCAGGATATT
TTTTGTGTTC GACAATCGAT TTTCCCACTA GCATTAATTG AATATCTGAA TCATATAGGA
AAACAAGTTG AAACAATATT CTTCCTAACA AAAACCGTGA AAGAAAGCAA AACGTGGGAT
GAGTGTAGAA GATTTTTGAG ATCAAATATT AAAGATTTTG AAATACCGGG AGGAGAATCT
TCAAGTAACT TGATGAATTT CCTTGTAAGA AACCTTGTGG AAAATTTGGA AAAAGGGGAC
AGAGTTATAC TGGATGTTAC TCATTCGTTC AGAAGTATTC CATTGATGGC AAGTGTAGTA
GCCCTGTACC TTAAAGAGGC CAAAGATGTT AATGTGAGTG TAGTTTACGG CAAGTACAAT
AAAGAAACAA AAGTAACAGA GTGCGAAGAT TTGACCCCAC TAACTAAAGC TACATCATGG
ATATATGCTG TTCGATTGTT CAAAGAGTAT GGATATGCGA AAGAACTCGC TGATTTGATA
AAAAAGAGAA ATGAAGAAAT ATACAGGGGA AGTCAAAGTT CGAAAAAACC AAAGCTACTT
GGTTCTATGT CTCAAAAACT TCAGGATCTT TCGTCTTCTA TACGTCTTGG ATCCATAGTA
GCCATAAGGA AGAATTTAAC CAATTTCTTC AATTTTATTG ATAGAAATAA AGCAAGAATA
AGAGAGGAGA CGGAGGTTTT TGTTCCAGAG ATTGCGGCTC TCTTAGATGG GATCGAAAAA
AGATACAGGG TTATACATGT GAAATCGGAG AATTTTGAAC TGAGCGAAAA AGAATTGGAA
TCTGAAAAAG AGTTACTGGA TTTCTATCTT CAAACAGGAG ATTTAGGTAT GGCTCTTCGT
TTGGCAAGAG AATATCTTAT AAATGTCTAT TTGATGTCTG GAGGGGAGAA GAGTGATTTC
TTGGATAGAA ATGTTAGAGA GTCTGTGAGC ATTTCAACGT TTGGTTATGA TACCATCCTT
CAGGCAAGAA ATCATGTTGC TCATTTCGGA TTCAACAAAC TCCAGCTTCC CTCCTTGAAA
AAGATAGAAG ATCACCTGAA AGTGCTTGTT CAAACCCCAC CGGAACAACT TCTGGAGTCT
GCGAGAAAAA CTCAGCGAAA CAGAAAAAGA GCTCTTCTCA CTCCTCTCGG TACGACAAAA
GGTGCTTTAT ACACCGTTCT GAAAAAAATT TCACCTGATC TACTCCTGGT TATAACTTCA
AAACAGGGAA AAGCTATTCT TTCAGAAATT CTGGAGAAAG CTGAATTCAA GGGTGAATTT
AGGGTAATCC TTCTCGAAGA CCCTTTTATG GGTGTAAGTG AAATAGATCG AGTAGTATCC
GAAATCAAAG AACATCTTTC TGACGTGGAT GAAGTGATCG TCAATCTCAC AGGTGGAACC
ACTTTTCTTA CGTACGTTAT CGAGCGTGCC AAAAATCAAA TTAGATACGG AAGAAAAGTA
AAAACTATCC TGGCTGTGGA CAAGAGAACT TACGAAGAAC AAAAGCAGAA TCCTTTTGTG
GTGGGTGAAA TTCTGGAGCT TGATTGA
 
Protein sequence
MGEVVKLVTF LGLGKYEESE IKINGNLLDG CVKNDGSQDI FCVRQSIFPL ALIEYLNHIG 
KQVETIFFLT KTVKESKTWD ECRRFLRSNI KDFEIPGGES SSNLMNFLVR NLVENLEKGD
RVILDVTHSF RSIPLMASVV ALYLKEAKDV NVSVVYGKYN KETKVTECED LTPLTKATSW
IYAVRLFKEY GYAKELADLI KKRNEEIYRG SQSSKKPKLL GSMSQKLQDL SSSIRLGSIV
AIRKNLTNFF NFIDRNKARI REETEVFVPE IAALLDGIEK RYRVIHVKSE NFELSEKELE
SEKELLDFYL QTGDLGMALR LAREYLINVY LMSGGEKSDF LDRNVRESVS ISTFGYDTIL
QARNHVAHFG FNKLQLPSLK KIEDHLKVLV QTPPEQLLES ARKTQRNRKR ALLTPLGTTK
GALYTVLKKI SPDLLLVITS KQGKAILSEI LEKAEFKGEF RVILLEDPFM GVSEIDRVVS
EIKEHLSDVD EVIVNLTGGT TFLTYVIERA KNQIRYGRKV KTILAVDKRT YEEQKQNPFV
VGEILELD