Gene Tpen_1354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1354 
Symbol 
ID4602191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1307106 
End bp1308806 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content70% 
IMG OID639774129 
ProductCRISPR-associated helicase Cas3 
Protein accessionYP_920754 
Protein GI119720259 
COG category[R] General function prediction only 
COG ID[COG1203] Predicted helicases 
TIGRFAM ID[TIGR01587] CRISPR-associated helicase Cas3 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.245938 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCTTGC AGGAGTACTG GGGGGTAGTG GAGAGGGTCG TCTCGTCGAG GAACAGGAGA 
CCGGTGCGCT ACTCGTACTT CGAGGACGTC TGGAGGGCCG TAGACGAGGG GCACAGGCTG
GTCGTCGTAA GGGCACCCAC TGGTTGCGGC AAGACGGAAG CGGCCACCGC GCCGTTCATC
AGCGACGCGG CTAAAGGCTC GAGGCGCTGG GTCTCGCTCG TCTACGCGCT ACCCACCCGC
TCCCTCGCCT CGGCGATGCT GAGGAGACTG TCCCGCTCGC TGGCCGCGGC GGGGGCGGAG
TGGACCACCG CGACGCTGAG CTACGGGGGC CTCTGGGAGG CCAGGCCGTA CCTCGAGGGG
GACGTGGCAG TCACGACGTA CGACACGCTC CTCCACCAGT TCTACGGCGT CGTCTCCCCG
GGATACCACC TCCTCCTGCC CGCGGCGAAA GTCTCTGGCT CGCTCGTGGT CCTCGACGAG
ACCCACCTGC TCCAGGACGC CCACTGGTAC GCCCCGAGCC TTCTCCCGGC CCACGTCGCG
TCCCTCGTAT CCCTGGGGGC CCAGGTGCTC GTGGTGGGGG CCACGGTGCC GGAGGTCCTG
CTCGAGGAGC TACGGAGGGA GTACAGGCTT GTGAGCCGCG GGGAGGAGCC GGCAGTGGTA
GACGCGGCGG ACGAGCCGGC GAGGGGCAGG CTGGACGTCG AGCTCAGGGG CGGCGGGATG
CCCGTGGAGG GGCTGTGCAG CCTCCTCGAG GGCGCCCCGA GGCCCGCGCT CGTCGTGGTG
AACAAGGTCG AGAAGGCCGT CGAGGCCTAC AGGGCGCTCC GCTCCTGCCT CGGGGGGAGC
GTTGCGCTAC TGCACTCGAG GCTGAGGGGC GGCGTCAGGG CGCGCGTTGA GGGGCTGTTC
GAGGGGGACG GCGCCCCCGG GGACCTAGTC CTGGTCGCAA CCCAGGTCGT GGAGGCGGGG
CTGGACCTCG ACGTGCGCTT CCTCGTGACC GAGGTCTCGC CTGTCGACTC GCTGATACAG
AGGCTGGGCA GGTGCGCGCG GAGGAGCGAC GGCCACGCCG TCGTCTTCCT GGACGAGGAG
GCGGCGCGGA ACGTCTACCC GAGGGAGCTC GTCGAGAGAA CGCTGGGGGT CGTCGACGCC
CAGTCGCTGG CGGAGAGCGT TAGGAGGCTA AGCGTTGCCC GCGAGCTCGT AGACGGGGTG
TACGCGGCGG AGGTCGTCGA GAGGCTTAGG AAGGGGTGGG AGCGCGCCCT GAGCGAGGTG
AAGGGCTGGG CCTTGAGGTT CCCGCGAAGC CTCCTCCACA AGGAAGCGCA CAGAGAGCCG
GGGCCCCTAC TCAGGCTCGG CTACGAGGTC GCCTGCTACC TGCCGGGGAG CGCTGGCGAG
TACGAGGCGT TGCTGGGCGG CGGGGAGACC GCAGTATCCC TGGAAAGGCT CAGGGACTAC
ACCGTGAGGC TGTCCGTGGA GGGGCGCGGA GAGGCCCCGG CGGCCGTCGT CCACGAGGTC
GGCGGCCGCG AAGTCGTCGT CGCGCTCGAG TACAAGCGGG TCGACGGAGG GCTAGCCCTG
AGGGGGAGGC GCATGGAGCC CCGCGCTTTC CCCCGCGCCG TCGAGTCCGG CGAGCTCTTC
CTGCTGAACC CGGCGTTCTA CCTGAGCGAG GGCGGGGACG AGCTCGGGGT GGTTCGGCCG
TGGAGGTCCC GGAGTGCGTA G
 
Protein sequence
MSLQEYWGVV ERVVSSRNRR PVRYSYFEDV WRAVDEGHRL VVVRAPTGCG KTEAATAPFI 
SDAAKGSRRW VSLVYALPTR SLASAMLRRL SRSLAAAGAE WTTATLSYGG LWEARPYLEG
DVAVTTYDTL LHQFYGVVSP GYHLLLPAAK VSGSLVVLDE THLLQDAHWY APSLLPAHVA
SLVSLGAQVL VVGATVPEVL LEELRREYRL VSRGEEPAVV DAADEPARGR LDVELRGGGM
PVEGLCSLLE GAPRPALVVV NKVEKAVEAY RALRSCLGGS VALLHSRLRG GVRARVEGLF
EGDGAPGDLV LVATQVVEAG LDLDVRFLVT EVSPVDSLIQ RLGRCARRSD GHAVVFLDEE
AARNVYPREL VERTLGVVDA QSLAESVRRL SVARELVDGV YAAEVVERLR KGWERALSEV
KGWALRFPRS LLHKEAHREP GPLLRLGYEV ACYLPGSAGE YEALLGGGET AVSLERLRDY
TVRLSVEGRG EAPAAVVHEV GGREVVVALE YKRVDGGLAL RGRRMEPRAF PRAVESGELF
LLNPAFYLSE GGDELGVVRP WRSRSA