Gene Tneu_0997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_0997 
Symbol 
ID6164806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp888778 
End bp890253 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content60% 
IMG OID641668150 
ProductCRISPR-associated helicase Cas3 
Protein accessionYP_001794375 
Protein GI171185456 
COG category[R] General function prediction only 
COG ID[COG1203] Predicted helicases 
TIGRFAM ID[TIGR01587] CRISPR-associated helicase Cas3 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.242255 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0349281 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAGGG CGGTGGAGAG GGGGCTTGAG CTGGCGCGCG CGGGCGCTTC AAGGATTGTC 
TTGGAGCTTC CGACGGGATA CGGGAAGACC TACGCCGCGC CGCTGTTTTA CAAGGCTCTG
AGGGAGGGGG GTCTCTGCAG TAAGGCGATC CACGTCATGC CCCTGAGGGC TCTCATTAGG
AAGCAGATCG AGAGGTACGC CTCTGCCCAC CCCGATATAA AGTTCGCGTA TCAAGACGGC
GACGTCCTCC TTCGGGATAG GTACGTGAAG GATCCGTACT TCCGCGAGGA GTACGTCCTC
ACGACGCTGG ACTCCTTCAT CCACAACCTG GCGAAGGTCC CCGTCGCAGA GATGTGGAAG
TTCTACAAGT CCGAGAGACC GTCGGTGAGG TACCACATAC CCCTCGCCTA CATCTACCCC
TCGTGTGTCT TTCTAGACGA GGCGCATATG GCAGCTGGCT CCAGGAAAGA GCTCGCCGCA
GTAAAGGCGG CTGTGAGGTT CCTCAAAGAG CTCGGCGTGC CCACCGTCGT CATGTCGGCA
ACTCTTGGCG AGTGGAAGAG GGATGTATTC AAGGGCTTTA GGTTTGTGTC GCTTGGGGAG
AGGGATGAGG AGAGGGGCGG CGACGTCAAA GTCGCCGATC CGGAGTTCGA ACGGCGGATG
TCGAACGTAG CGTACAAGGT GGGGAGGGTT GACAATATAG AGGAGGTGGC GAGGCGTAAG
GCGGAGGAGG GGAAGAGGGT GCTCGTGATA GTTAACGACG TCTGCAGAGT CCAGCGCCTT
TCCGAGACGT TGGGGGCGCC GGCGGTGCAC TCCCTCATGA CGCGGAGAGA CCGCGAGAGG
GCTGAGCGCG AGCTTGATAA GGCTCAGATT GTGGTGGGTA CAGACGCGAT TGGGGCCGGC
GTCGACGTAG ACTTCGACGT GTTGATAACC GAAGCCGACG AGGTGGAGCG GGTCGCGCAA
CGGGTGGGGC GCGTCTGTAG AAACAGGGAG AGCTGCGTAG GCGAGATATA CTTCGTCGGC
GAGGTCGATG AGGAGCTCCT CAAGGTCAAG AACTGGCGCC TCCCGCACAA GCCGGACAGC
TACCTACGCC TCCTCCGAGG GGAGGCCGAG GTAGACGAGA AACACTTCTG GTTTCTGGGA
ACCGTCATGA AGGAGGAAGT GGTGGATACA CACAACTTGA GGGAAAAGTT GCTGGAGGAG
GGCGGCTCCT TCGTCCGCCA CGCCCTGCTG GAGGCGGAGG TGGGGGAGGG GCCCGAGGAG
TCCTTCACAG TTTCGCTGGA CAGGCTGGGA CAGCTGGAGT TTGAGGTGTA TGTTGGAGAG
AGGCGGATTG AGCCGCCCCG TAGCTACGGA GATAGGGAAT TGATGGAGTG GCTTCTTGAC
GTCGTCGATG AGTACGGCGA TGTGCCGAGG ATCCGCGTTG TGAAATACAG AGAGGGGCTT
GGGGCTGTGC TCAAGAAGTG TGGAGAGGGG GTATGA
 
Protein sequence
MRRAVERGLE LARAGASRIV LELPTGYGKT YAAPLFYKAL REGGLCSKAI HVMPLRALIR 
KQIERYASAH PDIKFAYQDG DVLLRDRYVK DPYFREEYVL TTLDSFIHNL AKVPVAEMWK
FYKSERPSVR YHIPLAYIYP SCVFLDEAHM AAGSRKELAA VKAAVRFLKE LGVPTVVMSA
TLGEWKRDVF KGFRFVSLGE RDEERGGDVK VADPEFERRM SNVAYKVGRV DNIEEVARRK
AEEGKRVLVI VNDVCRVQRL SETLGAPAVH SLMTRRDRER AERELDKAQI VVGTDAIGAG
VDVDFDVLIT EADEVERVAQ RVGRVCRNRE SCVGEIYFVG EVDEELLKVK NWRLPHKPDS
YLRLLRGEAE VDEKHFWFLG TVMKEEVVDT HNLREKLLEE GGSFVRHALL EAEVGEGPEE
SFTVSLDRLG QLEFEVYVGE RRIEPPRSYG DRELMEWLLD VVDEYGDVPR IRVVKYREGL
GAVLKKCGEG V