Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_0997 |
Symbol | |
ID | 6164806 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 888778 |
End bp | 890253 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641668150 |
Product | CRISPR-associated helicase Cas3 |
Protein accession | YP_001794375 |
Protein GI | 171185456 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.242255 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0349281 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAGGG CGGTGGAGAG GGGGCTTGAG CTGGCGCGCG CGGGCGCTTC AAGGATTGTC TTGGAGCTTC CGACGGGATA CGGGAAGACC TACGCCGCGC CGCTGTTTTA CAAGGCTCTG AGGGAGGGGG GTCTCTGCAG TAAGGCGATC CACGTCATGC CCCTGAGGGC TCTCATTAGG AAGCAGATCG AGAGGTACGC CTCTGCCCAC CCCGATATAA AGTTCGCGTA TCAAGACGGC GACGTCCTCC TTCGGGATAG GTACGTGAAG GATCCGTACT TCCGCGAGGA GTACGTCCTC ACGACGCTGG ACTCCTTCAT CCACAACCTG GCGAAGGTCC CCGTCGCAGA GATGTGGAAG TTCTACAAGT CCGAGAGACC GTCGGTGAGG TACCACATAC CCCTCGCCTA CATCTACCCC TCGTGTGTCT TTCTAGACGA GGCGCATATG GCAGCTGGCT CCAGGAAAGA GCTCGCCGCA GTAAAGGCGG CTGTGAGGTT CCTCAAAGAG CTCGGCGTGC CCACCGTCGT CATGTCGGCA ACTCTTGGCG AGTGGAAGAG GGATGTATTC AAGGGCTTTA GGTTTGTGTC GCTTGGGGAG AGGGATGAGG AGAGGGGCGG CGACGTCAAA GTCGCCGATC CGGAGTTCGA ACGGCGGATG TCGAACGTAG CGTACAAGGT GGGGAGGGTT GACAATATAG AGGAGGTGGC GAGGCGTAAG GCGGAGGAGG GGAAGAGGGT GCTCGTGATA GTTAACGACG TCTGCAGAGT CCAGCGCCTT TCCGAGACGT TGGGGGCGCC GGCGGTGCAC TCCCTCATGA CGCGGAGAGA CCGCGAGAGG GCTGAGCGCG AGCTTGATAA GGCTCAGATT GTGGTGGGTA CAGACGCGAT TGGGGCCGGC GTCGACGTAG ACTTCGACGT GTTGATAACC GAAGCCGACG AGGTGGAGCG GGTCGCGCAA CGGGTGGGGC GCGTCTGTAG AAACAGGGAG AGCTGCGTAG GCGAGATATA CTTCGTCGGC GAGGTCGATG AGGAGCTCCT CAAGGTCAAG AACTGGCGCC TCCCGCACAA GCCGGACAGC TACCTACGCC TCCTCCGAGG GGAGGCCGAG GTAGACGAGA AACACTTCTG GTTTCTGGGA ACCGTCATGA AGGAGGAAGT GGTGGATACA CACAACTTGA GGGAAAAGTT GCTGGAGGAG GGCGGCTCCT TCGTCCGCCA CGCCCTGCTG GAGGCGGAGG TGGGGGAGGG GCCCGAGGAG TCCTTCACAG TTTCGCTGGA CAGGCTGGGA CAGCTGGAGT TTGAGGTGTA TGTTGGAGAG AGGCGGATTG AGCCGCCCCG TAGCTACGGA GATAGGGAAT TGATGGAGTG GCTTCTTGAC GTCGTCGATG AGTACGGCGA TGTGCCGAGG ATCCGCGTTG TGAAATACAG AGAGGGGCTT GGGGCTGTGC TCAAGAAGTG TGGAGAGGGG GTATGA
|
Protein sequence | MRRAVERGLE LARAGASRIV LELPTGYGKT YAAPLFYKAL REGGLCSKAI HVMPLRALIR KQIERYASAH PDIKFAYQDG DVLLRDRYVK DPYFREEYVL TTLDSFIHNL AKVPVAEMWK FYKSERPSVR YHIPLAYIYP SCVFLDEAHM AAGSRKELAA VKAAVRFLKE LGVPTVVMSA TLGEWKRDVF KGFRFVSLGE RDEERGGDVK VADPEFERRM SNVAYKVGRV DNIEEVARRK AEEGKRVLVI VNDVCRVQRL SETLGAPAVH SLMTRRDRER AERELDKAQI VVGTDAIGAG VDVDFDVLIT EADEVERVAQ RVGRVCRNRE SCVGEIYFVG EVDEELLKVK NWRLPHKPDS YLRLLRGEAE VDEKHFWFLG TVMKEEVVDT HNLREKLLEE GGSFVRHALL EAEVGEGPEE SFTVSLDRLG QLEFEVYVGE RRIEPPRSYG DRELMEWLLD VVDEYGDVPR IRVVKYREGL GAVLKKCGEG V
|
| |