Gene Tneu_1134 details

Gene Information

Locus tag: Tneu_1134
Symbol:
ID: 6165757
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 1023368
End bp: 1025068
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 65%
IMG OID: 641668285
Product: CRISPR-associated helicase Cas3
Protein accession: YP_001794510
Protein GI: 171185591
COG category: [R] General function prediction only
COG ID: [COG1203] Predicted helicases
TIGRFAM ID: [TIGR01587] CRISPR-associated helicase Cas3
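
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the last codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1023368, 1025068)
print(length)                     # 1701
print(protein_length_aa(length))  # 566
```

Both results agree with the Gene Length and Protein Length fields of this record.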


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0626867
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000190446
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACTGGTA GAGGGCTTGA CACGCTGGTG AGGGAGCTGA GCCTGGCGTG GTGTAGGTCG 
AAGGGGGAGC CCAGCTGCGC CGTGAGGGAG GAGGTCCTCG CGGAGCAGGC CAAGGCGGCT
GAGGGGATCG AGAGGTCAGA CGGCGTGATC CTCCTCAAGG CGCCGACGGG CTTCGGAAAG
ACCGAGATCT GGACAGCGCC GTTTTTCGCC CAGTGGCTTA GGGGGGAGTG GTTCGCGCCC
AGGTTGTACG TCGTCGAGCC CATGCACGCC CTCCTTAGGC AGATGAAGAG GAGGATGGAG
GTCTACGCCC AGGCCGTCCA AGGCCTGGGG CTCCCGAGGC TGAACGTCGC TGAGGACCAC
GGCGAGGTGG CGAAGCCCCT CTTCCTATAT GGGGGGCACA TAGTCCTCAC CACGGTCGAC
TCGCTGGCCT ACGGCTACCT CGCGAGGCGG GTGCAGAGGT GGCGGGAGGA GGGCGTGGAG
AGGGGGAGGT ACAGCATGCC CGCGGGCCTA CTCGCAAGCG CCTACATTGT CCTAGACGAG
GCCCACCTCA TACAGGACGA GGCCTACCTA GGCCCCAGGG TGCTGGGGAA GATAGTCTGC
GACCTGGCCT CCGCCGGCGC CAAGGTCGTC ATATCCACCG CCACGGTCCC CGAGACCTTC
CTCAAGCACA TCCCGTGTCT CGGCGGGAGG CTGACGCTTG GGTCCGGCAC TGTCAGAAGA
AACGTGGAGG TGGAGAGGAG GAAGGGGGTC CTCAAGGCGG AGGAAATCGA ATGCGGCAGG
CCCACCATCG TCATTGTGAA CACGATAGAG AGGGCCCGGC GCATCTACAA ACAGGTGAGG
TGCGGGAAGA AGGCCGTGGT GCACTCGTTG ATGAGGAGGG AGGACAGGGA GAGGCAGCTG
AGCAGGGTGC TGGCGGACGG AAAGGTGGCC GAGGACGCGG TGCTGATTGG GACCCAGGCG
CTTGAGGTGG GGCTCGACTT CTCCAACCTA AGGGCGCTCT ACACCGAGAC GGCCCCCGTA
GACGCGCTGA TACAGCGCAT TGGGAGGGTG GGGAGAGACG GGGGTAAGGC GGAGGCCTAC
ATCTACGAGG CCGAGGGAGA TGCCCCCTAC CCGCAGACCC TCATGCAGGC CACGCGCGAA
GCCCTTGAGG AGGAGCTACG GGGAGGCGCC GCGCTGACGT CTTGGGAAGA CGCACAGAGG
GCCGTGGACA GAGTGTACAA CGAGAAGGCC GTGGAGGAGC TCATGACGAG GGGGCTGGCG
TGGTACGGCC AAGCGCTCGG CTACCTGCAG GAGCTGTCCC TCTTCTCCTA CCCGCCCAGA
GGAGAGGTGA GGATAAGGCC CTCCAGCTAC ATCACGCTTG TTATTGCCGA CGTGAAGCAG
GATGGGGATA AGGGGCGCTA CATCACGGAG GAGGACGTGG AGAGAGGCGC GATGAAGATG
AGCTACACCT CCAGAGAGGA CCCCAGGATT AACGCCCTGC TCCAGAAGGT ATCCACCGCA
TATACGGTGA GGGGGATCGC CACGGCGAAG GACGAGACTC TCTACTACCT AAGCGAGCTC
CGGGGGGGCT GGGATGGGGT TGAGGTGGTT GTGGTGGACA GGAGGGACGT GGAGGAGCTG
TACGACGAGG CGGGGCTCGA CGTGGCGCAG CTCAGCGGCG GAGGCCAGAA GAGGAGGGGG
AGGGGGCGGA GGAGGCGATG A
 
Protein sequence
MTGRGLDTLV RELSLAWCRS KGEPSCAVRE EVLAEQAKAA EGIERSDGVI LLKAPTGFGK 
TEIWTAPFFA QWLRGEWFAP RLYVVEPMHA LLRQMKRRME VYAQAVQGLG LPRLNVAEDH
GEVAKPLFLY GGHIVLTTVD SLAYGYLARR VQRWREEGVE RGRYSMPAGL LASAYIVLDE
AHLIQDEAYL GPRVLGKIVC DLASAGAKVV ISTATVPETF LKHIPCLGGR LTLGSGTVRR
NVEVERRKGV LKAEEIECGR PTIVIVNTIE RARRIYKQVR CGKKAVVHSL MRREDRERQL
SRVLADGKVA EDAVLIGTQA LEVGLDFSNL RALYTETAPV DALIQRIGRV GRDGGKAEAY
IYEAEGDAPY PQTLMQATRE ALEEELRGGA ALTSWEDAQR AVDRVYNEKA VEELMTRGLA
WYGQALGYLQ ELSLFSYPPR GEVRIRPSSY ITLVIADVKQ DGDKGRYITE EDVERGAMKM
SYTSREDPRI NALLQKVSTA YTVRGIATAK DETLYYLSEL RGGWDGVEVV VVDRRDVEEL
YDEAGLDVAQ LSGGGQKRRG RGRRRR
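
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the method used to produce this record. It assumes the gene sequence is given 5' to 3' on the coding strand, and it uses the amino acid assignments of translation table 11, which are the same as those of the standard code (the two tables differ only in permitted start codons). The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Only the first and last lines of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Amino acid assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence from this record.
FIRST_LINE = "ATGACTGGTA GAGGGCTTGA CACGCTGGTG AGGGAGCTGA GCCTGGCGTG GTGTAGGTCG"
LAST_LINE = "AGGGGGCGGA GGAGGCGATG A"

print(translate(FIRST_LINE))  # MTGRGLDTLVRELSLAWCRS
print(translate(LAST_LINE))   # RGRRRR
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison with the Protein sequence and GC content fields of the record.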