Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_1562 |
Symbol | |
ID | 4461854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 1701139 |
End bp | 1702116 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639700583 |
Product | CRISPR-associated Cas5e family protein |
Protein accession | YP_843972 |
Protein GI | 116754854 |
COG category | [S] Function unknown |
COG ID | [COG5551] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.258656 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTTTTT GTTGCAATTT ATGGGATTTG AAGCCATACG ATTTCAAATT AAAGATATCA ATGCAATTAT CGACAGGTTT AAATCCTTGT ACAACTGATT TTTCATCAAT GCCTCATAAG ATAACACTGA TAGCAAGACC CGTGGACACA TTCGAGGTGC CATCGAGTGA AGGATATCAG CTGTATTCTG CGCTCCTCAA CATAATAAGA CAAGAAGATA AGGATGTTGC AACGCACGCA CATGATTCCC CTCTGAGCAA TCTATCGCTG AGCCCATTGC GCGGCGCCTT CATGCCTGGA GATCGTCCGA GGCACAAGAA GCTCGATCCA GCATGCAGCT ACTCAATGAA GATCGGCATA GTTGATTCCA GGGAGAGCGA GCTTTTCTCA TCCATAGTGA GACAGCTGGT CCTCCAGGAG AGGCGTCTCG TGCTGGAGAA GGGCGAGCTC CAGGTCGAGA GGGTCAGCAC ATCGGCCTCA AGCTTCGCGG AGCTGCTTTC GCCTTCATGC GATCACGAGG ACCCTGGCGT GGATATCAGG TTCATCTCGC CCACATGCAT ACAGTACAGG AACAGCGGTG TCTGCGAGAT GTTCCCTCAC CGTGAGGCTG TCTTCTCATC CCTGCTGGCC AAATGGAACT CATCGTGCCC GGATGGATTC AAGATGGATA TCGAGAGGGA CGAGATGGCC AGGTTCATCA TCGAACGGCC GGTATCATAT GAAACTCACA GCGCGATGGT GAACACAGTA TTCGATAGGA AAAAGGGCCA TCACAGACCG ATCCTGAGGC CTGGATTCAC AGGCAGATGC ATCTACACAT TCACGGACGA TGCGCCTGAT GACGTGAGAA ATGGGATACT GGCTCTATCG AGATTCGCTG AGTTCAGCGG CATCGGAAGC GCTGTGGCGA GGGGATGCGG AGCCGTCGAG GTGCGCATCT GCAAGTCAGA TAAAATCTCC ACAGGCCCCG CACGATAA
|
Protein sequence | MLFCCNLWDL KPYDFKLKIS MQLSTGLNPC TTDFSSMPHK ITLIARPVDT FEVPSSEGYQ LYSALLNIIR QEDKDVATHA HDSPLSNLSL SPLRGAFMPG DRPRHKKLDP ACSYSMKIGI VDSRESELFS SIVRQLVLQE RRLVLEKGEL QVERVSTSAS SFAELLSPSC DHEDPGVDIR FISPTCIQYR NSGVCEMFPH REAVFSSLLA KWNSSCPDGF KMDIERDEMA RFIIERPVSY ETHSAMVNTV FDRKKGHHRP ILRPGFTGRC IYTFTDDAPD DVRNGILALS RFAEFSGIGS AVARGCGAVE VRICKSDKIS TGPAR
|
| |