Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mhun_0735 |
Symbol | |
ID | 3923162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanospirillum hungatei JF-1 |
Kingdom | Archaea |
Replicon accession | NC_007796 |
Strand | - |
Start bp | 845516 |
End bp | 846511 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637896372 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_502207 |
Protein GI | 88602029 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0403946 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.187894 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTCGG TGCTGATTAC AGGTGCCGGC TATCGAATTC GCAAGCGTGG GGATGTGTTG ACGATTGAAA CGGGGAAGGA TAGTGATACT GCTGAACCTC CCAGGACCCT CTCACCACTT GGTCTTGACC TGCTGGCAAT AGCGGGCGAT CACTCAATTT CCACTGCTGC GGTTCGTCTG GTAACCTCTC ATGGTGGGGC CATTGCACTC ATGGACGGAC TAGGAAATCC TTTTGGACAT TTTCTCCCTC TTGGAAGATC TGCTCTCATT GAACAATATG AGGCTCAGGC TTCTGCTCCG GAGGAGAGAA GACTTGAGAT TGCCCGGTCG ATATGCACTG GGGCACTGGA GAATAAACGA ACGCTTCTTT CAAATCTTGA ACGTATCCGT GGATTTGATC TCTCACGAGA GATTAGACTT GTTGAGGATG CACAGGATAA GGCTCTCGAA TGTCAGAGTC TTGATAGTCT CCGGGGCGTA GAAGGATCAG GTGCTCATGC GTATTTTCAG GGTTTTTCAT TGGCTTTTGA TGAGGAATGG GGTTTTTTGG GCAGGTCGCA GAATCCTGCA ACTGATCCAG TGAACAGTTT ACTCAGTTAC GGATACGGGA TGCTCTATAT TCAGGCCAGG CAGGCACTGG TGCTGTCAGG GTATTCTCCT TATTATGGGG CATATCATGA AACATATAAA AAGCAGGAGG CACTGGTATA CGATCTGGTT GAGGAGTTCA GACAACCAGT GGTTGACCGG ACCGTTGTGA CATTTCTTGC TAAACATATG GCCACACCTG ATGATTTTAC CTATCCAGAT GAGGGTGGTT GTATGATCGG AACGATGGCA AAGAAGAAGT ATGCAGCTGC TGTACTTACT CGAATACATG GGAAAGTAAA ATATGAAGAA CAAACATTTC AGGATATTTT CAAAAGGCAG GCGGAAAGGA TTGGAAAAGC ACTGACTGAA GGGGATGAGT ATGTTCCGTA CCGGTACCGG ACATGA
|
Protein sequence | MNSVLITGAG YRIRKRGDVL TIETGKDSDT AEPPRTLSPL GLDLLAIAGD HSISTAAVRL VTSHGGAIAL MDGLGNPFGH FLPLGRSALI EQYEAQASAP EERRLEIARS ICTGALENKR TLLSNLERIR GFDLSREIRL VEDAQDKALE CQSLDSLRGV EGSGAHAYFQ GFSLAFDEEW GFLGRSQNPA TDPVNSLLSY GYGMLYIQAR QALVLSGYSP YYGAYHETYK KQEALVYDLV EEFRQPVVDR TVVTFLAKHM ATPDDFTYPD EGGCMIGTMA KKKYAAAVLT RIHGKVKYEE QTFQDIFKRQ AERIGKALTE GDEYVPYRYR T
|
| |