Gene Mhun_0735 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMhun_0735 
Symbol 
ID3923162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanospirillum hungatei JF-1 
KingdomArchaea 
Replicon accessionNC_007796 
Strand
Start bp845516 
End bp846511 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content47% 
IMG OID637896372 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_502207 
Protein GI88602029 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0403946 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.187894 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCGG TGCTGATTAC AGGTGCCGGC TATCGAATTC GCAAGCGTGG GGATGTGTTG 
ACGATTGAAA CGGGGAAGGA TAGTGATACT GCTGAACCTC CCAGGACCCT CTCACCACTT
GGTCTTGACC TGCTGGCAAT AGCGGGCGAT CACTCAATTT CCACTGCTGC GGTTCGTCTG
GTAACCTCTC ATGGTGGGGC CATTGCACTC ATGGACGGAC TAGGAAATCC TTTTGGACAT
TTTCTCCCTC TTGGAAGATC TGCTCTCATT GAACAATATG AGGCTCAGGC TTCTGCTCCG
GAGGAGAGAA GACTTGAGAT TGCCCGGTCG ATATGCACTG GGGCACTGGA GAATAAACGA
ACGCTTCTTT CAAATCTTGA ACGTATCCGT GGATTTGATC TCTCACGAGA GATTAGACTT
GTTGAGGATG CACAGGATAA GGCTCTCGAA TGTCAGAGTC TTGATAGTCT CCGGGGCGTA
GAAGGATCAG GTGCTCATGC GTATTTTCAG GGTTTTTCAT TGGCTTTTGA TGAGGAATGG
GGTTTTTTGG GCAGGTCGCA GAATCCTGCA ACTGATCCAG TGAACAGTTT ACTCAGTTAC
GGATACGGGA TGCTCTATAT TCAGGCCAGG CAGGCACTGG TGCTGTCAGG GTATTCTCCT
TATTATGGGG CATATCATGA AACATATAAA AAGCAGGAGG CACTGGTATA CGATCTGGTT
GAGGAGTTCA GACAACCAGT GGTTGACCGG ACCGTTGTGA CATTTCTTGC TAAACATATG
GCCACACCTG ATGATTTTAC CTATCCAGAT GAGGGTGGTT GTATGATCGG AACGATGGCA
AAGAAGAAGT ATGCAGCTGC TGTACTTACT CGAATACATG GGAAAGTAAA ATATGAAGAA
CAAACATTTC AGGATATTTT CAAAAGGCAG GCGGAAAGGA TTGGAAAAGC ACTGACTGAA
GGGGATGAGT ATGTTCCGTA CCGGTACCGG ACATGA
 
Protein sequence
MNSVLITGAG YRIRKRGDVL TIETGKDSDT AEPPRTLSPL GLDLLAIAGD HSISTAAVRL 
VTSHGGAIAL MDGLGNPFGH FLPLGRSALI EQYEAQASAP EERRLEIARS ICTGALENKR
TLLSNLERIR GFDLSREIRL VEDAQDKALE CQSLDSLRGV EGSGAHAYFQ GFSLAFDEEW
GFLGRSQNPA TDPVNSLLSY GYGMLYIQAR QALVLSGYSP YYGAYHETYK KQEALVYDLV
EEFRQPVVDR TVVTFLAKHM ATPDDFTYPD EGGCMIGTMA KKKYAAAVLT RIHGKVKYEE
QTFQDIFKRQ AERIGKALTE GDEYVPYRYR T