Gene Mhun_1375 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMhun_1375 
Symbol 
ID3923370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanospirillum hungatei JF-1 
KingdomArchaea 
Replicon accessionNC_007796 
Strand
Start bp1573758 
End bp1574657 
Gene Length900 bp 
Protein Length299 aa 
Translation table11 
GC content49% 
IMG OID637897012 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_502834 
Protein GI88602656 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00127759 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.582034 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCCTA CTCTCCCCGA TATAAAGCCG ATTCCGATAA AAGAGCGTTC TTCGGTGGTA 
TTTCTTGGTA GGGGAGAATT GGATGTTATT GATGGTGCGT TTGTTCTGGT TGATACGAAT
GGGATCAGGA TGCAGATTCC TGTCGGTGGC CTTGCATCGC TGATGCTGGA ACCTGGGTCA
CGGGTTTCCC ATGCAGCGGT CTCTCTTGCA TCGAAAGTTG GTTGTCTGCT TGTTTTTGTA
GGTGAGGGTG GTGTTCGTCT CTATTCTGTT GGTCATCCCG GGGGTGCCCG ATCAGATCGT
CTTTTGTACC AGGCACGTCT TGCTCTTGAT GAGGTATTAC GGCTAAAGGT TGTGAAGAAG
ATGTTCTCTC TCCGGTTTGG AGAGGATTTT TCTGATGCAT ATTCTGTTGA ACAGTTACGG
GGACTTGAAG GGGTTCGTGT CAGGGAAGGG TATCGTAAGA TTGCAAGAGA TACTGGTGTC
ATCTGGAATG GCAGGCGATA TGATCCTCAT TCCTGGGGGA GTGCTGATCT TCCGAATAGA
TGTCTTAGTG CAGCAACCGC CAGTTTATAT GGGATTTGTG AGGCTGCTGT TCTTGCTGCA
GGGTATTCTC CTTCTATCGG GTTCTTACAT ACGGGAAAGC CGCTTTCTTT TGTGTATGAT
ATTGCAGACC TTTTCAAATT TGAAACGGTT GTCCCTGCAG CGTTTAAAAC GGCTGCATTA
AATCCCAGGG AGCCTGAGCG TGAGGTCAGG TATGCCTGCC GCGATTTATT CCGGGAAACA
CAACTCCTCA AGAGGATTAT TCCAACGATT GAGGAGGTGC TGACAGCTGG TGGCATTTCT
GCGCCTGCTC CTCCTGACTG GGTTGTTCCA CCGGCGATTC CTGTTGATGA GGAGGGATGA
 
Protein sequence
MTPTLPDIKP IPIKERSSVV FLGRGELDVI DGAFVLVDTN GIRMQIPVGG LASLMLEPGS 
RVSHAAVSLA SKVGCLLVFV GEGGVRLYSV GHPGGARSDR LLYQARLALD EVLRLKVVKK
MFSLRFGEDF SDAYSVEQLR GLEGVRVREG YRKIARDTGV IWNGRRYDPH SWGSADLPNR
CLSAATASLY GICEAAVLAA GYSPSIGFLH TGKPLSFVYD IADLFKFETV VPAAFKTAAL
NPREPEREVR YACRDLFRET QLLKRIIPTI EEVLTAGGIS APAPPDWVVP PAIPVDEEG