Gene Mbur_0573 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbur_0573 
Symbol 
ID3997098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanococcoides burtonii DSM 6242 
KingdomArchaea 
Replicon accessionNC_007955 
Strand
Start bp582959 
End bp584155 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content37% 
IMG OID637958379 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_565297 
Protein GI91772605 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.00904377 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTTC TACTTCTAAA CGGTCATGGA ATTAACATGC GTGTAGATGG TGCTAAACTC 
CATATCAAAG ATGGAAGATT CTCAGCAACT GAAGATCCTC AGGAGTATGT GTTCTCTCCC
AAGAGGATTG ATATTGATAG CATTGTTGTG TATGGTCGAA GTGGATCTCT AAGCTTTGAA
GCTATCAGAT GGTTGATTAA ACACAATGTA CAGGTTACTA TGTTAGATTG GAATGGCAAG
TTTCTAACAA CAATGCTTCC TTCTGAAAGT ACCAATGTTA AAACAAAGTT TGCTCAATAC
CATGCTTATG AAGATCAGGA TGCAAGAGTA AAACTTGCAA GAAAATTCAT TGAGGCTAAG
TTCTACAAAT CTGAAGCTGT TCTTGATTAT CTTAAACAAA GGTATCCTGA GATTGAATAT
GATTTCTCCG TTGATAAGGG TAAACTTGAA AATGCCAAAT CTGTAAGAGA GATACTTGGG
ATTGAAGGTG GAGTTGCTTC AAAGTACTGG AATGAGTATT CTAAAGCTAT TCCTGATGAA
TATGATTTCA GAGCAAGGAC TGATAATAAT GCCAGAGCTT CTAATTCAGG CGATAAAGTC
AATGTTATGC TTAATTACGG ATATGCTTTG CTTGAATCTG AATGTCTGAG AGCCATCAAT
TCAGTTGGTC TTGATGCTCA TGTAGGTTTC CTTCATGAGA TGAATCCAAG TAAGAACAGT
TTAGCCTATG ATCTCCAAGA GCCATTTAGG TTTATTGTGG ATCTTGCTGT TATGAACCTG
ATAGAAAAGG AAGTTATGGA TAGTAAGGAT TTTATCAGGA CTGAGAGTTT TTCATTGAGG
CTTAAACCTA CTGGAGCAAG GAAGGTTACT GATAAATTCA ATTCTATGAT GAACGGCAAG
GTTGAGTATA GGAAGAAGAA TAGTTCTTGG GGATCTGTTC TTTTGGTTAA GGCAAGGGAG
TTAAGCCATC AACTTGTAGG GAAGAGGAAA ACAATTGAAT TTAGTAAGCC TGTTTATGTG
GTTGAAAGAG ATGATTCCAA TCTGTTGAGG AAAAGGATCA TTGACATGCC TTATGTTGAA
TGGAAGAAGA TGGGTTTCTC AAAGGGTACT CTCCATTACA TGAAGCAGAA TGCCAAGAGT
GATTTACCGT TTACTCTCAA TGGTCATGTG AAGGAAAGGT TGGAGAATTG GGAATAA
 
Protein sequence
MKLLLLNGHG INMRVDGAKL HIKDGRFSAT EDPQEYVFSP KRIDIDSIVV YGRSGSLSFE 
AIRWLIKHNV QVTMLDWNGK FLTTMLPSES TNVKTKFAQY HAYEDQDARV KLARKFIEAK
FYKSEAVLDY LKQRYPEIEY DFSVDKGKLE NAKSVREILG IEGGVASKYW NEYSKAIPDE
YDFRARTDNN ARASNSGDKV NVMLNYGYAL LESECLRAIN SVGLDAHVGF LHEMNPSKNS
LAYDLQEPFR FIVDLAVMNL IEKEVMDSKD FIRTESFSLR LKPTGARKVT DKFNSMMNGK
VEYRKKNSSW GSVLLVKARE LSHQLVGKRK TIEFSKPVYV VERDDSNLLR KRIIDMPYVE
WKKMGFSKGT LHYMKQNAKS DLPFTLNGHV KERLENWE