Gene Mbar_A3119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A3119 
Symbol 
ID3626570 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp4009517 
End bp4011169 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content36% 
IMG OID637701958 
ProductCRISPR-associated Cas1/Cas4 family protein 
Protein accessionYP_306586 
Protein GI73670571 
COG category[L] Replication, recombination and repair 
COG ID[COG1468] RecB family exonuclease
[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR00372] CRISPR-associated protein Cas4 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000141365 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGAAC CTGTAGCTAG GTATGAAGAG CCGAAACTGA TTCCTGCAAG AATGCTAAAT 
GAATTTGTAT ATTGTCCACG TTTATGTTAT ATGGAGTGGG TACAGGGAGA ATTTGAGCAT
AGTGAAGATA CTTTGGAAGG AAAATTTGTT CATCGAAATG TAGATCAAGA AAAAACCAAA
GATTTACAAG GTGAAGAAAA GAAGATTCAT AGCACTTCTG TGATGCTTTC GGGTTATGAG
ACTGGTGTGA TTACAAGAAT TGACCTTCTA GAAGAATCAA ACGGAAAAGC CGTGCCAGTT
GAATACAAAA AGGGCTATGT TCCAGATATT CCTGAAAAAG TCTATGAACC GGAGAGAATT
CAATTATGTG CTCAAGGCTT GGTGTTGAAA GAAAAGGGAT TTGATTGCAC GGAAGGAGTT
ATTTATTTTG TTAATTCAAA AAAAAGAGTG GCAGTTGACT TCGATGAGGA GCTTATCCAA
AAAACTAAAG AAACAATTTT GAGGTTTCTT GAGACAATTG GAAAAAAAGA GATTCCGGCT
CCTCTTGAAA ATAGTCCTAA ATGCTCCAGA TGTTCACTTT CGGGAATTTG TCTTCCCGAT
GAGACTAATA TTTTAAAAGG TTCAGCCTCA CAAGTTAGAT CATTAAATGT ATCTAAAGAC
GACAAAAAAC CCGTTTATGT TACTGGATGG GGAACTTCAG TACACAAGAA AGGAGATAGG
TTGGTTATTA AGAAGAATGA TGAAGAACTG CAGAGCGTCC CGTTAAGGCA AATATCCCAA
CTATCAATTT ATGGGGATGC TCATATTTCT TTGCCGGTAC TAAGAAGTTT AATTGAAATG
AATGTGCCTG TGTGTTATTT TTCCTTTGGG GGTTGGTTTT ATGGGCTATC ACATGGGGTT
ATGAGTAAAA ATGTTGATTT AAGGATTCAT CAATATCAGA CTGCTTTTGA TTCAGAAAGG
TCACTGGCAA TTTCTCGCAA GATGATTGCT GGCAAAATCA AAAATTGTCG GACTTTATTA
AGAAGAAACG ACACTGAAGT TTCAGAGAAA ATTCTCTCTC AATTAAATTC TCTTGAAAAA
AAGGCATCGA ATGCAAAAGA AATCGGACAG CTTTTAGGTA TAGAAGGAAC TGCAGCCCAA
ATTTATTTTT CGAGATTTGG CAATATGTTG AAACAAGATC TTGACTGTAA ATTTGAAAAT
CGAAATAAAA GGCCTCCTAC CGACCCGGTA AATGCTGTTC TTTCATATTT ATATGGTATA
TTGACAAAAG AAGTCTTTGT AACTTTATTT TCGGTAGGTT TTGATCCTTA TATGGGTTTC
TATCACCAAC CTAAATATGG AAAACCAGCT CTAGCACTTG ATTTAATGGA AGAGTTTCGG
CCATTGATAG CAGATTCTGT TGCTCTAACG CTTTTCAACA ACAAAACAGT GACACTAGAA
GATTTCGAAA TAACAAATTT TGGAGTTTCG TTAAAGGACA ATACAAAAAA GAAAATAATT
AGTGGATATG AGAGAAGAAT AAATACGGAA ATAACTCACC CTATCTTTGG GTACAAAGCA
AGTTATAGAA GAATTTTGGA AATACAGGTT AGACTTTTAG GCAGAACCGT TACTAAAGAA
ATAGAATCAT ACACTCCATT TTGTACAAGA TAA
 
Protein sequence
MDEPVARYEE PKLIPARMLN EFVYCPRLCY MEWVQGEFEH SEDTLEGKFV HRNVDQEKTK 
DLQGEEKKIH STSVMLSGYE TGVITRIDLL EESNGKAVPV EYKKGYVPDI PEKVYEPERI
QLCAQGLVLK EKGFDCTEGV IYFVNSKKRV AVDFDEELIQ KTKETILRFL ETIGKKEIPA
PLENSPKCSR CSLSGICLPD ETNILKGSAS QVRSLNVSKD DKKPVYVTGW GTSVHKKGDR
LVIKKNDEEL QSVPLRQISQ LSIYGDAHIS LPVLRSLIEM NVPVCYFSFG GWFYGLSHGV
MSKNVDLRIH QYQTAFDSER SLAISRKMIA GKIKNCRTLL RRNDTEVSEK ILSQLNSLEK
KASNAKEIGQ LLGIEGTAAQ IYFSRFGNML KQDLDCKFEN RNKRPPTDPV NAVLSYLYGI
LTKEVFVTLF SVGFDPYMGF YHQPKYGKPA LALDLMEEFR PLIADSVALT LFNNKTVTLE
DFEITNFGVS LKDNTKKKII SGYERRINTE ITHPIFGYKA SYRRILEIQV RLLGRTVTKE
IESYTPFCTR