Gene MCA0651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA0651 
Symbolcas1 
ID3102489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp685018 
End bp686052 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content64% 
IMG OID637169862 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_113164 
Protein GI53804988 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.280265 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGTGC TGCAAAACAC CCTCTACGTC ACCACGCCCG AGGCTTATCT ACGCCTGGAG 
GGCGAGACCG TGTGCGTGAT GATCGAGGAG CAAAAACGCC TGCAGGTCCC GCTGCATCAT
CTATCCGCCT TCGTGCTGTT CGATCACGTC ATGCTCAGCC CTGCCTTGCT TGGGCGCTGC
GCCGAGGATG GCCGATCGGT CGTATGGCTC GATCGGGCGG GCCGGTTCAG GGCGCGGCTG
GAAGGGCCGG TGAACGGCAA CATACTTCTG CGCCAGGCGC AATTCCGCGC GGCGGAAGAC
GGTGCGCAAA CCCTGGGCCT GGCTCGCGCG GTATTGGCCG GCAAGCTGCG TAACAGCCGG
CAGTTGTTGA TGCGGGGCGC GCGGGAAACG GACGACGTGG TCGAAAGAGA CGCTTTGGTG
CGCGCCGCCA AGCTCATCGC CAATCAAGTG CGCAAGCTCC CTTTGGCGCA AGATCTCGAC
ACCCTACGGG GCTTGGAAGG CGATGTGGCC AGGCTTTATT TCGAGGCCCT GCCCAAAGTC
ATGAGGGCGA AGGCGCGGGC CGAGTTCCCT TTCGACTGCC GCAACCGGCG CCCGCCGCGC
GACCGTTTCA ACGCATTGCT CTCTTTCCTG TATGCCCTGG TGTTGAGCGA CTGCCGGGCG
GCCCTGGAAA CCGTCGGGCT CGATCCGCAA TTGGGCTTTC TCCATGCCGT GCGTCCCGGT
CGTCCGGCCT TGGCGCTGGA TCTGTTGGAA GAATTCCGCG CCCCGCTGGC GGACCGGCTG
GCCTTGACCC TCGTCAACCG CGGACAATTG CAGGCCAGCG ATTTCGACGA ACGGGAAGGC
GGTGCCGTGC TGCTCAACGA CAAGGGCCGC AAGACCGTCA TCGCCGCCTA TCAAACCCGC
AAGCAGGAAG CAATCACGCA TCCGCTGCTC AAACAAACCC TGCCGATCGG CCTGCTGCCT
CATTGGCAAG CGCGCTTGCT GGCCCGCTAT CTGCGCCAGG ACGTCGCGCA TTACGTGCCT
TATCTGCACC GCTGA
 
Protein sequence
MTVLQNTLYV TTPEAYLRLE GETVCVMIEE QKRLQVPLHH LSAFVLFDHV MLSPALLGRC 
AEDGRSVVWL DRAGRFRARL EGPVNGNILL RQAQFRAAED GAQTLGLARA VLAGKLRNSR
QLLMRGARET DDVVERDALV RAAKLIANQV RKLPLAQDLD TLRGLEGDVA RLYFEALPKV
MRAKARAEFP FDCRNRRPPR DRFNALLSFL YALVLSDCRA ALETVGLDPQ LGFLHAVRPG
RPALALDLLE EFRAPLADRL ALTLVNRGQL QASDFDEREG GAVLLNDKGR KTVIAAYQTR
KQEAITHPLL KQTLPIGLLP HWQARLLARY LRQDVAHYVP YLHR