Gene MCA0930 details


Gene Information

Locus tag: MCA0930
Symbol:
ID: 3102101
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 972568
End bp: 973458
Gene Length: 891 bp
Protein Length: 296 aa
Translation table: 11
GC content: 60%
IMG OID: 637170122
Product: CRISPR-associated Cas1 family protein
Protein accession: YP_113413
Protein GI: 53804737
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype
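
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the reported 891 bp is consistent with that) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 972568
end_bp = 973458


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # prints: 891 296
```

Both results agree with the Gene Length and Protein Length fields.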


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.264886
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGGCC TGTTTTCCGG TCGCCTCGGA CTTGCCGAGT CGCGCATCCC CCATGCCGAC 
CGTCACGGTC TGCTCTGGCT CACCTTCGGT AATCTGACTG TCGAAGACGG CACCCTGCAT
TTTCGGGCCG CTCCATCGGA ATGGATGGAC GCCGGAGATT ATGCGATCCC ATACCAGGGA
TTGTCGATGA TTCTGTTGGC CCCCGGAACG ACGGTCAGTC ACGATGCATT GCGCCTGTTG
GCTCGTCATG GCACGTTGCT GGCCGCAGTC GGAGATGGAG GGGTCAAATT CTACACCGCA
CCGCCCATGG GCCAAGGCCG CTCCGACGTG GCCCGCGCCC ATGCCAGGCT GTGGGCCGAT
GAAAAGGCGC GGTTGGATGT GGCCCGCCAT ATGTATGCAT TCCGTTTCGG CCGTATTCTG
CCGCACAAGG ATATTGCGGT GCTGCGAGGC ATAGAAGGTG CGCGCGTGAA AGAGACTTAC
CAGCTGCTCG CCAATCAGCA TGGCGTGGAG TGGAAGGGAC GCCGTTACGA CAGGAATAAC
CCCAATGCCG CCGACATTCC CAACCAGGCC ATCAACCACG CCGCCACTTT CGTCGAAGCG
GCGGCAGATG TCGCGGTGGC CGCGGTCGGC GGTTTGCCGC CGCTAGGGTT CATTCACGAA
GAATCCAGCA ATGCCTTCAC CCTGGACATC GCCGATCTGT TCCGCGCCGA AGTCACCTTG
CCACTGGCTT TCAGCGTCGC TAAGCGCGTG ATGGACGATC CGTCTTTGCC ATTGGAGAGG
GCACTGCGCA AAGAAGCCGC CCGGCAGTTT CACAAGCAAA AAGTCATTCC GAAGATGATC
GACCGCATCA AGGAGTTGCT GCATGTCGAT GACGGTAATG GTGACGCGTA A
 
Protein sequence
MSGLFSGRLG LAESRIPHAD RHGLLWLTFG NLTVEDGTLH FRAAPSEWMD AGDYAIPYQG 
LSMILLAPGT TVSHDALRLL ARHGTLLAAV GDGGVKFYTA PPMGQGRSDV ARAHARLWAD
EKARLDVARH MYAFRFGRIL PHKDIAVLRG IEGARVKETY QLLANQHGVE WKGRRYDRNN
PNAADIPNQA INHAATFVEA AADVAVAAVG GLPPLGFIHE ESSNAFTLDI ADLFRAEVTL
PLAFSVAKRV MDDPSLPLER ALRKEAARQF HKQKVIPKMI DRIKELLHVD DGNGDA
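
The relationship between the gene sequence and the protein sequence can be illustrated with a short, dependency-free sketch. It builds the codon table from the conventional NCBI base order (TCAG); the amino-acid assignments of translation table 11 are the same as those of the standard code, and table 11 differs only in its permitted start codons, which does not matter here because the gene starts with ATG. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first 60 nucleotides of the gene are used as input.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record (60 nt).
first_line = "ATGAGCGGCC TGTTTTCCGG TCGCCTCGGA CTTGCCGAGT CGCGCATCCC CCATGCCGAC"
print(translate(first_line))  # prints: MSGLFSGRLGLAESRIPHAD
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.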