Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA0930 |
Symbol | |
ID | 3102101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | - |
Start bp | 972568 |
End bp | 973458 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637170122 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_113413 |
Protein GI | 53804737 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.264886 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGCC TGTTTTCCGG TCGCCTCGGA CTTGCCGAGT CGCGCATCCC CCATGCCGAC CGTCACGGTC TGCTCTGGCT CACCTTCGGT AATCTGACTG TCGAAGACGG CACCCTGCAT TTTCGGGCCG CTCCATCGGA ATGGATGGAC GCCGGAGATT ATGCGATCCC ATACCAGGGA TTGTCGATGA TTCTGTTGGC CCCCGGAACG ACGGTCAGTC ACGATGCATT GCGCCTGTTG GCTCGTCATG GCACGTTGCT GGCCGCAGTC GGAGATGGAG GGGTCAAATT CTACACCGCA CCGCCCATGG GCCAAGGCCG CTCCGACGTG GCCCGCGCCC ATGCCAGGCT GTGGGCCGAT GAAAAGGCGC GGTTGGATGT GGCCCGCCAT ATGTATGCAT TCCGTTTCGG CCGTATTCTG CCGCACAAGG ATATTGCGGT GCTGCGAGGC ATAGAAGGTG CGCGCGTGAA AGAGACTTAC CAGCTGCTCG CCAATCAGCA TGGCGTGGAG TGGAAGGGAC GCCGTTACGA CAGGAATAAC CCCAATGCCG CCGACATTCC CAACCAGGCC ATCAACCACG CCGCCACTTT CGTCGAAGCG GCGGCAGATG TCGCGGTGGC CGCGGTCGGC GGTTTGCCGC CGCTAGGGTT CATTCACGAA GAATCCAGCA ATGCCTTCAC CCTGGACATC GCCGATCTGT TCCGCGCCGA AGTCACCTTG CCACTGGCTT TCAGCGTCGC TAAGCGCGTG ATGGACGATC CGTCTTTGCC ATTGGAGAGG GCACTGCGCA AAGAAGCCGC CCGGCAGTTT CACAAGCAAA AAGTCATTCC GAAGATGATC GACCGCATCA AGGAGTTGCT GCATGTCGAT GACGGTAATG GTGACGCGTA A
|
Protein sequence | MSGLFSGRLG LAESRIPHAD RHGLLWLTFG NLTVEDGTLH FRAAPSEWMD AGDYAIPYQG LSMILLAPGT TVSHDALRLL ARHGTLLAAV GDGGVKFYTA PPMGQGRSDV ARAHARLWAD EKARLDVARH MYAFRFGRIL PHKDIAVLRG IEGARVKETY QLLANQHGVE WKGRRYDRNN PNAADIPNQA INHAATFVEA AADVAVAAVG GLPPLGFIHE ESSNAFTLDI ADLFRAEVTL PLAFSVAKRV MDDPSLPLER ALRKEAARQF HKQKVIPKMI DRIKELLHVD DGNGDA
|
| |