Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mnod_5409 |
Symbol | |
ID | 7301998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium nodulans ORS 2060 |
Kingdom | Bacteria |
Replicon accession | NC_011894 |
Strand | - |
Start bp | 5494660 |
End bp | 5495604 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643603040 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_002500556 |
Protein GI | 220925254 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.031752 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGTG CAAGTCTGCC CGGGCTGCCA CCGCCCAAAC CTATCCCGAT CAAGGATCGA GCTTCCTTGC TATTCGTCGA GAAGGGACAG CTCGACGTGC TCGACGGTAC CTTCGTGTTG GTGGACGAGA ACGGGGTGCG GGTGCAGATC CCGATCGGCG GCCTCGTCTG CCTCATGCTG GAACCCGGTA CCCGGGTGAG CCATGCCGCC GTGGCGCTCG CCGCCCGAGC AGGCACGTTG CTGGTCTGGG TTGGGGAGGC GGGAGTGCGC CTCTACGCGG CTGGTCAGCC GGGAGGCGCC CGTGCGGACC GGCTGCTGTG GCAGGCCCGC CTCGCCCTTG ATGAGTCGGC CCGGCTCAAG GTGGTGCGCC GAATGTTTGA GCTGCGCTTT GGCGAGGCCG CGCCCGAACG CCGCTCCATC GACCAGCTCC GGGGAATTGA GGGCGCCCGC GTGCGTCGCC TCTACCAGTT GTACGCCCAA CAGAACGGGG TCGTGTGGAA CCGGCGCGCC TACGATCAGG GAGATTGGGA CGCCTCCGAT GTACCGAACC GCTGCCTCTC GGCCGCTACG GCCTGCCTGC ACGGCCTGGC TGAGGCCGCC GTGCTAGCAG CCGGCTATGC GCCAGCCATC GGCTTCCTGC ATACCGGACG GCCACGCTCC TTCGTGTATG ATGTAGCGGA CTTATTCAAG TTCGAGACCG TAGTGCCGGC GGCCTTTCGG GTGGCGGGGC GCGCTGCGAG AGGTGCGCTG TCTGGGCCAA CGGAAAGGGT AGTGCGGCAC GAATGCCGTG ACATCTTCCG GCGCACGAGT CTCCTGGAGC GGATCATTCC GGCGATTGAA GACGTGCTAG CCGCCGGCGG CCTGCCGCCG CCCGACGCGC CCACTGACGC CCAAGAGCCG GCCTTCGATG AACCATCCTC AGGCGACCCG GGCCACCGCG GATGA
|
Protein sequence | MSGASLPGLP PPKPIPIKDR ASLLFVEKGQ LDVLDGTFVL VDENGVRVQI PIGGLVCLML EPGTRVSHAA VALAARAGTL LVWVGEAGVR LYAAGQPGGA RADRLLWQAR LALDESARLK VVRRMFELRF GEAAPERRSI DQLRGIEGAR VRRLYQLYAQ QNGVVWNRRA YDQGDWDASD VPNRCLSAAT ACLHGLAEAA VLAAGYAPAI GFLHTGRPRS FVYDVADLFK FETVVPAAFR VAGRAARGAL SGPTERVVRH ECRDIFRRTS LLERIIPAIE DVLAAGGLPP PDAPTDAQEP AFDEPSSGDP GHRG
|
| |