Gene Mnod_5409 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_5409 
Symbol 
ID7301998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp5494660 
End bp5495604 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content68% 
IMG OID643603040 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_002500556 
Protein GI220925254 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.031752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGGTG CAAGTCTGCC CGGGCTGCCA CCGCCCAAAC CTATCCCGAT CAAGGATCGA 
GCTTCCTTGC TATTCGTCGA GAAGGGACAG CTCGACGTGC TCGACGGTAC CTTCGTGTTG
GTGGACGAGA ACGGGGTGCG GGTGCAGATC CCGATCGGCG GCCTCGTCTG CCTCATGCTG
GAACCCGGTA CCCGGGTGAG CCATGCCGCC GTGGCGCTCG CCGCCCGAGC AGGCACGTTG
CTGGTCTGGG TTGGGGAGGC GGGAGTGCGC CTCTACGCGG CTGGTCAGCC GGGAGGCGCC
CGTGCGGACC GGCTGCTGTG GCAGGCCCGC CTCGCCCTTG ATGAGTCGGC CCGGCTCAAG
GTGGTGCGCC GAATGTTTGA GCTGCGCTTT GGCGAGGCCG CGCCCGAACG CCGCTCCATC
GACCAGCTCC GGGGAATTGA GGGCGCCCGC GTGCGTCGCC TCTACCAGTT GTACGCCCAA
CAGAACGGGG TCGTGTGGAA CCGGCGCGCC TACGATCAGG GAGATTGGGA CGCCTCCGAT
GTACCGAACC GCTGCCTCTC GGCCGCTACG GCCTGCCTGC ACGGCCTGGC TGAGGCCGCC
GTGCTAGCAG CCGGCTATGC GCCAGCCATC GGCTTCCTGC ATACCGGACG GCCACGCTCC
TTCGTGTATG ATGTAGCGGA CTTATTCAAG TTCGAGACCG TAGTGCCGGC GGCCTTTCGG
GTGGCGGGGC GCGCTGCGAG AGGTGCGCTG TCTGGGCCAA CGGAAAGGGT AGTGCGGCAC
GAATGCCGTG ACATCTTCCG GCGCACGAGT CTCCTGGAGC GGATCATTCC GGCGATTGAA
GACGTGCTAG CCGCCGGCGG CCTGCCGCCG CCCGACGCGC CCACTGACGC CCAAGAGCCG
GCCTTCGATG AACCATCCTC AGGCGACCCG GGCCACCGCG GATGA
 
Protein sequence
MSGASLPGLP PPKPIPIKDR ASLLFVEKGQ LDVLDGTFVL VDENGVRVQI PIGGLVCLML 
EPGTRVSHAA VALAARAGTL LVWVGEAGVR LYAAGQPGGA RADRLLWQAR LALDESARLK
VVRRMFELRF GEAAPERRSI DQLRGIEGAR VRRLYQLYAQ QNGVVWNRRA YDQGDWDASD
VPNRCLSAAT ACLHGLAEAA VLAAGYAPAI GFLHTGRPRS FVYDVADLFK FETVVPAAFR
VAGRAARGAL SGPTERVVRH ECRDIFRRTS LLERIIPAIE DVLAAGGLPP PDAPTDAQEP
AFDEPSSGDP GHRG