Gene Mnod_5414 details


Gene Information

Locus tag: Mnod_5414
Symbol:
ID: 7302003
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium nodulans ORS 2060
Kingdom: Bacteria
Replicon accession: NC_011894
Strand:
Start bp: 5498806
End bp: 5500488
Gene Length: 1683 bp
Protein Length: 560 aa
Translation table: 11
GC content: 70%
IMG OID: 643603045
Product: CRISPR-associated protein, Cse1 family
Protein accession: YP_002500561
Protein GI: 220925259
COG category:
COG ID:
TIGRFAM ID: [TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1
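
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention, but this one reproduces the listed 1683 bp) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5498806, 5500488)
print(length, protein_length(length))
```

With the values from this record, the computed gene length and protein length agree with the listed 1683 bp and 560 aa.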


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCCCCT TCTCACTGCT CACCGAGCCG TGGATCCCCG TCCTGCGCGC CGACGGCACC 
CACGCCTGCA TTCGCCCGGC TGAAATCACC GCGGACATCG CGGCCAATCC CGTCGTCGCT
CCGGCCTGGG GCCGGCCCGA CCTCGACGCG GCGACCCGGG AGTACTGGAT CGCCCTCTTC
GGCACGGCCT GCGGCAGTTG GGCCGGTCCA GGCGCCTGGA GGGAGCATCT GCGCCACCCA
CCCGCACCCG AGGTGCTCGA CGCTGCCTTC GCTCCGCTCG CACCCGCCTT CATCCTGGAC
GGCGAGGGGC CGCGCTTCGG GCAGGACCTC GAGGACATTG CGGGCGAGAC CGTTCCGGTC
GGTCAGCTTC TGATCGAGGC GCCCGGCGCC AACACGATCA AGCGCAACCT CGACCACTTC
GTGCGCCGGG GCCGGGTCGA GACCCTGTCG CGGGCAGGCG CCGCCATCGC GCTCCACACG
CTCCAGACCT ACGCCCCGTC CGGCGGTGCC GGCCACCGCG TGTCAGTCCG CGGCGGTGGG
CCCCTCACCA CGCTACTCCT TCCAGGCCCG CCCCGCGGCG GCGACCCGGC CCGGCCGGTC
CCACTATGGC AAACCCTCTG GCTCGCGACA CCGGCGTGTG AGGCAAGCTC GCTCAAGCGC
GTTTTTCCCT GGCTCGCGCC CACTCGCACC TCCGAGCAGA AGCGGGTGAC GACGCCCTCC
GACGTCGATC CGCTTCAGGC CTTCTGGGGC ATGCCGCGCC GCGTCCGGCT CGTCTTCGAA
GCGAACACCG AGGGCCACCC CTGCGATCTC ACTGGCCGCA TCGACCCAGT GGTGGTCCGG
GCCTACCGCA CCCGGCCTCA TGGCACGAGT TACGTTGGCT TCACTCACCC ACTCTCGCCG
CACTACCGGG GCAAGGCCGA TGAGCCTTTT CTGCCGGTGC ACGGTCAGCC CGGCCGTGTC
GGGTACCGGC ATTGGGTCGG ACTGGTGGTG AGCGATCAGG CCGCCTCGCC TCTGAGGAGA
CCGGCTGACG CTGTCACTCT GGGGCTCTCC CGCCTCGAAG GCGTCGGCGG GCCCACCGCG
GCGCAAGCTC GTCTCCTCGC GACCGGCTAC GACATGGACA ACATGAAGGC CCGCGCCTTC
ATCGAGAGCG AGATGCCGCT CCACCTGCCG CCCCCCGGCC GGTTCAGTGA CCTCAACGGT
GCTGTGAGCG ACATGATCAA GGGAGCGTAC GCGGCTGAGG GACTGCTGCG CACCGGGGTG
CGCGCGGCCC TGTTCGTCAA GGCCACGGCC GGCGATGGCT TCCAGAACGC TCCAAAGGGG
GGCGGGGCAA TCGATCTGGC TCGCGCCCGG TTCTGGGAGC GCACGGAAGC CGCTTTCGGC
GAGGCCCTCG CGGCTCTTTC GGAAGACTTA GCCGATCCGA ACGCCGACGC CCTGGTCGTC
ACGACTGCGG CACGCGAGGC TTGGCGCGAG AGCCTGCGCC GGGCGGCGAT CGACCTCTTT
GACGACCTCG TCCCACTCGA CGACCTCGAC GCCCTCGACC TGCGGGCCGG GCAGGCTCGG
ATCGAGGCCC GCAGCAATCT TCACCTTGCC CTTCACGGCT ACGGCAAGTC TGGCGCCAGC
TTCTTCGAGG CTTTGGGGTT CGAGCCGCCG AAGCCGTCCA ACAAGCGCAA GGAGCGCGCA
TGA
 
Protein sequence
MRPFSLLTEP WIPVLRADGT HACIRPAEIT ADIAANPVVA PAWGRPDLDA ATREYWIALF 
GTACGSWAGP GAWREHLRHP PAPEVLDAAF APLAPAFILD GEGPRFGQDL EDIAGETVPV
GQLLIEAPGA NTIKRNLDHF VRRGRVETLS RAGAAIALHT LQTYAPSGGA GHRVSVRGGG
PLTTLLLPGP PRGGDPARPV PLWQTLWLAT PACEASSLKR VFPWLAPTRT SEQKRVTTPS
DVDPLQAFWG MPRRVRLVFE ANTEGHPCDL TGRIDPVVVR AYRTRPHGTS YVGFTHPLSP
HYRGKADEPF LPVHGQPGRV GYRHWVGLVV SDQAASPLRR PADAVTLGLS RLEGVGGPTA
AQARLLATGY DMDNMKARAF IESEMPLHLP PPGRFSDLNG AVSDMIKGAY AAEGLLRTGV
RAALFVKATA GDGFQNAPKG GGAIDLARAR FWERTEAAFG EALAALSEDL ADPNADALVV
TTAAREAWRE SLRRAAIDLF DDLVPLDDLD ALDLRAGQAR IEARSNLHLA LHGYGKSGAS
FFEALGFEPP KPSNKRKERA
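
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch, with hypothetical helper names, of stripping the grouping and computing a GC fraction. It is shown on the first line of the gene sequence only, so the result is not the whole-gene GC content listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in grouped blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record, grouping left as printed.
first_line = "ATGCGCCCCT TCTCACTGCT CACCGAGCCG TGGATCCCCG TCCTGCGCGC CGACGGCACC"
dna = clean_sequence(first_line)
print(len(dna), round(gc_fraction(dna), 2))
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_fraction` would give a value to compare against the GC content field.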