Gene Ppha_2454 details

Gene Information

Locus tag: Ppha_2454
Symbol:
ID: 6461195
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pelodictyon phaeoclathratiforme BU-1
Kingdom: Bacteria
Replicon accession: NC_011060
Strand:
Start bp: 2537649
End bp: 2539934
Gene Length: 2286 bp
Protein Length: 761 aa
Translation table: 11
GC content: 40%
IMG OID: 642728634
Product: CRISPR-associated protein, Csm1 family
Protein accession: YP_002019256
Protein GI: 194337462
COG category: [R] General function prediction only
COG ID: [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs)
TIGRFAM ID: [TIGR02578] CRISPR-associated protein, Csm1 family
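
The start and end coordinates, gene length, and protein length listed above agree with one another if the coordinates are taken as inclusive and 1-based, and if the final codon is a stop codon that is not counted in the protein length. A minimal Python sketch of that check follows; the function and constant names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
START_BP = 2537649
END_BP = 2539934

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2286
print(protein_length(length_bp))  # 761
```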


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.442356
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGCCAA CTGGACACAA TATTGTAGAA CTTGGAATAA AGCTTCTCAA AGAAAAAAAG 
ATTGCTATCA GTGCATTTGC AGAAGCATTT TCACTTGCAG GCGTTACAGC GACGGAGTTG
AAACTCGCTC CGCTGAAATC CGTCTTTGGT GGGGATAGAT ATTTTGAGCC AAAAGAGCTT
TCTGCTTTTG ATATCAATTA TCCCACACTT GATTCTATTG ACTATTTCAA TAATGTAACG
TTAGATGTTA TCGGAAATTG TCTTGCTTCA CTTGAACGGT TCGGATCCTT TGTCGCAACC
GATATCGACA ATACTATCCC TGTCTATGAT CTTTTTAAGA CCGCAGCAGC TATTCAAAAT
TGCTTAAACA ATGAACTAAA GGATAATCCA TTTCTGCTTG TAAGTGCCGA TTTTTCCGGC
ATTCAGGATA CAGTATACAC CATTTCCTCA AAAGGAGCAC TCAAGACACT TCGAGCTCGT
TCCTTCATGC TTGAGTTGCT CACAGAACAC ATAGTGTATG AGATTTTACA TGCATGTGGT
TCAGAGCGGT ATGCTATTAT CTACTCTGGT GGGGGCGGCT TTAGCCTGTT ACTGCCGAAC
ATAGCCGGAA ATGTAAAAGC TATTGATGGC TATAGAGAGG TACTTAATAA ATGGGCTTTA
CAGGAATTTT CAGGGAAGTT TTTTATTGCA ATGGATGCAT TGGTATTCAA CGAAGAGAAG
CTGAAAGACA AAGGGATTTT TAAAAAGTTA AGACAAGAAC AATCTGATAA TTTAGATAGG
CAGAAATCTC GGAAATTTTT ACGCCAACTT GATAAGCTTT TCAAACCTGA GATGCCAGAG
CAAGTTAACT TGCAGACTGA GTGCCAGATT ACCAGACGGG ACGATTTGCC AAATAAAGAA
ATGCGTGATT TAGGAAGTGG CGAAAAAATG CCGGAGCATA ATCCCGATAT GACAAAAACC
TGGGTTTCAG AAAGCTGTTA CAGGCAATAT CGTTTGGGTG ACAAGCTAAT TGGTACAAAG
TATGTCTGTC GGAGAAAAAT AGCAAAGGAC GATAGAAGCG GGTTTGTTCA GCTTCCGACT
ATGAAATCGA CACCAACGAA GCTCTCATTT TGTTATTATG TTGTCAGTTC GAAGTTGATT
CACAATTCAG ATGCAATTTG GTGCATCAAT TCTTGGACAG AACGAGAGAG TTCGAGAACC
TTACTGTACT CTAATTATGT CCGAAAGTAT GGTGATCTTA CTGCCTATAC TAAAAAGTTA
GGAAATGAAT CGCATACAAA TGTTGAAGAA AGACAAGCCA AGGATACTGA TACAGCGACG
TTTGAGGGTT TAGCTGCAAG TTCCTGCGGT GCAGATCTGA TTGGTGCATT GCGAATGGAT
GTCGATAACA TGGGTGACTT GTTTCGCAAT ATTGAAAGCT TTGCCGCGCT TTCAACAAAA
TCAAGAATGC TGAACCTCTT TTTTAAGGTG TACCTCAATC AAATATGCGC AGGACATTTA
GGCAGCAGAC TTTCTCCAAC TGACATTGTC GGGAAAAATT ACACGGAAAA GAATGCAGCC
GAAGAAAATA AAGGACGCAA TGTCTCAGTG ATTTACGCAG GAGGTGACGA TCTATTTATT
CTCGGTGCAT GGGATGAAAC AACGGAACTT GCCTTCGATA TTCAACGCTG CTTTGCCTTA
TTCACAGGTG GAATGTTTGA TACAACTAAA AAGATCATTA GCGGAGGACA AGGAATATCC
GGCGGTCTCA CTCTGCACCA GCCAAAATTC CCACTTTACC AAATGGCAAT AAAATCAGGT
GAGGCAGAAA GTGTTGCAAA ACATGCAAAA GTGCAAAAAA ACTGCTTCAC ACCTTTTTTG
CTTGCCTTAG ATCATGTTGA TAAGGGTTAT GAATTTTATA ACAACCAAAC AGTTCGAATC
ATTGACTGGA ACGATCCAAC TTTTTTTGGA TTACTTGAAA ACTTAGTAGT ATTAACGGAG
AAGCATATAA CCGGCAATCA GATTGATACC ATCAAACTCG AAGCTGTTTC AAAAGGATTT
ATTTACAAGC TGTTTGAAGT AGCAAGAGTT TGGGTCGCTG AAGGGACTTT ATACTTGCCT
CGCTTGCGTT ACATATTTTC ACGTTTAGAA AAACAATATA TAGATAACAA AGATACTCAC
AAGCAAGCAG GTATTCAAAG CTTGCGAAGT ATTTTGTTTT CATCAATTAA GCATGAAAGA
GAAAAATCTA TTAAGCGTAT GACATTGACG CTTAACTGGT TTGAACAACT TCAACGCAAT
AAGTAA
 
Protein sequence
MVPTGHNIVE LGIKLLKEKK IAISAFAEAF SLAGVTATEL KLAPLKSVFG GDRYFEPKEL 
SAFDINYPTL DSIDYFNNVT LDVIGNCLAS LERFGSFVAT DIDNTIPVYD LFKTAAAIQN
CLNNELKDNP FLLVSADFSG IQDTVYTISS KGALKTLRAR SFMLELLTEH IVYEILHACG
SERYAIIYSG GGGFSLLLPN IAGNVKAIDG YREVLNKWAL QEFSGKFFIA MDALVFNEEK
LKDKGIFKKL RQEQSDNLDR QKSRKFLRQL DKLFKPEMPE QVNLQTECQI TRRDDLPNKE
MRDLGSGEKM PEHNPDMTKT WVSESCYRQY RLGDKLIGTK YVCRRKIAKD DRSGFVQLPT
MKSTPTKLSF CYYVVSSKLI HNSDAIWCIN SWTERESSRT LLYSNYVRKY GDLTAYTKKL
GNESHTNVEE RQAKDTDTAT FEGLAASSCG ADLIGALRMD VDNMGDLFRN IESFAALSTK
SRMLNLFFKV YLNQICAGHL GSRLSPTDIV GKNYTEKNAA EENKGRNVSV IYAGGDDLFI
LGAWDETTEL AFDIQRCFAL FTGGMFDTTK KIISGGQGIS GGLTLHQPKF PLYQMAIKSG
EAESVAKHAK VQKNCFTPFL LALDHVDKGY EFYNNQTVRI IDWNDPTFFG LLENLVVLTE
KHITGNQIDT IKLEAVSKGF IYKLFEVARV WVAEGTLYLP RLRYIFSRLE KQYIDNKDTH
KQAGIQSLRS ILFSSIKHER EKSIKRMTLT LNWFEQLQRN K
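
The protein sequence can be derived from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, assuming that the spaces between 10-letter blocks are only formatting and that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted). `CODON_TABLE` and `translate` are illustrative names, not part of the record.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above, in the record's block layout.
print(translate("ATGGTGCCAA CTGGACACAA TATTGTAGAA"))  # MVPTGHNIVE
```

Passing the full gene sequence in the same way should reproduce the protein sequence, with the terminal TAA read as the stop codon.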