Gene Ppha_2459 details


Gene Information

Locus tag: Ppha_2459
Symbol:
ID: 6461403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pelodictyon phaeoclathratiforme BU-1
Kingdom: Bacteria
Replicon accession: NC_011060
Strand:
Start bp: 2541825
End bp: 2544008
Gene Length: 2184 bp
Protein Length: 727 aa
Translation table: 11
GC content: 53%
IMG OID: 642728638
Product: CRISPR-associated protein Cas1
Protein accession: YP_002019260
Protein GI: 194337466
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
        [COG3344] Retron-type reverse transcriptase
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
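
The coordinate and length fields above agree with each other, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the coding sequence ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the record; 1-based inclusive coordinates are an assumption.
start_bp = 2541825
end_bp = 2544008

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # the stop codon adds no amino acid

print(gene_length, codons, protein_length)  # 2184 728 727
```

Under those assumptions the result matches the listed Gene Length (2184 bp) and Protein Length (727 aa).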


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.670262
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGATGGC TTTACAATCA GATGGCTTTG CCTGAAACAC TTTTTCAGGC CTGGAACAAG 
GTGGTTTCGA ACGACGGAAT GGCCGGGTAC GACAAACAGT CAATTACCGA TTATTCGTGG
CGCATCGAGG AGCATCTTGC CGATCTTGGC AGGCAGCTCT TGACCAACAC CTATGAGCCG
CAGCCGCTCC TGAAACTCGT CATGCTGAAA CCCACCGGAA AGTTGCGCAC CCTCCTGATT
CCTACGGTGA TGGAGCGGGT TGCGCAAACC GCAGCGGCGA TCGTGCTGAC CCCTCTGGTG
GAGTCGGAAC TGGGAGCCAA TACCTTCGCC TACCGGCCGG GGTTGTCGCG CATGACGGCG
GCCCGTGAAA TCGAGAGGCT GCGAAACCTT GGTTATAACT GGGTGGTCGA TGCCGATATC
AGCAGCTTTT TCGATACCGT TGACCACCCG TTACTCTTTC AGCGCTTTCG TGAACTCTGT
GATGACGAGG AGCTGCTCAC GCTGATTGCG CGATGGCTGA CCGCAGAAAT TGTTGATGGC
CAGAACCCAA AGGTGAAGAA CACCATCGGC CTGCCGCAGG GCTGCCCCAT TTCGCCGATG
CTTGCAAACC TCTATCTGGA TAAATTTGAT GAGCGCATGG AGCAGGAGGG GTTCAAACTG
GTGCGCTTTG CCGATGATTT TCTCATTCTT TGCAAGTCAA AACCAAAAGC AGAAGCGGCC
CTCCAGCTTT CGGAAAGTGC GCTGGCGGAG CTGAAACTGC AACTCAACAA CGAAAAGACC
CGGATCACCA CCTTTGCGGA GGGGTTCAAA TACCTTGGCT ACCTCTTCAT CCGTTCGCTG
GTGTTGCCCA CCAAAATGCA CCCGGATGAG TGGTACGACA AACTGGGCAA GCTGAAACTC
CGCAAAGCTC CGAAAAGCCT GCTGCACCCT GAGGAAAGTG ATGATGAGGA GTACGAACTG
CAAACCGGTG ACTCCGACGC CATAACGGTG ACCAAAGAGA CGCTGCTTCA GACCGAGTTT
GGTGAAAAGC TGCTGCAAAG CCTTGAAAAA CAGCAGATTG ATGTAGACAA ATTCCTTGAG
AAAACGGCCA AAGAGGATGC CCTGCGGCAA AAAGAGAAGC ACGAAGCGCT CAACAAACTC
TACTCCCCGC TGCTCAATAC CCTCTACCTG CAGGAACAGG GCAGCATATT GCGTAAGGAT
GGTGAACGGT TCAGCGTTGA AAAAGAGGGC AAGCAACTCA ACGACATCAT TGTCCGCCGG
GTAGAGCAGA TTCTGGTGCT CGGCAACATC ACGCTCACCA CACCGGCTAT GCAATACTGC
ATGAAAAGCA ACATTCCCAT CACCTTTGTT TCGCAACATG GCAGCTACTT TGGTCGGCTC
GAAGCCACCA CAGCCGACAA CTCCGCGCTC GAACGCTTTC AGTACCTCCG CTCACTCGAT
GAACCCTTTG CCCTCGGAAT TGCCAGCGCT ATTGTAGAGG CAAAAATCAG AAACTCGCGC
ACCATGATCC AGAAACGCAA AGCCATGGCG TGGGAAAGTA ACGGAGAGCT GAAAGAAAAA
TTCGATGCGT CCCTCTTGCT GATGACCTCG CTTGCCGAAC ATACCAAAAG CTGTGACAAT
ATGGAGGCGC TCCGGGGGAT CGAAGGCAAA GCCGCCGCAC TCTACTTTGA ACTCTTCGGC
CTCCTCTTCA AAAAAGAGCT TCCCTTTTAT ACCAGTGCAT TCCGCAGGGT GCGGCGTCCG
CCAACCGACC CGGTCAACAG CCTGCTCAGT TTCGGCTATA CCCTGCTGCA CAACAACATC
TTCTCCCTCG TGCGCATGAA GGGAATGAAC CCCTACCTCG GCTTTCTGCA CGCCGAAGAC
AAAGGCAACC CGGCGCTGAT CAACGACCTC GTCGAAGAGT TCAGAACCAT TATCGACTCC
ATGACGCTCT ACACCCTCAA CAAGGGAGTT TTGCGGAACA AGGATTTTTA TTATCGTAAG
GACAAGGCGG GATGCTTTCT CACTGATGAG GCCCGGAAAA AGTTTCTGGA ACTCTTTGAA
CAGCGAATGT GGGCAGAGTC GCTCGACCCG CAAAGTGGCA AAAGCCTGAA TATCCGGCGG
CATATTGAGT CGCAGGTGGT CAAAATCAGC GAAGTGCTTG CCGGGACGAG GGCTGTTTAT
GAACCCTGGC GGTCAGAATG GTAA
 
Protein sequence
MGWLYNQMAL PETLFQAWNK VVSNDGMAGY DKQSITDYSW RIEEHLADLG RQLLTNTYEP 
QPLLKLVMLK PTGKLRTLLI PTVMERVAQT AAAIVLTPLV ESELGANTFA YRPGLSRMTA
AREIERLRNL GYNWVVDADI SSFFDTVDHP LLFQRFRELC DDEELLTLIA RWLTAEIVDG
QNPKVKNTIG LPQGCPISPM LANLYLDKFD ERMEQEGFKL VRFADDFLIL CKSKPKAEAA
LQLSESALAE LKLQLNNEKT RITTFAEGFK YLGYLFIRSL VLPTKMHPDE WYDKLGKLKL
RKAPKSLLHP EESDDEEYEL QTGDSDAITV TKETLLQTEF GEKLLQSLEK QQIDVDKFLE
KTAKEDALRQ KEKHEALNKL YSPLLNTLYL QEQGSILRKD GERFSVEKEG KQLNDIIVRR
VEQILVLGNI TLTTPAMQYC MKSNIPITFV SQHGSYFGRL EATTADNSAL ERFQYLRSLD
EPFALGIASA IVEAKIRNSR TMIQKRKAMA WESNGELKEK FDASLLLMTS LAEHTKSCDN
MEALRGIEGK AAALYFELFG LLFKKELPFY TSAFRRVRRP PTDPVNSLLS FGYTLLHNNI
FSLVRMKGMN PYLGFLHAED KGNPALINDL VEEFRTIIDS MTLYTLNKGV LRNKDFYYRK
DKAGCFLTDE ARKKFLELFE QRMWAESLDP QSGKSLNIRR HIESQVVKIS EVLAGTRAVY
EPWRSEW
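
Both sequences are printed in blocks of 10 characters, six blocks per line, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up, with a simple GC-fraction helper. The function names `join_blocks` and `gc_fraction` are hypothetical and not part of any database tool. Only the first and last lines of the gene sequence are used here to keep the example short.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGGATGGC TTTACAATCA GATGGCTTTG CCTGAAACAC TTTTTCAGGC CTGGAACAAG"
last_line = "GAACCCTGGC GGTCAGAATG GTAA"

head = join_blocks(first_line)
tail = join_blocks(last_line)

# A full line holds 60 bases; the gene starts with ATG and ends with TAA.
print(len(head), head[:3], tail[-3:])
```

Applying `join_blocks` to the complete gene sequence and passing the result to `gc_fraction` gives a value to compare with the GC content listed in the record (53%).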