Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ppha_2459 |
Symbol | |
ID | 6461403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pelodictyon phaeoclathratiforme BU-1 |
Kingdom | Bacteria |
Replicon accession | NC_011060 |
Strand | + |
Start bp | 2541825 |
End bp | 2544008 |
Gene Length | 2184 bp |
Protein Length | 727 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642728638 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_002019260 |
Protein GI | 194337466 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair [COG3344] Retron-type reverse transcriptase |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.670262 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGATGGC TTTACAATCA GATGGCTTTG CCTGAAACAC TTTTTCAGGC CTGGAACAAG GTGGTTTCGA ACGACGGAAT GGCCGGGTAC GACAAACAGT CAATTACCGA TTATTCGTGG CGCATCGAGG AGCATCTTGC CGATCTTGGC AGGCAGCTCT TGACCAACAC CTATGAGCCG CAGCCGCTCC TGAAACTCGT CATGCTGAAA CCCACCGGAA AGTTGCGCAC CCTCCTGATT CCTACGGTGA TGGAGCGGGT TGCGCAAACC GCAGCGGCGA TCGTGCTGAC CCCTCTGGTG GAGTCGGAAC TGGGAGCCAA TACCTTCGCC TACCGGCCGG GGTTGTCGCG CATGACGGCG GCCCGTGAAA TCGAGAGGCT GCGAAACCTT GGTTATAACT GGGTGGTCGA TGCCGATATC AGCAGCTTTT TCGATACCGT TGACCACCCG TTACTCTTTC AGCGCTTTCG TGAACTCTGT GATGACGAGG AGCTGCTCAC GCTGATTGCG CGATGGCTGA CCGCAGAAAT TGTTGATGGC CAGAACCCAA AGGTGAAGAA CACCATCGGC CTGCCGCAGG GCTGCCCCAT TTCGCCGATG CTTGCAAACC TCTATCTGGA TAAATTTGAT GAGCGCATGG AGCAGGAGGG GTTCAAACTG GTGCGCTTTG CCGATGATTT TCTCATTCTT TGCAAGTCAA AACCAAAAGC AGAAGCGGCC CTCCAGCTTT CGGAAAGTGC GCTGGCGGAG CTGAAACTGC AACTCAACAA CGAAAAGACC CGGATCACCA CCTTTGCGGA GGGGTTCAAA TACCTTGGCT ACCTCTTCAT CCGTTCGCTG GTGTTGCCCA CCAAAATGCA CCCGGATGAG TGGTACGACA AACTGGGCAA GCTGAAACTC CGCAAAGCTC CGAAAAGCCT GCTGCACCCT GAGGAAAGTG ATGATGAGGA GTACGAACTG CAAACCGGTG ACTCCGACGC CATAACGGTG ACCAAAGAGA CGCTGCTTCA GACCGAGTTT GGTGAAAAGC TGCTGCAAAG CCTTGAAAAA CAGCAGATTG ATGTAGACAA ATTCCTTGAG AAAACGGCCA AAGAGGATGC CCTGCGGCAA AAAGAGAAGC ACGAAGCGCT CAACAAACTC TACTCCCCGC TGCTCAATAC CCTCTACCTG CAGGAACAGG GCAGCATATT GCGTAAGGAT GGTGAACGGT TCAGCGTTGA AAAAGAGGGC AAGCAACTCA ACGACATCAT TGTCCGCCGG GTAGAGCAGA TTCTGGTGCT CGGCAACATC ACGCTCACCA CACCGGCTAT GCAATACTGC ATGAAAAGCA ACATTCCCAT CACCTTTGTT TCGCAACATG GCAGCTACTT TGGTCGGCTC GAAGCCACCA CAGCCGACAA CTCCGCGCTC GAACGCTTTC AGTACCTCCG CTCACTCGAT GAACCCTTTG CCCTCGGAAT TGCCAGCGCT ATTGTAGAGG CAAAAATCAG AAACTCGCGC ACCATGATCC AGAAACGCAA AGCCATGGCG TGGGAAAGTA ACGGAGAGCT GAAAGAAAAA TTCGATGCGT CCCTCTTGCT GATGACCTCG CTTGCCGAAC ATACCAAAAG CTGTGACAAT ATGGAGGCGC TCCGGGGGAT CGAAGGCAAA GCCGCCGCAC TCTACTTTGA ACTCTTCGGC CTCCTCTTCA AAAAAGAGCT TCCCTTTTAT ACCAGTGCAT TCCGCAGGGT GCGGCGTCCG CCAACCGACC CGGTCAACAG CCTGCTCAGT TTCGGCTATA CCCTGCTGCA CAACAACATC TTCTCCCTCG TGCGCATGAA GGGAATGAAC CCCTACCTCG GCTTTCTGCA CGCCGAAGAC AAAGGCAACC CGGCGCTGAT CAACGACCTC GTCGAAGAGT TCAGAACCAT TATCGACTCC ATGACGCTCT ACACCCTCAA CAAGGGAGTT TTGCGGAACA AGGATTTTTA TTATCGTAAG GACAAGGCGG GATGCTTTCT CACTGATGAG GCCCGGAAAA AGTTTCTGGA ACTCTTTGAA CAGCGAATGT GGGCAGAGTC GCTCGACCCG CAAAGTGGCA AAAGCCTGAA TATCCGGCGG CATATTGAGT CGCAGGTGGT CAAAATCAGC GAAGTGCTTG CCGGGACGAG GGCTGTTTAT GAACCCTGGC GGTCAGAATG GTAA
|
Protein sequence | MGWLYNQMAL PETLFQAWNK VVSNDGMAGY DKQSITDYSW RIEEHLADLG RQLLTNTYEP QPLLKLVMLK PTGKLRTLLI PTVMERVAQT AAAIVLTPLV ESELGANTFA YRPGLSRMTA AREIERLRNL GYNWVVDADI SSFFDTVDHP LLFQRFRELC DDEELLTLIA RWLTAEIVDG QNPKVKNTIG LPQGCPISPM LANLYLDKFD ERMEQEGFKL VRFADDFLIL CKSKPKAEAA LQLSESALAE LKLQLNNEKT RITTFAEGFK YLGYLFIRSL VLPTKMHPDE WYDKLGKLKL RKAPKSLLHP EESDDEEYEL QTGDSDAITV TKETLLQTEF GEKLLQSLEK QQIDVDKFLE KTAKEDALRQ KEKHEALNKL YSPLLNTLYL QEQGSILRKD GERFSVEKEG KQLNDIIVRR VEQILVLGNI TLTTPAMQYC MKSNIPITFV SQHGSYFGRL EATTADNSAL ERFQYLRSLD EPFALGIASA IVEAKIRNSR TMIQKRKAMA WESNGELKEK FDASLLLMTS LAEHTKSCDN MEALRGIEGK AAALYFELFG LLFKKELPFY TSAFRRVRRP PTDPVNSLLS FGYTLLHNNI FSLVRMKGMN PYLGFLHAED KGNPALINDL VEEFRTIIDS MTLYTLNKGV LRNKDFYYRK DKAGCFLTDE ARKKFLELFE QRMWAESLDP QSGKSLNIRR HIESQVVKIS EVLAGTRAVY EPWRSEW
|
| |