Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1029 |
Symbol | |
ID | 4021505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1172349 |
End bp | 1175543 |
Gene Length | 3195 bp |
Protein Length | 1064 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637961221 |
Product | CRISPR-associated Cas5e family protein |
Protein accession | YP_568168 |
Protein GI | 91975509 |
COG category | [S] Function unknown |
COG ID | [COG3513] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01865] CRISPR-associated protein, Csn1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.169021 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAGT GTGTTACTCG GCGAATTCTT GGAATTGACC TCGGTATTGC ATCTTGCGGA TGGGGGGTCA TCGAGGTTGG CGAGGCTAGC GGAAGTATCA TTGCCTCCGG CGTTCGGTGC TTCGACGCGC CATTGATCGA CAAGACCGGT GAGCCGAAGA GCGCGACGCG GCGCACAGCG CGGGGGCAGC GGCGAATTAT TCGCCGCCGT CGGCAGCGCA TGAACGCCGT CCGGCGCCTG CTTGCTGAAT TTGGTGTGCT TACGGGTCGG TCACCGGATG CCCTACATCA GGCACTGCTC AGACTCTCGC AAAGCGTTGC GGGATCACAA GTCACGCCGT GGACGCTTCG CGCGGCCGCG CATGAGCGTA AGTTGACCAA TGACGAATTG GCAGTTGTTC TCGGTCATAT CGCCCGGCAC CGCGGCTTCC GCTCGAACTC CAAGAACGAT GGCGGCGCGA ATGCCGCGGA CGAAACGTCC AAGATGAAAA AAGCGATGGA GACCACGCGC GAAGGTCTCG CGAGATATCA TTCGTTCGGC GCCATGATCG CCAGCGATCC GAAATTCGCC GATCGAAAGC GCAACCGTGA CAAGGACTAT TCACACACAG CGAAGCGCTC CGATCTGGAA GACGAGGTTC GCACGATTTT TCGGTCGCAA ACCCGGTTCG GCAGCTTGGT TGCCAGTGAA AAACTGTCGC AGGCGTTTGC GGATGCGGCG TTCTTCCAGC GTCCGCTGCA GGACAGCGAA GACATGGTGG GTTCTTGCCC GTTCGAGCCG GGACAGAAGC GGACCGCACG CCGCGCACCT TCATTTGAGT TGTTCCGTTT CCTCTCCCGG CTGGCCAATC TCAAACTGAC CGTCGGCAGA GCGCCGGAAC GGCGGCTGAC CCCGGATGAA ATTGCACTGG CCGCCAAAGG CTTCGGCGAG ACCAAGAAGA GCATCACGTT CAAATCGTTG CGCGAGGCGC TCGATCTCGA TCCGAACGCG CGCTTTTCCG GCGTCGCGAA GGAGAAGGAG AGTACGCTCG ATGTCGCCGC GCGCACCGGG GGGGCCGCCT ACGGAACCAA GACGCTAAAG GATGCGCTGG GCGATGCGCC GTGGCGATCG TTATCGCGAA TGCCTGAAAA ACTCGATCGC ATCGCAGAGA TACTGTCATT CCGCGAGGAC ATGAAAGCGA TCCGGAACGG ACTTGAAGAA GTCGGGCTCG ATGGCCTTGT TGTCGATGCC CTCATGCAGG CCACGGCCAA CGGTGATTTC AAGGACTTTA CGCGGGCCGC GCACATCTCG GCGCTGGCGG CGCGGAATAT CATTCCCGGA TTGCGCGAAG GGCTGGTCTA TTCGGACGCC TGCACGCGCG TTGGTTACGA TCATGCCGCT CGGCCCGCGG TCCCCCTCAG TCAGATCGGC AGTCCGGTGA CCCGAAAGGC GCTGAGCGAG GCGCTCAAGC AAGTGAGGGC CGTGGCGCGT GAATATGGTC CGATCGATTA CTTTCATATC GAACTTGCAC GCTCCATCGG CAAGAGTGCC GAGGAGCGCA AGAAGCTGAC CGACGGCATC GAAGCGAGAA ATGTCGAGAA GGAGAAGCGG CGAAAGGAGG CCGCGGAACA CCTTGGCCGC GCGCCGAGCG ACGATGAACT GTTGCGATAC GAACTCGCCA AGGAACAGAA CTTCAAATGC ATCTATTCCG GTGATCCGAT CGACCCCGCA GGCATCTCGG CGAACGACAC ACGCTATCAG GTCGATCACA TTCTGCCATG GAGTCGCTTC GGCGACGATT CCTATGTGAA CAAGACGCTC TGCACCGCCA GATCGAATCA GAATAAGCGT GGCCGAACGC CGTTCGAATG GTTCGATGCC GACAAGACCG AAGCTGAATG GATGGAGTAT TCAGCCCGCG TCGAGGACCT CAAGGAGGTC AAGGGACGCA AGAAACGGAA CTACAGCATC AAGGATGCGG CGAGCGTCGA GGATAAGTTC AAGGCGCGCA ATCTCACAGA CACGCAATGG GCGACCCGGT TGCTCGCCGA CGAACTCAAG CGGATGTTTC CGCCGCGAGA GTGCGAGCGG GTGGTGACTG TCCGAGCGGA CGGCGGCAAC GATGGACTAT CGATCGTGGA GGAGCGCCGG GTGTTCACGC GACCCGGCGC TATCACATCC AAGCTCAGGC GGGCCTGGGG ACTCGAGGGA CTCAAGAAGC AGGATGGAAA GCGGGTCGAG GATGACCGGC ATCACGCCGT GGACGCGCTG GTGCTGGCCG CCACGACCGA GAGCCTGTTG AATCGCCTGA CCGTCGAGGT GCAACAGCGC GAACGCGAGG GACGCCAGGA CGACATCTTC CATTGCAGCC AGCCGTGGCC GGGTTTTCGG GTTGACGTGC AACGCACGGT CTATGGCTCC GAGACCATGC CGGGTATTTT CGTGTCGCGC GCCGAACGCC GCCGCGCCCG CGGCAAGGCA CATGACGCGA CCGTGAAGCA GATCCGTGAC ATCGATGGCG AGAGAATTGT CTTTGAACGC AAGCCGATTG AAAAACTGAC GGACAAGGAT CTGGAGAGGA TTCCGGTTCC GGAACCGTAC GGGAAGGCGG CCGATCCGAA AAAATTGCGC GATGAACTGG TGGAGAACCT CCGCGCGTGG ATCGCCGCCG GAAAACCCAA GGACAAGCCG CCGCGGTCGC CGAAGGGGGA TATCATTCGC AAGGTGCGGA TTGAGACCAA GGACAAGGTC GCAGTCGAAA TCAACGGCGG CACTGTCGAT CGTGGTGATA TGGCGCGCGT CGATGTTTTC AGGAAAAAGA ACAAGAAGGG CGTGTGGGAG TTCTATGTTA TTCCGATCTA TCCGCATCAG ATCGTTGCAT CCGCATTGCC GCCAAATAGG GCTGTCATTG CGTACAAGGC CGAAAGTGAG TGGACAGCGA TTGATGGCTG TTTCGAGTTT GCTTGGTCGC TCAATCCAAT GAGTTACCTT GAGCTTGTCA AATCAAACGG CGAGCTGATT GAGGGATACT TTCGCAGCAT GGATCGCACC ACGGGCGCGA TCAATCTGTC TCCAATGTCA ACCAACTCGG AAACGATCCG AAGCATCGGA GTCAAGACGC TGTCCAGTTT TCGCAAATTC ACCGTCGACC GTCTCGGACG TAAATTTGAA ATTCCGCGTG AGGTGCGTAC ATGGCGTGGC GAGGCCTGCA CCTGA
|
Protein sequence | MSECVTRRIL GIDLGIASCG WGVIEVGEAS GSIIASGVRC FDAPLIDKTG EPKSATRRTA RGQRRIIRRR RQRMNAVRRL LAEFGVLTGR SPDALHQALL RLSQSVAGSQ VTPWTLRAAA HERKLTNDEL AVVLGHIARH RGFRSNSKND GGANAADETS KMKKAMETTR EGLARYHSFG AMIASDPKFA DRKRNRDKDY SHTAKRSDLE DEVRTIFRSQ TRFGSLVASE KLSQAFADAA FFQRPLQDSE DMVGSCPFEP GQKRTARRAP SFELFRFLSR LANLKLTVGR APERRLTPDE IALAAKGFGE TKKSITFKSL REALDLDPNA RFSGVAKEKE STLDVAARTG GAAYGTKTLK DALGDAPWRS LSRMPEKLDR IAEILSFRED MKAIRNGLEE VGLDGLVVDA LMQATANGDF KDFTRAAHIS ALAARNIIPG LREGLVYSDA CTRVGYDHAA RPAVPLSQIG SPVTRKALSE ALKQVRAVAR EYGPIDYFHI ELARSIGKSA EERKKLTDGI EARNVEKEKR RKEAAEHLGR APSDDELLRY ELAKEQNFKC IYSGDPIDPA GISANDTRYQ VDHILPWSRF GDDSYVNKTL CTARSNQNKR GRTPFEWFDA DKTEAEWMEY SARVEDLKEV KGRKKRNYSI KDAASVEDKF KARNLTDTQW ATRLLADELK RMFPPRECER VVTVRADGGN DGLSIVEERR VFTRPGAITS KLRRAWGLEG LKKQDGKRVE DDRHHAVDAL VLAATTESLL NRLTVEVQQR EREGRQDDIF HCSQPWPGFR VDVQRTVYGS ETMPGIFVSR AERRRARGKA HDATVKQIRD IDGERIVFER KPIEKLTDKD LERIPVPEPY GKAADPKKLR DELVENLRAW IAAGKPKDKP PRSPKGDIIR KVRIETKDKV AVEINGGTVD RGDMARVDVF RKKNKKGVWE FYVIPIYPHQ IVASALPPNR AVIAYKAESE WTAIDGCFEF AWSLNPMSYL ELVKSNGELI EGYFRSMDRT TGAINLSPMS TNSETIRSIG VKTLSSFRKF TVDRLGRKFE IPREVRTWRG EACT
|
| |