Gene RPD_1029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1029 
Symbol 
ID4021505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1172349 
End bp1175543 
Gene Length3195 bp 
Protein Length1064 aa 
Translation table11 
GC content59% 
IMG OID637961221 
ProductCRISPR-associated Cas5e family protein 
Protein accessionYP_568168 
Protein GI91975509 
COG category[S] Function unknown 
COG ID[COG3513] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01865] CRISPR-associated protein, Csn1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.169021 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAGT GTGTTACTCG GCGAATTCTT GGAATTGACC TCGGTATTGC ATCTTGCGGA 
TGGGGGGTCA TCGAGGTTGG CGAGGCTAGC GGAAGTATCA TTGCCTCCGG CGTTCGGTGC
TTCGACGCGC CATTGATCGA CAAGACCGGT GAGCCGAAGA GCGCGACGCG GCGCACAGCG
CGGGGGCAGC GGCGAATTAT TCGCCGCCGT CGGCAGCGCA TGAACGCCGT CCGGCGCCTG
CTTGCTGAAT TTGGTGTGCT TACGGGTCGG TCACCGGATG CCCTACATCA GGCACTGCTC
AGACTCTCGC AAAGCGTTGC GGGATCACAA GTCACGCCGT GGACGCTTCG CGCGGCCGCG
CATGAGCGTA AGTTGACCAA TGACGAATTG GCAGTTGTTC TCGGTCATAT CGCCCGGCAC
CGCGGCTTCC GCTCGAACTC CAAGAACGAT GGCGGCGCGA ATGCCGCGGA CGAAACGTCC
AAGATGAAAA AAGCGATGGA GACCACGCGC GAAGGTCTCG CGAGATATCA TTCGTTCGGC
GCCATGATCG CCAGCGATCC GAAATTCGCC GATCGAAAGC GCAACCGTGA CAAGGACTAT
TCACACACAG CGAAGCGCTC CGATCTGGAA GACGAGGTTC GCACGATTTT TCGGTCGCAA
ACCCGGTTCG GCAGCTTGGT TGCCAGTGAA AAACTGTCGC AGGCGTTTGC GGATGCGGCG
TTCTTCCAGC GTCCGCTGCA GGACAGCGAA GACATGGTGG GTTCTTGCCC GTTCGAGCCG
GGACAGAAGC GGACCGCACG CCGCGCACCT TCATTTGAGT TGTTCCGTTT CCTCTCCCGG
CTGGCCAATC TCAAACTGAC CGTCGGCAGA GCGCCGGAAC GGCGGCTGAC CCCGGATGAA
ATTGCACTGG CCGCCAAAGG CTTCGGCGAG ACCAAGAAGA GCATCACGTT CAAATCGTTG
CGCGAGGCGC TCGATCTCGA TCCGAACGCG CGCTTTTCCG GCGTCGCGAA GGAGAAGGAG
AGTACGCTCG ATGTCGCCGC GCGCACCGGG GGGGCCGCCT ACGGAACCAA GACGCTAAAG
GATGCGCTGG GCGATGCGCC GTGGCGATCG TTATCGCGAA TGCCTGAAAA ACTCGATCGC
ATCGCAGAGA TACTGTCATT CCGCGAGGAC ATGAAAGCGA TCCGGAACGG ACTTGAAGAA
GTCGGGCTCG ATGGCCTTGT TGTCGATGCC CTCATGCAGG CCACGGCCAA CGGTGATTTC
AAGGACTTTA CGCGGGCCGC GCACATCTCG GCGCTGGCGG CGCGGAATAT CATTCCCGGA
TTGCGCGAAG GGCTGGTCTA TTCGGACGCC TGCACGCGCG TTGGTTACGA TCATGCCGCT
CGGCCCGCGG TCCCCCTCAG TCAGATCGGC AGTCCGGTGA CCCGAAAGGC GCTGAGCGAG
GCGCTCAAGC AAGTGAGGGC CGTGGCGCGT GAATATGGTC CGATCGATTA CTTTCATATC
GAACTTGCAC GCTCCATCGG CAAGAGTGCC GAGGAGCGCA AGAAGCTGAC CGACGGCATC
GAAGCGAGAA ATGTCGAGAA GGAGAAGCGG CGAAAGGAGG CCGCGGAACA CCTTGGCCGC
GCGCCGAGCG ACGATGAACT GTTGCGATAC GAACTCGCCA AGGAACAGAA CTTCAAATGC
ATCTATTCCG GTGATCCGAT CGACCCCGCA GGCATCTCGG CGAACGACAC ACGCTATCAG
GTCGATCACA TTCTGCCATG GAGTCGCTTC GGCGACGATT CCTATGTGAA CAAGACGCTC
TGCACCGCCA GATCGAATCA GAATAAGCGT GGCCGAACGC CGTTCGAATG GTTCGATGCC
GACAAGACCG AAGCTGAATG GATGGAGTAT TCAGCCCGCG TCGAGGACCT CAAGGAGGTC
AAGGGACGCA AGAAACGGAA CTACAGCATC AAGGATGCGG CGAGCGTCGA GGATAAGTTC
AAGGCGCGCA ATCTCACAGA CACGCAATGG GCGACCCGGT TGCTCGCCGA CGAACTCAAG
CGGATGTTTC CGCCGCGAGA GTGCGAGCGG GTGGTGACTG TCCGAGCGGA CGGCGGCAAC
GATGGACTAT CGATCGTGGA GGAGCGCCGG GTGTTCACGC GACCCGGCGC TATCACATCC
AAGCTCAGGC GGGCCTGGGG ACTCGAGGGA CTCAAGAAGC AGGATGGAAA GCGGGTCGAG
GATGACCGGC ATCACGCCGT GGACGCGCTG GTGCTGGCCG CCACGACCGA GAGCCTGTTG
AATCGCCTGA CCGTCGAGGT GCAACAGCGC GAACGCGAGG GACGCCAGGA CGACATCTTC
CATTGCAGCC AGCCGTGGCC GGGTTTTCGG GTTGACGTGC AACGCACGGT CTATGGCTCC
GAGACCATGC CGGGTATTTT CGTGTCGCGC GCCGAACGCC GCCGCGCCCG CGGCAAGGCA
CATGACGCGA CCGTGAAGCA GATCCGTGAC ATCGATGGCG AGAGAATTGT CTTTGAACGC
AAGCCGATTG AAAAACTGAC GGACAAGGAT CTGGAGAGGA TTCCGGTTCC GGAACCGTAC
GGGAAGGCGG CCGATCCGAA AAAATTGCGC GATGAACTGG TGGAGAACCT CCGCGCGTGG
ATCGCCGCCG GAAAACCCAA GGACAAGCCG CCGCGGTCGC CGAAGGGGGA TATCATTCGC
AAGGTGCGGA TTGAGACCAA GGACAAGGTC GCAGTCGAAA TCAACGGCGG CACTGTCGAT
CGTGGTGATA TGGCGCGCGT CGATGTTTTC AGGAAAAAGA ACAAGAAGGG CGTGTGGGAG
TTCTATGTTA TTCCGATCTA TCCGCATCAG ATCGTTGCAT CCGCATTGCC GCCAAATAGG
GCTGTCATTG CGTACAAGGC CGAAAGTGAG TGGACAGCGA TTGATGGCTG TTTCGAGTTT
GCTTGGTCGC TCAATCCAAT GAGTTACCTT GAGCTTGTCA AATCAAACGG CGAGCTGATT
GAGGGATACT TTCGCAGCAT GGATCGCACC ACGGGCGCGA TCAATCTGTC TCCAATGTCA
ACCAACTCGG AAACGATCCG AAGCATCGGA GTCAAGACGC TGTCCAGTTT TCGCAAATTC
ACCGTCGACC GTCTCGGACG TAAATTTGAA ATTCCGCGTG AGGTGCGTAC ATGGCGTGGC
GAGGCCTGCA CCTGA
 
Protein sequence
MSECVTRRIL GIDLGIASCG WGVIEVGEAS GSIIASGVRC FDAPLIDKTG EPKSATRRTA 
RGQRRIIRRR RQRMNAVRRL LAEFGVLTGR SPDALHQALL RLSQSVAGSQ VTPWTLRAAA
HERKLTNDEL AVVLGHIARH RGFRSNSKND GGANAADETS KMKKAMETTR EGLARYHSFG
AMIASDPKFA DRKRNRDKDY SHTAKRSDLE DEVRTIFRSQ TRFGSLVASE KLSQAFADAA
FFQRPLQDSE DMVGSCPFEP GQKRTARRAP SFELFRFLSR LANLKLTVGR APERRLTPDE
IALAAKGFGE TKKSITFKSL REALDLDPNA RFSGVAKEKE STLDVAARTG GAAYGTKTLK
DALGDAPWRS LSRMPEKLDR IAEILSFRED MKAIRNGLEE VGLDGLVVDA LMQATANGDF
KDFTRAAHIS ALAARNIIPG LREGLVYSDA CTRVGYDHAA RPAVPLSQIG SPVTRKALSE
ALKQVRAVAR EYGPIDYFHI ELARSIGKSA EERKKLTDGI EARNVEKEKR RKEAAEHLGR
APSDDELLRY ELAKEQNFKC IYSGDPIDPA GISANDTRYQ VDHILPWSRF GDDSYVNKTL
CTARSNQNKR GRTPFEWFDA DKTEAEWMEY SARVEDLKEV KGRKKRNYSI KDAASVEDKF
KARNLTDTQW ATRLLADELK RMFPPRECER VVTVRADGGN DGLSIVEERR VFTRPGAITS
KLRRAWGLEG LKKQDGKRVE DDRHHAVDAL VLAATTESLL NRLTVEVQQR EREGRQDDIF
HCSQPWPGFR VDVQRTVYGS ETMPGIFVSR AERRRARGKA HDATVKQIRD IDGERIVFER
KPIEKLTDKD LERIPVPEPY GKAADPKKLR DELVENLRAW IAAGKPKDKP PRSPKGDIIR
KVRIETKDKV AVEINGGTVD RGDMARVDVF RKKNKKGVWE FYVIPIYPHQ IVASALPPNR
AVIAYKAESE WTAIDGCFEF AWSLNPMSYL ELVKSNGELI EGYFRSMDRT TGAINLSPMS
TNSETIRSIG VKTLSSFRKF TVDRLGRKFE IPREVRTWRG EACT