Gene RPB_3845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3845 
Symbol 
ID3911648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4390822 
End bp4396767 
Gene Length5946 bp 
Protein Length1981 aa 
Translation table11 
GC content69% 
IMG OID637885745 
Producthypothetical protein 
Protein accessionYP_487449 
Protein GI86750953 
COG category 
COG ID 
TIGRFAM ID[TIGR03174] CRISPR-associated protein Csc3 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGCTGT CCGCGATTCA GGAAGCCCTG AACATCAGCG ATGCGCCTGC GGCGACCGAC 
GATCCGATCT CGGCGCGGAG CGACAATTCG CCGGCGCTCT CGCCGCCGCC CGCTTTCAAC
GACGGCGCGC CGGAGCCGCG GATCGCAACC GACCGTCCCG GATTCGAACA ACACGACGAC
GACCGTCAGT TCGCGCAGCG CCCCGCCAAT GACGACCGCG AGACCATCGG CCAGTTGCTG
CAGGCGCTGC AGAAGGGCCG TCCGGCCCGC AACGTCTACA CCCTCGCCTC GATCTTCGCC
GGGGTCTGGA TCTTCGGCGG CATCGTCCTC ACCATCAGCT TCCTGCCCTC GCTGCAGGCG
ATGGTCGGCC AGGGCGCCGG CGGCACGATG GCGCTGGTCG GCCTGATCGC GCTGTTCCTC
GCCCCGGTGA TGCTGTTCTA CTTCCTCGCC AGCCTGACCT GGCGCGGCCA GGAACTGCGG
ATGATCGCGC AGTCGATGGC GCAGGTCGCG ATCCGATTCT CGGAGCCCGA GATCGCGGCG
AGCGATTCCG TCGTGACCGT CGGTCAGGCG ATCCGCCGCG AGGTCGCGGC GATGGGCGAC
GGCGTCGAGC GCGCGATCGC GCGCGCCGGC GAGCTCGAGA CCCTGGTCGC CAACGAGGTC
TCGGCGCTGG AACGCGCTTA CTCCGACAAC GAAGTCCGCA TCCGCGCCCT GCTGCAGGAC
ATCGCGCATC AGCGCGACAA TCTGGTCGGC CAGGCCGAGC AGGTGCGTAG CGCCATCTCC
GGCGTGCAGA TCGATCTGCG CCACGACATC GCGCTGATCT CCGACGCGAT CGCCAGCCGC
GTCGACGAAG TTGCGAAGAG CATCACCAGC GCGCTGGAAG AGCGCGGCGC CCACATCACC
GGTGCGCTGA GCCACGCCGG CGACAACATG ATCCTCGCCC TCGGCGAGCG CGGCGGCGAT
CTGCTCGACC GCCTGGAAGA AGCCAGCGCC GAGACCACGC GCGCCGTGCT CGACGCCAGC
GAGCGGCTGA CCGCCAGCCT GAATTTCAAG ACCGGGCACG TTCACGACGA ATTCGTCGAA
CTCGCCGACC GCGTCCACGA CATGCTCAAC GAGCGGCTCG ATCGCATCAC CGGCGAGTTC
GAACAGCGTT CGTCCAGCAT CGTCGACGGC ATTTCCGAGC GCACCGAGCA GGTCCACGAC
TCGCTGAAGA ACTCCTCGGA CTCGCTGCTG CTCGAACTCG AGCTGCGCAG CGGTGATCTC
GTCAGCAAGA TCGACGACGC CGGCAACCGC CTCGCAGACC AGATCCTGAC CAGCGGCGAC
AAGGCCAGCG AAGCGCTCGA CGTCACCGTG AATACGCTGG TGGCCAAGGT CGTCAGCCAG
ACCGAGAACG CCCACGACGC CCTCAGCATG CAGATGAGCG CGTTCGACGA TCTGGTGAAG
CAGCAGGGCT CCGAGCTCGC CGAGAAGTTC GCGCGCGATT CGGGCACGCT CGGCGCGCTG
ATCACCCGCC ACATCTCCGA ATTCGACCGC ACCGTGAAGA CCTACGGCGG CGACATCGTC
GAGCGCATGG GCCAGCGTAC CCAGGACATC TCGACCACGC TGAAGGACTA CGTCGACAAT
TTCGACCAGC GCGTCACCGC GCGCGGCGGC GAGCTCAGCG CCACGCTGGA TCAGCGCCTG
TCGCATTTCG AGAACGTGTT CGAGACCCGC GTCGGCAGCC TCGACGGCTC GCTCGACGTC
AAGATCAAGT TGCTGGACGA AACCGTCAGC GAACACCTGC GCTCGCTCGA AGACGGCTTC
GCATCGCGCA CCCAGATCGT CACCGAATCG ATCGACAGCC GGATCGCGCG GCTCTCGGAA
ACGCTGACCA CCGGCTCGGA TCAGGCGATC GTCGCCATCG ACAGCCGGAT CGCCGAACTG
TCGACCACGA TGTCGAGCGG CACGAGCCAG GCCGTCGATC TGATCGACAC CCGCATCGAC
AAGCTCTCCG GCACGCTGAC GGCGGGCGCG GTGCAGGCGA TCGAGGCAAT CGATCAGCGC
CTGTCGCATC TCACCAACTC GCTCACCGAC GGCACCGCGC AGGCGATCCA CGCGATCGAT
CAGCGCATCG GCAACGTCAC CGAGACGATC GACAGCCGCA GCGCCGGACT GGCGGCGACC
GTCAGCGCCC GCTTCCAGGA GATCCACGAA GGCCTCGAAG GCCGGGTCGG CACCGTGCTG
ACCGACATCG ACCTGCGCGT GTCGCAATTC GAGGACCTGC TCGGCTCGCG CATCGACGCG
GTGGCCGGTC GATTCGAGAA CACCGGGCGC GAAGCCAGCG ATCTGCTGAT GGCGCGCGCC
GAAGAACTCT CGCTCGGCAT CAAGTCGCAT GTCGCCGACG CCGAGCGTTC GCTCACCGGC
CTCGTCGTCA GCACCAGCGA CACGATCCAG AGCGGCACCC GCTCCGCCAA GGAAGCGATG
CTGTCGGTGT CGACCGAGGT CGGCGCGCAG CTGCAACTCA CCTCGTCCGA GATCGAACGC
GCCCTCACCG CGGTCGGCAC CAGCGCCGCG AACACGGTGC TGCTCAGCGC GACCGAGGCC
CAGTCCAAGC TGGTCGCAGC GTCGAGCGAT ACGCTCGCCC AGATCAAGTC GCTGTCGGCC
GACGTCGAGC GCACCCTCGC GGTCGCCGGC ACCTCGACCG CGGCATCGGT GCTGGCCGGC
GCGCGCGAGG CGCAGACCAC GCTGGTCGCC ACCTCGACCG ACGTCACCAA CGAGATCAGG
TCGCTGTCGG CCGAAATCCA GCGCACGCTG ACCGACGTCG GCGGCACCGC CTCGACCTCG
GTGCTGAACA GCGCCCGCGA GGCGCAGGCG ACCCTGGTGT CGTCGTCGTC CGACGTCGCC
AGCCAGATCC GCGCGCTGTC CGCCGATATC GAGCGCACCC TGACCGGAGC CGGCGACGCC
ACCTCGGCAT CGGTGCTGGC CGGCGCCCGC GAGGCGCAGT CGACCCTGCT GTCGGCCTCC
ACCGAGGTCA CCGGCCAGAT CCGGTCGCTG TCGTCGGATA TCGAACGCAC GCTGTACGCC
GCCGGCAATG CCACTGCGGA GTCCGTGCTC GGCGGCGCCC GCGCCGCCCA GACCACGCTG
GTGGCAGCCT CCGAAGAGGC CGCGGAGCGC GTCAAGTCGC TGAGCTCCGA CGTTCAGCGC
ACGCTGTCCG AAGCCGGATC CGCGACCGCC AGCGCGATCG TCGCCGGCGC GCGCGAAGCC
CAGAACACGC TGGTCGCCGC CTCCTCCGAG GCGTCGAGCC AGGTGATCTC GCTCGCCAGC
GATGTCGAAC GCAAGCTGAC GGCGGCCGGT AGCGCCACCG TGGAGACCCT GCTGTCCGGC
GCCCGCGAAG CCCAGCACAC CCTGGTGGCG ACCTCGACCG AGACGTCGAA CCAGCTCAAG
GCGCTGACCG GCGACGTCGA GCGCACCCTG ACCACCGCCG GCAGCGCCAC CGCCGGCGCC
ATTCTCGCCA GCGCCCGGGA CGTCCAGAGC ACGCTCGCCA ACGCTTCGAC CGCGGCCTCC
GATCAGGCCC GCTCGCTCGC CGCAGAGGTC GAACGCAGCC TGATCACCGC CGGCAAGACC
TCCGCCGAAT CGATCGTCAC CGGCGCGCGC GAAGCGCAGA CCACGCTGAT GACAGCCTCG
GAGGAAGCCG CCAACCACGT CAAGTCGCTG GCGATCGACG TCGAGCGCAC GCTGACCACG
GTCAGCGCCG CGGCGGCGTC GACCATTCTG GGCAGCGCCC GCGACGTGCA GTCGACCCTC
ACCAACGGCT CGGCGGAAGC GGCGAGCCAG ATCAAGGCGA TCTCCGCCGA CATCGAGCGC
ACGCTCGCCG GCGTCACCAT CAACACCACC GACAACATCC AGGTCAGCGC GGCGGCGGCC
CAGAGCACGC TGGTCGCGGT GTCGAACGAG GTCAGCTCGA AGATCAAGTC GACTTCCGCG
GAGATCGAAC GCTCGGTGTT CGCCGCCAGC GGCAGCTTCG GCTCGACCAT CACCGCCAAG
AGCGACGAGA TCGTCAGCTA CGTTCAGCAG CAGGCCGAGC GCCTGTCGCA GATCATCGAC
GGCAAGCGCG GCCTGCTGGT GGAAGCCATC TCCGGCAAGA CCAACCAGCT CACCATCGAG
ATCGACCGCG TCACCACCGA CGCGCTCGGC GCCATCGAAA GCCGCGGCAA GGCGTTCTCG
AATTCGATCC TGACCAACGG CAACGACGTC GCGCGCTCGA TCACCGGCGC CGGCGATCTC
GCCACCGGCG CGATCAACAA GTCGCTCAAG GATCTCGAGC TCGCCTCGCG CTCCGCGATC
GAACAGTCGC GCCAGGTCTC GGTCACCGCC GTCACCGAGA TGCAGGAGAC CAGCAAGATC
CTGCGCACCG ACACGGTCGC TTTGTTCGAG CGGCTGCGCG AAGGCAACAT CCTGCTGCAG
GAAGTGCTCA CCGGCGCCCA CGAGAACCTC AACTCGCTGG AGCGCGCGCT GGTCACCCGC
GTGGCGGATT TCGTCACCAC CATGAACGAC GTCAACGCCC GCAGCGGCGC CGCCACGCAG
GGCCTGGAAG ACCAGCTCAC CGGCTTCCAC GGCAAGACCG CCAAGGCGCT GGCCGATCTC
AGCGCGTTGT CCGAGCAGTT CGAGATGCAC GGTCAGGCCC TGGCCGAAGC CGCGGAGCTG
GTCCAGCAGA GCAACCGTAG CGCATCGACC TCGGTGCTGG AGCGCAAGGA ATCGCTGGAG
TCGCTGGTCA CCACCATCGA CCTGCGCACC GCCGATCTCG ACCAGCGGCT CACCCGCTTC
ACCGGCCTGC TCGACGAATC GCTCGCCGCG GCTGAGGAGC GCGCCCGCGA CATCGCGCGG
ATCGTCGCCG AGACCGCAGG CGCCGGCTCC GCCAGCATCA CGCAGCAGTT CGAAGCGGTG
CGCAGCGCCG CCGAGCAGGA GCGCCGCCTC ACCACCGAGG CGATGCACCA GGTCTATCAG
CAGGGCACCC AGGAAGCCGA TGCGCTGTTC AAGCAATCGG CCGACCGCTT CGCGGCGATC
GTCCAGGGGA TCAAGCAGAT GGCTTCGGAG ATGCACCACG AACTCGAAGC CACCCGCGAG
GAGCTGCGAC GCGGCGTGCT CGAAATCCCG CAGGAGGCCG CCGAGAGCAC CGCGCAGATG
CGCAAGGTGA TCGTCGACCA GATCGAGGCG CTGGCCGAAC TCAACCGCAT CGTCGCGCGC
CACGGCCGCG GCATGGATGT GGTCGGTACC ACCCGCGCCA GCGCGGTCCG CGACGAAGAG
CCGATGCTGG CGACCGCCGG CGGCCGCGGC GAGGCCATCG CGCGCGCGCC GGAGCCGATG
TCACGGCCGG AGCCGGTGTC GCGTGAACCC GCGTCGCGGC CGGCACAGCG CCGCGAGGCC
AGCGCCGCAA CGCTGCCGCC GCCGGACCTC GGCCTGCCGG GCCCTGCCCC GCGCCGCACC
GAGGCCCCGC CGGTCGCGCC GACCGGCGCC GATCCGGGCC GCGACGGCTG GCTGTCGGAT
CTCCTCAACC GCACCGACGC CGGTGCCGAG CGCGAACTTC CGCGCGCCCG GACGCCGCAG
CAGCCCACGC CGCAGCAGGC GATGAGCGGC AATCCGCTGG AATCGCTGTC GCTCGACATC
GGCCGGCTGA TGGACCGCAA CCTGGTGCTG GAGATGTGGG ATCGCTATCA GCGCGGCGAG
GCCAAGGCGT TCAGCAAGCG GCTGTACACG CCGGCCGGCC AGAAGGCGTT CGACGAGGTC
GCCCGCAAGT ATCGCAGCGA CCGCGCCTTC AAGCAGACGG TGGACCGCTA CATCACCGAG
TTCGAACGCC TGCTCGACGA AGTCGCCCGC GACGAACGCG GCCCGCAGGT GCTGCGCACG
CATCTGAGCT CGGAAACGGG CCTCGTGTAC ACGCTGCTGG CGCATGCGGC GGGGCGGCTG
GGGTAA
 
Protein sequence
MALSAIQEAL NISDAPAATD DPISARSDNS PALSPPPAFN DGAPEPRIAT DRPGFEQHDD 
DRQFAQRPAN DDRETIGQLL QALQKGRPAR NVYTLASIFA GVWIFGGIVL TISFLPSLQA
MVGQGAGGTM ALVGLIALFL APVMLFYFLA SLTWRGQELR MIAQSMAQVA IRFSEPEIAA
SDSVVTVGQA IRREVAAMGD GVERAIARAG ELETLVANEV SALERAYSDN EVRIRALLQD
IAHQRDNLVG QAEQVRSAIS GVQIDLRHDI ALISDAIASR VDEVAKSITS ALEERGAHIT
GALSHAGDNM ILALGERGGD LLDRLEEASA ETTRAVLDAS ERLTASLNFK TGHVHDEFVE
LADRVHDMLN ERLDRITGEF EQRSSSIVDG ISERTEQVHD SLKNSSDSLL LELELRSGDL
VSKIDDAGNR LADQILTSGD KASEALDVTV NTLVAKVVSQ TENAHDALSM QMSAFDDLVK
QQGSELAEKF ARDSGTLGAL ITRHISEFDR TVKTYGGDIV ERMGQRTQDI STTLKDYVDN
FDQRVTARGG ELSATLDQRL SHFENVFETR VGSLDGSLDV KIKLLDETVS EHLRSLEDGF
ASRTQIVTES IDSRIARLSE TLTTGSDQAI VAIDSRIAEL STTMSSGTSQ AVDLIDTRID
KLSGTLTAGA VQAIEAIDQR LSHLTNSLTD GTAQAIHAID QRIGNVTETI DSRSAGLAAT
VSARFQEIHE GLEGRVGTVL TDIDLRVSQF EDLLGSRIDA VAGRFENTGR EASDLLMARA
EELSLGIKSH VADAERSLTG LVVSTSDTIQ SGTRSAKEAM LSVSTEVGAQ LQLTSSEIER
ALTAVGTSAA NTVLLSATEA QSKLVAASSD TLAQIKSLSA DVERTLAVAG TSTAASVLAG
AREAQTTLVA TSTDVTNEIR SLSAEIQRTL TDVGGTASTS VLNSAREAQA TLVSSSSDVA
SQIRALSADI ERTLTGAGDA TSASVLAGAR EAQSTLLSAS TEVTGQIRSL SSDIERTLYA
AGNATAESVL GGARAAQTTL VAASEEAAER VKSLSSDVQR TLSEAGSATA SAIVAGAREA
QNTLVAASSE ASSQVISLAS DVERKLTAAG SATVETLLSG AREAQHTLVA TSTETSNQLK
ALTGDVERTL TTAGSATAGA ILASARDVQS TLANASTAAS DQARSLAAEV ERSLITAGKT
SAESIVTGAR EAQTTLMTAS EEAANHVKSL AIDVERTLTT VSAAAASTIL GSARDVQSTL
TNGSAEAASQ IKAISADIER TLAGVTINTT DNIQVSAAAA QSTLVAVSNE VSSKIKSTSA
EIERSVFAAS GSFGSTITAK SDEIVSYVQQ QAERLSQIID GKRGLLVEAI SGKTNQLTIE
IDRVTTDALG AIESRGKAFS NSILTNGNDV ARSITGAGDL ATGAINKSLK DLELASRSAI
EQSRQVSVTA VTEMQETSKI LRTDTVALFE RLREGNILLQ EVLTGAHENL NSLERALVTR
VADFVTTMND VNARSGAATQ GLEDQLTGFH GKTAKALADL SALSEQFEMH GQALAEAAEL
VQQSNRSAST SVLERKESLE SLVTTIDLRT ADLDQRLTRF TGLLDESLAA AEERARDIAR
IVAETAGAGS ASITQQFEAV RSAAEQERRL TTEAMHQVYQ QGTQEADALF KQSADRFAAI
VQGIKQMASE MHHELEATRE ELRRGVLEIP QEAAESTAQM RKVIVDQIEA LAELNRIVAR
HGRGMDVVGT TRASAVRDEE PMLATAGGRG EAIARAPEPM SRPEPVSREP ASRPAQRREA
SAATLPPPDL GLPGPAPRRT EAPPVAPTGA DPGRDGWLSD LLNRTDAGAE RELPRARTPQ
QPTPQQAMSG NPLESLSLDI GRLMDRNLVL EMWDRYQRGE AKAFSKRLYT PAGQKAFDEV
ARKYRSDRAF KQTVDRYITE FERLLDEVAR DERGPQVLRT HLSSETGLVY TLLAHAAGRL
G