Gene RPB_3780 details

Gene Information

Locus tag: RPB_3780
Symbol:
ID: 3911583
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4314012
End bp: 4317932
Gene length: 3921 bp
Protein length: 1306 aa
Translation table: 11
GC content: 69%
IMG OID: 637885681
Product: hypothetical protein
Protein accession: YP_487385
Protein GI: 86750889
COG category:
COG ID:
TIGRFAM ID:
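
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Neither assumption is stated in the record, and the function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(4314012, 4317932)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3921 1306
```

Under these assumptions the computed values agree with the gene length (3921 bp) and protein length (1306 aa) given in the record.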


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTAAGC CATCGTTAAC GGCTTCCGTG GTGGAGTGCG ATGCGGATGG CCGCGCCGTG 
TCGGCGCGTG GCGACGTGCG AGCAATGCGA GAAGGCGGGG CGCGGCCCGC CCGGCAGAAC
AGGGTCGTTG ACCGGAAGAT ACGAATGGCG CGGATGGCTG CCGCTGCAGT TTGGTCGCAG
GCGCTCTGTC GGCGCCTCGC ACGAGTCTGC CTGGCGTTAG GGCTGGTGCT CGCCGTCGCG
CTGTCGCCCC GGCCGGCGTC CGCGCAGGCG GTTCGCGGCG AAGCGATTTT CGAGTCGGGC
GGAGGCTACG GGCGGCTGCT GTTCAAGCTC GCCGAGGACG TCGAGTCCGA CGTCGTCATG
GCCGGCCTGA TCGTGGTGAT CCGGTTCAAG CGGCCGGTGG ATATTTCGAT CGACAAGCTC
GCGGAGTCCG CGCCGAACTA CATCGGCTCC GCGCGGCTGG ATCCCGACGG CTACGCGGTT
CGGTTGGCGC TGCTGCGCAA GTTCAGCGTC AATCCGATGT CCGCGGGCGA GCGGCTGTAT
ATCGATTTTC TGCCGGACAA TTGGGTCGGC GCGCCGCCCG GCCTGCCTCC GGACGTCGTC
AAGGCGCTGG CCGAACGCGC CCGCGCGGCG GAGCGCGCGC TACGCGCCCA ACAGGCCAAG
GCGGATGTCA AGAAGAAGCC GCCGATCCGC GTTCGGGCGT CGATGCAGCC GACTTTCGTG
CGCTACATCT TCGAAGTGCC GCCCGGCGTG CAGGTGTCGT CGACGCTGAC CGACAAGAAG
CTGACGGTGC TGTTCAACAC CGGACTGAGT TTCGATCTCG CCGATGCACA ACTCGCCGGC
GCGCCGAATG TCGGATCGAT CGGCCAGAAG ATCGACGGTG ACGGCTCGAG CGTGGACTTC
GCGCTGATTG GCGGCGCCGA CGTGCGTTCG TTTCGTGAGG ACAAGAACTT CATCGTCGAC
GTCAGTTTTC AGCCGCAGGA CGCCGAGCCG TCGAAGAAGA CGTCGCAGGC GCCCATCCTG
CCCGAAATCG CCCGGCTCGA ACGCGAAGCC GCGCCCGCCG CACCGCGCGC CCCGGATGTC
GCCCGAGCCG AGGCGAAGCC TGAGACCAAA TCCGATGGCA AATCTGAAGC GAAGTCCGGT
ACCAAGCCGG AGGTGAAGCC GGCTGCCGCC GCAGCGACGC CGCCGGTGAT CGCGCCGCCC
CAGGCCGTCG CATCACCTGC CGGCCCGGCG GCGCGCAGCG ACGCTCCAGC CCCTGCCGCG
GTCGCGGCAC CGGCTGTCGC GGCGCCTGTG GTCACGGCCC CGGCCGCGCC GGCTCCGGCC
ACGGTGGCGC CGGTTGCGGC GCCGCCTCCC GCAGCGAGCG AGACAACTCG GCCGTCGACG
GAGACGGCCG TCCGCGCCGG CGCTCGCGCC GCGGCGGTCG AGGCTCCGAA GCTGCCACCT
GCGCCCGAAC CGCCGGCGGC GGCGGCGAAC CCGGCTTTAT CCAGCGCAGC ACGACCGGTC
GAGGCGCGCC GCAATACCGA TGGCCTGCAC CTCGCATTCG CGTTCGCGGC GCCGACGCCG
GCCGCGCTGT TCCGGCGCGG CGACATCATC TGGCTGGTAC TCGACAACAC GACACCGTTC
GACCTCACTT CGATCCGCCG CGAAGGTGGC GGCATCGTCG GAGATGTCAG CCGCGTCGAG
CTCCCGAACG GGCAAGCGAT CCGCCTGCGT CTCGATCGTC CGCAACTCGC GACGCTGAGC
GACGACGACG GCTCCGGCAA GAACTGGTCG ATCACGCTCG CGGATTCGGG GCGCGGCGCC
GCGCGGCCGT TGACCGCGGT GCGCAACATC GCCGATCCGG CCCGCGCCAC CGTTGCCGTG
GCGCTCGCCG GCCTCGGCCA GATGCATCGC CTGACCGATC CGGAGGCCGG CGACGCGCTG
ACGGTGATCA CCGCGCTGCC GCCGCCGCGC GGCTTCATCA AGCGGCAGAG TTTCGTCGAG
TTCAGCCTGC TGGAATCGCT GCACGGTGTG GTCATCGAAC TGAAATCCGA CGACGTCACC
GTCGAGACCG TCGCCGATGC GGTGGTGCTG TCGCGGCCCG GTGGACTGAC GCTGTCGTCG
GCCGAGCCGG CCGGGCAGGC CGGCTCTTCG GCAGAGCGGC CGTTCTTCGA CATCACCCAA
TGGGCCAAAG ACCAGGAGGG GCGTTTCTCC GACGCGCTCG ACGCGCGGAT CAGGACGGCG
TCGACCGCCA CCGGCGACGA CCGGCTGCCG GCGCGGCTCG ATCTGGCGCG GTTCTTCATG
GCGCGCGGTC TGTATCACGA AGCCAAGGGC GCGCTCGACC TGGCGCTGCT CGGCGTCAAG
CCCGGACAGG AAGACGTCGC GACGATGATC GGCCACGCCG CGGCGAGCGC GCTGATGCAG
CGGCCGGAGC AGACACTGAA GGATGTCGCC AATCCGGTGA TCGCCTCGAC CTACGACGCG
CAATTGTGGA AAGGCGTCGC GCTGGCGGGC CAGGGCAAAT GGCCGGAGGC GCGCGAGAAG
CTGAAATCGG TGCAGTTCGC GATTACCGCG CTGCCGCTCG ACATGCAGCG CGAGGTGCTG
GCCACCGCGA TGCGCGCCTC GCTGGAAGTC CGCGACTACG CCGGCGCCGC CAAGATCAGC
AGCGATTTCG ATCTGGTCGG CATTCCGCCG GAGATGAAGC CCCCGCTCGC GGTGATGCGC
GGCTGGCTCG ACGAGGCGCT CGGGCGGGAT CCGGAGGCGC TCAAGAGGTA CAAGGAAGCG
ATGGCGTCGG CCGATCGCCA GGCCGCCAGC GAAGCCAAGT TCCGCGACGT CGTGCTGCGC
AGCAAGCGCG GTGAGATGAC GCCGGAGGAA GCGCTGCCCG AGCTCGAACG GCTGTCGACG
ACGTGGCGCG GCGACGATCT CGAAGTCCGC ACCCAGCAAT TGCTGTCGAA GCTCTACGCC
AATGCCGGCC GCTACCGCGA TTCACTGGCG GCGGCGCGGA CCGCGACGCA ACTCGCGCCG
AATTCCGAAT ATGCCCGCCA GGCGCAGGAC GACAGCCGGG CATTGTTCTC GCAGCTGTTC
CTCGGCAACA AGGGCGACGA CATTCCGCCG ATCGAGGCGC TGGCGACGTT CTACGAATTT
CGCGAGCTGA CGCCGATCGG CCGCCGCGGC GACGAGATGA TCCGCAGGCT CGCCGACCGC
CTGGTCGCGG TCGATCTGCT GGATCAGGCG AGCGAGCTGT TGCAGTACCA GGTCGACAAG
CGGCTCGAAG GCGCCGCGCG CGCCCAGGTC GCGGCGCGGC TGGCGATGGT CTATCTGATG
AACCGCAAGC CGGATCGCGC GATCGCCGCG CTGCATTCGT CGCGAATCGC CGATCTCGCC
GGTGAATTGC GCCAGCAGCG GCTGCTGCTC GAGGGGCGGG CGCAGAGCGA CATCGGCCGC
CACGATCTCG CGCTCGACAT CATCACCAAT ATCAGTGGCC GCGAGGCGAT CCGGCTGCGC
TCCGACATCT ACTGGGCGTC GCGGCGCTGG CGCGAATCCT CCGAACAGAT CGAACTGTAT
CTCGGCGACC GCTGGCGCGA TTTCACGCCG CTGTCGCAGG CCGAGAAAAG CGACGTCATC
CGCGCCGTGG TCGGCTACGC GCTGGCCGAG GATGCGCTCG GTCTCGACCG TTTCCGAGAG
AAGTTCGCGC CGTTGATGAC CGACCCGGCC GACCGCGCCG CGTTCGACAT CGCCAGCAGG
CCGGCCGCGG GCGATACCGC GGCGTTTGCC GCGATCGCCA AGATGGCGGC GAGCGTCGAT
ACGCTGGAGG GCTTCCTGCG CGAAATGAAG CAGCGCTTCC CCGACGCCAG CGCCCGCGCC
ACGCCGCCCG GCGCCGACAT GACGTCGACC GGCGCGCTGC CGGAGATCCC GAAGATCCGC
GTCATCAAGA TGACGCGGTA G
 
Protein sequence
MVKPSLTASV VECDADGRAV SARGDVRAMR EGGARPARQN RVVDRKIRMA RMAAAAVWSQ 
ALCRRLARVC LALGLVLAVA LSPRPASAQA VRGEAIFESG GGYGRLLFKL AEDVESDVVM
AGLIVVIRFK RPVDISIDKL AESAPNYIGS ARLDPDGYAV RLALLRKFSV NPMSAGERLY
IDFLPDNWVG APPGLPPDVV KALAERARAA ERALRAQQAK ADVKKKPPIR VRASMQPTFV
RYIFEVPPGV QVSSTLTDKK LTVLFNTGLS FDLADAQLAG APNVGSIGQK IDGDGSSVDF
ALIGGADVRS FREDKNFIVD VSFQPQDAEP SKKTSQAPIL PEIARLEREA APAAPRAPDV
ARAEAKPETK SDGKSEAKSG TKPEVKPAAA AATPPVIAPP QAVASPAGPA ARSDAPAPAA
VAAPAVAAPV VTAPAAPAPA TVAPVAAPPP AASETTRPST ETAVRAGARA AAVEAPKLPP
APEPPAAAAN PALSSAARPV EARRNTDGLH LAFAFAAPTP AALFRRGDII WLVLDNTTPF
DLTSIRREGG GIVGDVSRVE LPNGQAIRLR LDRPQLATLS DDDGSGKNWS ITLADSGRGA
ARPLTAVRNI ADPARATVAV ALAGLGQMHR LTDPEAGDAL TVITALPPPR GFIKRQSFVE
FSLLESLHGV VIELKSDDVT VETVADAVVL SRPGGLTLSS AEPAGQAGSS AERPFFDITQ
WAKDQEGRFS DALDARIRTA STATGDDRLP ARLDLARFFM ARGLYHEAKG ALDLALLGVK
PGQEDVATMI GHAAASALMQ RPEQTLKDVA NPVIASTYDA QLWKGVALAG QGKWPEAREK
LKSVQFAITA LPLDMQREVL ATAMRASLEV RDYAGAAKIS SDFDLVGIPP EMKPPLAVMR
GWLDEALGRD PEALKRYKEA MASADRQAAS EAKFRDVVLR SKRGEMTPEE ALPELERLST
TWRGDDLEVR TQQLLSKLYA NAGRYRDSLA AARTATQLAP NSEYARQAQD DSRALFSQLF
LGNKGDDIPP IEALATFYEF RELTPIGRRG DEMIRRLADR LVAVDLLDQA SELLQYQVDK
RLEGAARAQV AARLAMVYLM NRKPDRAIAA LHSSRIADLA GELRQQRLLL EGRAQSDIGR
HDLALDIITN ISGREAIRLR SDIYWASRRW RESSEQIELY LGDRWRDFTP LSQAEKSDVI
RAVVGYALAE DALGLDRFRE KFAPLMTDPA DRAAFDIASR PAAGDTAAFA AIAKMAASVD
TLEGFLREMK QRFPDASARA TPPGADMTST GALPEIPKIR VIKMTR
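
The two sequences above are printed in blocks of ten characters, so they need light cleaning before any analysis. The following is a minimal sketch, not the database's own method, of how the gene sequence could be stripped of whitespace, translated, and checked for GC content. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for amino acids. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon assignments, ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
print(translate("ATGGTTAAGC CATCGTTAAC GGCTTCCGTG"))  # MVKPSLTASV
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein and the reported GC content.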