Gene RPB_3009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3009 
Symbol 
ID3910808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3429731 
End bp3432913 
Gene Length3183 bp 
Protein Length1060 aa 
Translation table11 
GC content67% 
IMG OID637884915 
Productribonuclease 
Protein accessionYP_486622 
Protein GI86750126 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1530] Ribonucleases G and E 
TIGRFAM ID[TIGR00757] ribonuclease, Rne/Rng family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00611074 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.333184 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAACA AGATGCTGAT CGACGCCACC CACCCGGAAG AGACCCGGGT GGTCGTGGTC 
CGCGGCAACC GCGTCGAAGA GTTTGATTTC GAGACCGCAC AGCGCAAACA ACTGCGTGGC
AACATCTACC TCGCCAAAGT GACACGGGTC GAACCGTCGT TGCAGGCCGC CTTCATCGAA
TACGGCGGCA ACCGGCACGG ATTCCTCGCT TTCAGCGAAA TCCATCCCGA CTACTACCAG
ATCCCCGTCG CCGATCGTCA GGCCCTGATC GAGGCCGACG AGCGGGCGCA TCGCGAGGCC
GAGGAAGAAA ACGAGCAGCG CTCGACCCGC CGCCGGTCGA GGCACCGCAG CCATCGCCGC
CGCGGCAGTG GCGAACGGGT CACCAGCGAG ATCGTCGAGG CTTCCGCCGG CGCCGTGGCC
GAGCCGCACG ATCAATTCAG CGATCTTGCC GACAGCGATC CTGCCGACAA TGGACACGCG
CACGCCGACC ACGAACGGCC GATGCACGAC ATTCCTGCGG ATGACGCCGG CGATGTGACG
CAGGCCTCGG CGGACAATGC CGAAGGCCAC GACGGTCATC ACGACGCCGA GCGCCAGGGA
TACCATGCGG CTTCGGCTGA GTATCTCGAC GAGACGGCGG TCGCGCCGAG CATCGCCGAC
GGCGAGCAGC CCACCCTGCC CGACTTCGGC GCCACGCCGC ACGCCCAGAC CCTCTCCCAG
GACGTTGAAG AGAATAGCGG TCACGCCGAA GATGCGCCGT TCCCGATCGC CGAAGGCGCT
GAGCAAGAAC ATGGCGATGA CGCCGGCGAC GAATTGAACG ACGCGGATGG CGACGCGGAC
CTGGAGGACG CTGCCCGGTC CGACACCGAA GCAGACGACG AGGATGAAGA CGAGGACGAG
GACGACGAGG CCGAGGAAGA CATCGTCGAA TCCGTCGGCG GCGACGATGT GCTGGAAGAA
GTTCCCGAAC GCGCCTTCCG GCCGCGCCGG CAGTACAAAA TCCAGGAAGT CATCAAGCGC
CGTCAGGTGA TGCTGGTGCA GGTGGTCAAG GAAGAGCGCG GCAACAAGGG CGCGGCGCTG
ACGACCTACC TGTCGCTCGC CGGCCGTTAC GCCGTTCTGA TGCCGAACAC CGCGCGCGGC
GGCGGCATCA GCCGCAAGAT CACGTCGGCG CAGGATCGCT CCCGCCTCAA GGAAGTGGTT
CAGGATCTCG ACGTGCCGGA AGGCATGGGC ATCATCCTGC GCACCGCCGG CGCCTCGCGC
ACCAAGCCGG AGATCAAGCG CGACTTCGAA TATCTGATCC GGATGTGGGA GACGGTCCGC
GACGTCACGC TGAAGTCGCA GGCGCCGACG CTCGTCTACG AGGAAGGCTC GCTGATCAAG
CGCTCGCTAC GCGATCTCTA CAACAAGGAG ATCGACGAAA TCCAGGTCGC GGGCGAAGCC
GGCTACCAGG AAGCGCGCGA CTTCATGCAC ATGCTGATGC CGTCCAACGT TCGCGCGGTG
AAGCTGTATC GCGACGGGCA ACCTCTGTTC TCGCGAATGG GAGTCGAGAG CCAGCTCGAC
GCGATGTTCT CGCCGACCGT GCAGTTGCGG TCCGGCGGCT ACATCGTGAT CAACCAGACC
GAGGCGTTGG TCTCGATCGA CGTCAACTCC GGTCGCTCGA CCCGCGAGCA CCATATCGAG
GACACCGCGC TCAAGACCAA CCTCGAGGCC TCGGAGGAAG TCGCGCGGCA GCTGCGCCTG
CGCGACCTCG CCGGCCTGAT CGTCATCGAC TTCATCGACA TGGACGAGAA GCGCAACAAT
CGCGCTGTCG AGCGCAAGCT CAGCGATTGC CTGCGGCAGG ATCGCGCCCG CATCCAGGTC
GGTCGGATCT CGCATTTCGG CCTGCTCGAA ATGTCGCGCC AGCGCATCCG CGCCAGCGTG
CTGGAAAGCT CGACCGAGCC CTGCCCGCAT TGCGGCGGCT CCGGCCACGT GCGGTCGGTC
TCGTCGGTGG CGCTGCAATT GCTGCGCGGT CTCGAAGAGA TCCTGATGAA GGGCGCGACC
CATAATCTGA TCGTGCGCAC CCGCGCCGAT GTCGCCCTCT ATGTCCTCAA TCAGAAGCGC
GGGCACCTGC GCGATCTCGA AACCGGCTTC AGGGTGACGC TGTCGGTCGT GTCGGATCCG
ACCGTGACCG GACAGCAGTC GTTCGTGATC GATCGCGGCG AGCAGGTTCA CACGTTGGAA
TCCGCCAAGG CGCTGCTCGC GGCGCAGGTC GCGGCCTATC CGGTCCAGGC CGAAGAGCCT
TTTGAGGACG ACGAGGGTTA CGAGTTCGAG GCCGAGATCG AAACCGACGA GACCGTCGGC
CTGGCCGATG AGCACGCCAG CGACAGTGGC GAAGGCGAAG GAGAAGGCGA CGCCCGCAAG
CGCAAGCGCC GCCGCCGCCG CCGCTCGCGC ACCGGCGAGG CCCGTGAAGC GGGCGCGCCG
CGCGAGGACG GCGATGCTGC GCTCGTTCCC GAGGTTTTGG AAGCCGTTTC CGAAGACGAC
AGCGAAGACG GCGACGACAG CGCCGACGGC GCTGAAGGGG ACGGTGACGC GCGGGCCGAT
CAGGCGAATG GCGAGCGTCG TCCGCGTCGC CGTGGCCGCC GCGGCGGTCG CCGCCGGCGT
GGCGCTGCCG AGGGAGGCCT CGAAGAAGGC GCCTCCGGCT CGATCGTCGA CGACATCGGA
GAGTTGGAGA CCTCGGAGGC CGCCGAGGCG GCCGCCGATA TGGACGGCGG TGGATCGGAC
GGACGCTTGC TCGGCAAGCG GCACGAGCCG GACATCGATG ACGATGAGAT GGCAGAGCCG
GTCACGGCTA CGCTTTCCCC CGTTGCCAGC GAGCCGGAAG AACGCGCAGA GCCGGTGCAG
GCGCAGCAGA GCGAGCCGGT CGCCTCGGCC CCCGCCCAAG ACGCCGCTCC GTCGGCGCAG
GCCCAGGACG ACGCCGCAGC CGAGCGCGCC GCCGCACGCC GCCGGTCGAC GGTGCGCGAA
AAGGTGAATT TCGGGTCCTC GGAACCGAAG GTGGAGGCGT CGACGCCGCT GGCGATCGAG
CCGGCAGCGC AGCAGCCCGA ACCGGCACCG GCCATTCCGG AGCCACAGGC CGCTGCCGAG
CCGGCTCCCG AACCTTCCCG TCGCGCCGGA TGGTGGTCGC GCCGCTTCGG CGGCGGCAAC
TGA
 
Protein sequence
MPNKMLIDAT HPEETRVVVV RGNRVEEFDF ETAQRKQLRG NIYLAKVTRV EPSLQAAFIE 
YGGNRHGFLA FSEIHPDYYQ IPVADRQALI EADERAHREA EEENEQRSTR RRSRHRSHRR
RGSGERVTSE IVEASAGAVA EPHDQFSDLA DSDPADNGHA HADHERPMHD IPADDAGDVT
QASADNAEGH DGHHDAERQG YHAASAEYLD ETAVAPSIAD GEQPTLPDFG ATPHAQTLSQ
DVEENSGHAE DAPFPIAEGA EQEHGDDAGD ELNDADGDAD LEDAARSDTE ADDEDEDEDE
DDEAEEDIVE SVGGDDVLEE VPERAFRPRR QYKIQEVIKR RQVMLVQVVK EERGNKGAAL
TTYLSLAGRY AVLMPNTARG GGISRKITSA QDRSRLKEVV QDLDVPEGMG IILRTAGASR
TKPEIKRDFE YLIRMWETVR DVTLKSQAPT LVYEEGSLIK RSLRDLYNKE IDEIQVAGEA
GYQEARDFMH MLMPSNVRAV KLYRDGQPLF SRMGVESQLD AMFSPTVQLR SGGYIVINQT
EALVSIDVNS GRSTREHHIE DTALKTNLEA SEEVARQLRL RDLAGLIVID FIDMDEKRNN
RAVERKLSDC LRQDRARIQV GRISHFGLLE MSRQRIRASV LESSTEPCPH CGGSGHVRSV
SSVALQLLRG LEEILMKGAT HNLIVRTRAD VALYVLNQKR GHLRDLETGF RVTLSVVSDP
TVTGQQSFVI DRGEQVHTLE SAKALLAAQV AAYPVQAEEP FEDDEGYEFE AEIETDETVG
LADEHASDSG EGEGEGDARK RKRRRRRRSR TGEAREAGAP REDGDAALVP EVLEAVSEDD
SEDGDDSADG AEGDGDARAD QANGERRPRR RGRRGGRRRR GAAEGGLEEG ASGSIVDDIG
ELETSEAAEA AADMDGGGSD GRLLGKRHEP DIDDDEMAEP VTATLSPVAS EPEERAEPVQ
AQQSEPVASA PAQDAAPSAQ AQDDAAAERA AARRRSTVRE KVNFGSSEPK VEASTPLAIE
PAAQQPEPAP AIPEPQAAAE PAPEPSRRAG WWSRRFGGGN