Gene RPB_3029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3029 
Symbol 
ID3910828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3452086 
End bp3453441 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content69% 
IMG OID637884935 
Productagarase 
Protein accessionYP_486642 
Protein GI86750146 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.933436 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACA TTCCGAGAAC CCGCTGGGGC GGCCTCGCCG AAGACAAGGC TGCGGCCACC 
GGCTTTTTCC GCGTCGCGCA GATCGACGGC GTGTGGTGGT TCATCGATCC GGACGGCGGC
CGCTTCCTGT CGAAGGGCGT CACCGCGGTG AATTTCGACC ACGACAGTAT CAAAGGCACC
GAGCGTCACC CCTATCGCGA GGCGAGCCTG CACAAATACG GCAGCCGCAA CGCCTGGCGC
AGCGCCGTCG CCGATCGCCT GCACCGCTGG GGCTTCAACA CGATCGGGGC GTGGTCGGAG
CCGGAGGTGG CATCGGCCGG CTGCGCCCCG CTGGCCTCGG CCGCCGGCGT GGTCTATCTC
GCCACCGCCT ACAGTGACGG CCGCGGCTGG CCGCAATCCG ATCCGTTCGC TCCGGCCTTC
GAGACCTTCG CGCAGCAACG CGCCCGCGAG ATCTGCGCGC CGCGGCGTGA CGATCCGAGC
GTGCTCGGCT GGTTCATCGA CAACGAGTTG CAATGGGGCC CGGACTGGCG CGGCGAGAAC
GAACTGCTGC CGGTGATCCT GCGCGACAAC GCGGCGCCGC ATTCACGCCA GGTAGCGGTC
GACCTGCTGC GCCGCCGCTA CGCCAGCGTC GCAGAGTTCA ACACAGCGTG GCGATGCTCT
GCATCATCGT GGGACGCGCT GGCGACCGTG CCGATCGCGG CCCCGCCCTT CACCCGCAAT
TTCTTCACAC ATGATCACGC GCAGGAACGC GATCCGTTGC GCGGGCGTTA CTTCGCCGAT
TGCGATGCCT TCGCCGGGCT GCTCGCCGAG CGCTACATGG CGGTGAGCGC CGCGGCGATC
CGCGCGGCCG CGCCGCATCA TCTCGTGCTC GGCAGCCGCT TCGCCTACGC GCCGCAGCCG
CAGGTCATCG CTGCCGCCGG CCGGCATTGC GACGTCATCA GCATCAATTG CTACGACGCT
TTGCCCGACG CAGTGATCGA CGCCTATGCC GAATGCGGCC GGCCCTGCCT GATCGGCGAG
TTCTCGTTCC GCGGCGACGA CGCCGGGTTG CCGAACACGC AAGGCGCCGG GCCGCGCGTC
GAGACGCAGG CGGATCGCGC CGCAGGCTTT GCGCGCTATG TCGGTGCCGG CCTGCGCCAC
CCGAACCTGA TCGGCTATCA CTGGTTCCTG CACGCCGATC AGCCGGCGGA AGGCCGCTGG
GACGGCGAGA ATTCCAACTA CGGCGTCGTC ACCATCGACG ATGAGGTTTA TGTCGAACTG
ACCGAGGCGA TGACGGTGGT CAATGACGAC GCCGAATGGC TGCACGCCGG CGCAGCGCAG
GTCCGGCGAC ACATCGCAAC GCCGTCGGCG GCCTGA
 
Protein sequence
MTDIPRTRWG GLAEDKAAAT GFFRVAQIDG VWWFIDPDGG RFLSKGVTAV NFDHDSIKGT 
ERHPYREASL HKYGSRNAWR SAVADRLHRW GFNTIGAWSE PEVASAGCAP LASAAGVVYL
ATAYSDGRGW PQSDPFAPAF ETFAQQRARE ICAPRRDDPS VLGWFIDNEL QWGPDWRGEN
ELLPVILRDN AAPHSRQVAV DLLRRRYASV AEFNTAWRCS ASSWDALATV PIAAPPFTRN
FFTHDHAQER DPLRGRYFAD CDAFAGLLAE RYMAVSAAAI RAAAPHHLVL GSRFAYAPQP
QVIAAAGRHC DVISINCYDA LPDAVIDAYA ECGRPCLIGE FSFRGDDAGL PNTQGAGPRV
ETQADRAAGF ARYVGAGLRH PNLIGYHWFL HADQPAEGRW DGENSNYGVV TIDDEVYVEL
TEAMTVVNDD AEWLHAGAAQ VRRHIATPSA A