Gene RPB_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2044 
Symbol 
ID3909859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2323025 
End bp2324176 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content71% 
IMG OID637883937 
Producthypothetical protein 
Protein accessionYP_485662 
Protein GI86749166 
COG category[R] General function prediction only 
COG ID[COG1568] Predicted methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGACC CCGAAATCCT GAAAACCATC GCCGAGGCGA CCCGGCTGCG CGAAGGCCCG 
GCCGGCGTCG AGGCGATCCT GCGTGCGGTG TATCGCTCGG GCTCGCTGCG GCTGCAGGAC
GTCGCCCGCG AGGCCCGGCT GCCGATGCCG ATTGCCACCG CCGTCCGCCG CGAACTGGAG
AAGGCCGGGC TGCTGGAGCG CAAGCAGGGC CTGGCGCTGA GCCCCGAGGG CCGCGACTTC
GTCGAGCGCG AACTCGGGCT CGGCATCACC ATCGACGTCA CCTGCCCGGC CTGCGCCGGC
CATGGCGTGG TGATCCCCGC GGATTTCCAG GCGCAGGTCG GCCGGCTCGC CGCCATCATC
GCGCAGGCGC CATCGGTCGA TGTCACGCTG GATCAGGCGC CGTGCACCCC GGAGACGTCG
CTGCTGCGTG CGCTTCTGAT GCTGCAGGCC GGCGCGCTGG AAGGCCGCCG GGTGCTGCTG
CTCGGCGACG ACGATTCGGT GTCACTGGCG ATCGGCCTCG TCGGCCAGGC GCTGGGCAAG
GCCGACCTCA CCCGCGGCGT GGTGGTGGTC GACGCGGACG AGCGCCGGCT CGCCTTCCTG
CGCGAGAATG CTGCCCGCGA AGGCATCGCG CTGCGCACGC TGCATCACGA TCTGCGCCAG
CCGCTGCCGG CCGAGTTGCA GGGCGCGTTC GACACCATCG AGACCGACCC GCCCTACACG
CTCGAAGGCG CGAAGCTGTT TCTGACGCGC GGCCGCGAGG CGCTGGCCGG CGACGGGCTG
TGCTACTTTT CGTTCGCGCA ATGGCCGCCG CGGCAGATGC TGGCATTGCA GCGGGTGTTT
CTCGATCTCG GCCTCGCGGT GCAGACGATC CGGCCGGGCT TCAACGCCTA TGCGGGCGCC
ACCGTGCTTG GCAATGTCGG GCAACTGATC GAACTCGCCG CCGCGGGCCC GGCCGCCGCC
GCATTGCCGG CGTGGCAGGG ACCGCTGTAC ACCGCCGAGA TCAATCCGCG GATCCGCGCC
TATGTCTGCA CGTCATGCGG CCGCGAGGCC GTCCTGGGGC GCGGCTCGAC GCCGGAGACG
ATCGAGGCGT TGAAGGATCA GGGATGCGCG AATTGCGGCG GGCACAGCTT CCGCCGCAAG
ACCGGCGGCT GA
 
Protein sequence
MADPEILKTI AEATRLREGP AGVEAILRAV YRSGSLRLQD VAREARLPMP IATAVRRELE 
KAGLLERKQG LALSPEGRDF VERELGLGIT IDVTCPACAG HGVVIPADFQ AQVGRLAAII
AQAPSVDVTL DQAPCTPETS LLRALLMLQA GALEGRRVLL LGDDDSVSLA IGLVGQALGK
ADLTRGVVVV DADERRLAFL RENAAREGIA LRTLHHDLRQ PLPAELQGAF DTIETDPPYT
LEGAKLFLTR GREALAGDGL CYFSFAQWPP RQMLALQRVF LDLGLAVQTI RPGFNAYAGA
TVLGNVGQLI ELAAAGPAAA ALPAWQGPLY TAEINPRIRA YVCTSCGREA VLGRGSTPET
IEALKDQGCA NCGGHSFRRK TGG