Gene RPB_3633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3633 
Symbol 
ID3911435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4169338 
End bp4170558 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content67% 
IMG OID637885535 
ProductOsmC-like protein 
Protein accessionYP_487239 
Protein GI86750743 
COG category[E] Amino acid transport and metabolism
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
[COG1765] Predicted redox protein, regulator of disulfide bond formation 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.525721 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATCG AACGCTTTGA ATTCCCCGGC AGCGGCGGAC ATCGACTCGC GGCTGCGCTG 
GAACTGCCGG GCTCGGCGCC GCTCGCCTTC GCGCTGTTTG CGCATTGTTT CACGTGCGGC
AAAGACAATC TGGCCGCGCG GCGGATCGCG GCGGGGCTGG CGGCGCGCGG CATCGCGGTG
CTGCGGTTCG ACTTCACCGG GCTCGGCGCC AGCGAGGGCG ACTTCGCCAA TGCGACGTTC
TCGTCAAACG TCGCCGATCT GGTTCTCGCC GCCGATCATC TGCGCAAGGT CCATCGGGCG
CCGTCGCTGC TGATCGGCCA CAGCCTCGGC GGCGCCGCGG TGCTGGCGGC CGCAGCGCAG
ATCCCCGAAG CGAAGGCGAT CGCGACTATC GCCGCGCCGT CGGATCCATC GCATGTCGCC
GGCCTGTTCG CCGAGCATGT CGATGCGATC CGCGAACAGG GCAGCGTCGA GGTCTCGCTC
GCCGGCCGAC CGTTCACGAT CAAGCGCGAA TTCCTCGACG ACGCCGGCGA ACACAATCTG
ATGGCGCAGG TGACCAAGCT GCGCAAGGCG CTGCTGGTGA TGCACGCACC GACCGATGCC
ACCGTCAATA TCGACAACGC CACCCGGATC TTTCTGGCTG CGCGGCATCC CAAGAGCTTC
GTCTCGCTCG ACCATGCCGA TCATCTCCTG AGCGACCGCC GCGATGCGAA CTACGCGGCC
GATGTGATCG CCGCCTGGGC GGAGCGCTAT CTCGACGCCC GGCAACCCGC CGCCGCCGGT
GCGCCGGAGG TGCTGCGCGC CGTCATTGTG CAGGAAACCG GCGAAAGCAA ATTCCAGCAG
CGGATCAGCG TCGGACCGCA TCAGTTGCTC GCCGACGAGC CGGTCGCGGT CGGTGGCGCG
GATTCCGGGC TCGGCCCGTA CGATCTGTTG CTTTCGGCGC TCGGCGCCTG CACCTCGATG
ACGATGCGGC TCTATGCCGA ACGCAAGAAG CTGCCGCTCG ACCGCGTGAC CGTGACGCTG
AGCCACGCCA AGATCCACGC CGAGGACTGC GTCGAATGCG AGACCAAGGT CGGCCTGCTC
GACAGGATCG ACCGCGTGAT CGCGATCGAC GGCGATCTCG ACACCGATCA GCGCGCCCGA
CTGATCGAGA TCGCGGACAA ATGCCCGGTG CATCGCACCC TGACCTCGGA AGTGAAGATC
GTCACCCGCG CGGCGGAGTG A
 
Protein sequence
MPIERFEFPG SGGHRLAAAL ELPGSAPLAF ALFAHCFTCG KDNLAARRIA AGLAARGIAV 
LRFDFTGLGA SEGDFANATF SSNVADLVLA ADHLRKVHRA PSLLIGHSLG GAAVLAAAAQ
IPEAKAIATI AAPSDPSHVA GLFAEHVDAI REQGSVEVSL AGRPFTIKRE FLDDAGEHNL
MAQVTKLRKA LLVMHAPTDA TVNIDNATRI FLAARHPKSF VSLDHADHLL SDRRDANYAA
DVIAAWAERY LDARQPAAAG APEVLRAVIV QETGESKFQQ RISVGPHQLL ADEPVAVGGA
DSGLGPYDLL LSALGACTSM TMRLYAERKK LPLDRVTVTL SHAKIHAEDC VECETKVGLL
DRIDRVIAID GDLDTDQRAR LIEIADKCPV HRTLTSEVKI VTRAAE