Gene Rpal_1930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1930 
Symbol 
ID6409590 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2081465 
End bp2082688 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content66% 
IMG OID642711816 
ProductOsmC family protein 
Protein accessionYP_001990928 
Protein GI192290323 
COG category[E] Amino acid transport and metabolism
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
[COG1765] Predicted redox protein, regulator of disulfide bond formation 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0101124 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCACCG AACGTTTCCA ATTTGCAGGC AGCGGCGGGC ATCAACTCGC GGCGGCGCTC 
GATCTGCCGG ATGCGCAGCC TCTCGCCTAT GCGCTGTTCG CGCATTGCTT CACCTGCAGC
AAGGACAACC TTGCGGCACG GCGGATCGCG GCGGCGCTGG CGGCGTGCGG CATCGCGGTG
CTGCGGTTCG ACTTCACCGG GCTTGGCGCC AGCGAGGGCG AGTTCGAAAA CGCGACGTTT
TCGTCCAACG TCGCCGATCT GGTGCTGGCG GCGGACCATC TGCGCGCGAC GCATCGGGCG
CCGTCACTGC TGATCGGCCA CAGTCTCGGC GGCGCTGCGG TGCTGGCAGC CGCAGCACAG
ATTCCCGAAG CGAAGGCGAT CGCCACCATT GCGGCGCCGT CCGATCCGTC GCACGTCACC
GGACTATTTG CCGATGATAT CGAGACGATC CGCACTGAAG GCCGCGTCAA TGTTTCGCTG
GCCGGCCGCC CGTTTACGAT CAAGCGCGAG TTTCTCGACG ACATCGCCGA ACACAATCTG
ATGGCCGAGA TCGGCAAGCT GCACAAAGCG CTGCTGATCC TGCACGCGCC GACCGACGAC
ACCGTCGGCA TCGACAACGC CACCAAGATC TTTCTCGCGG CCAAACATCC GAAGAGCTTC
GTCTCGCTCG ATCACGCCGA CCATCTGCTG AGCGATCGCC GTGACGCGAA CTACGCCGCG
GGGGTGATCG CCGCCTGGGC GCAGCGCTAC ATCGATGCCG AACCGCCGGC CCCGACCGCC
GGCGCGCCAG AAGTGCCGCG CCTCGTCACC GTGCAGGAAA CCGGCGACGG CAAGTTCCAG
CAGCAGATCA GCGTCGGACC GCATCGGCTG CTCGCCGATG AGCCGGCCAA CGTCGGCGGC
CGCGACAGCG GCCCGGGGCC CTACGACCTA CTGCTGTCCG CGCTCGGCGC CTGCACCTCG
ATGACGATGC GGCTCTATGC CGAACGCAAG GCGCTGCCGC TCGATCGCGT CACGGTGACG
CTGAGCCACG CCAAGATCCA CGCCGAGGAT TGCGCCGAAT GCGAAACCAA GGTCGGGCTA
CTCGACCGGA TCGAGCGGGT GATCGGCATC GAGGGTGACC TCTCCGCCGA GCAGCGCGCC
AAGCTGATCG AAATCGCCGA CAAATGTCCG GTGCACCGCA CCCTCACCTC GGAAGTCAGC
ATCATCACGC GCAGCACCGA TTGA
 
Protein sequence
MPTERFQFAG SGGHQLAAAL DLPDAQPLAY ALFAHCFTCS KDNLAARRIA AALAACGIAV 
LRFDFTGLGA SEGEFENATF SSNVADLVLA ADHLRATHRA PSLLIGHSLG GAAVLAAAAQ
IPEAKAIATI AAPSDPSHVT GLFADDIETI RTEGRVNVSL AGRPFTIKRE FLDDIAEHNL
MAEIGKLHKA LLILHAPTDD TVGIDNATKI FLAAKHPKSF VSLDHADHLL SDRRDANYAA
GVIAAWAQRY IDAEPPAPTA GAPEVPRLVT VQETGDGKFQ QQISVGPHRL LADEPANVGG
RDSGPGPYDL LLSALGACTS MTMRLYAERK ALPLDRVTVT LSHAKIHAED CAECETKVGL
LDRIERVIGI EGDLSAEQRA KLIEIADKCP VHRTLTSEVS IITRSTD