Gene RPB_3687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3687 
Symbol 
ID3911489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4227083 
End bp4228078 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content71% 
IMG OID637885589 
Productpeptidase S58, DmpA 
Protein accessionYP_487293 
Protein GI86750797 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3191] L-aminopeptidase/D-esterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGAACC TCATCACCGA CATTGCCGGC GTCCGCGTCG GCCACGCCCA CGACGAACGC 
TTGGCCTCCG GCGTCACCGC GATCCTGTTC GACACGCCGG CCGTCGCCTC GATCGACGTG
CGCGGCGGCG GACCGGGGAT TCGCGACGGC GTGCTGTTGG AGCCGGTCAA TACCGTCGAG
CAGGTCGACG GCTTCACGCT GTCGGGCGGC TCGGCGTTCG GGCTCGATTC CGGCGGCGGC
GTGCAGGCCT GGCTCGCCGA ACAGGGCCGC GGCTTCTCGA TCGGCACCGC GACGATCCCG
ATCGTTCCGG GCGGCGTGAT CTTCGACATG CTCAATGGCG GCGACAAGGC GTGGGGGCAA
TTCTCGCCCT ATCGCGAGCT CGGCTATCAG GCCGCAGCGG CGGCAAGCGA CGCCTTCGCG
CTCGGCAGCG TCGGCGCGGG GCTCGGCGCG ACCACGGCGA ACTACAAAGG CGGGCTCGGC
TCGGCCTCCG CGACGACGCC GGGCGGAATC ATGGTCGGCG CGATCGCGGC GGTGAATGCG
ATCGGCAGCG TCACCATCGG CGACGGCCCG TGGTTCTGGT CGGCGCCTTA CGAAGTCGGC
GATGAGTTCG GCGGCTGCGG CATGCCGCAG CATTTCACCG ACGAGATGCT GGCGGTGAAG
ATCAAGGGCG CCGCCGCCGC GACCGCGGAG AACACCACGC TGGTCGCGGT CGTCACCGAC
GCGCTGCTGA CCAAGACGCA GGTGAAGCGG CTGGCGATGA TGGCGCAGAC CGGGTTTGCG
CGGGCCATCT ATCCGGTGCA TGCGCCGCTC GACGGCGACG TGGTGTTCGC CGCCGCGACC
GGCAACAAGC CGGTCGATCC GCTGGCGGGG CTCACCGAAC TCGGCGCGAT CGCCGCCAAC
ACGGTGGCGC GGGCGATCGC GCGCGGCGTC TACGAGGCGA CCGCGCTGCC GTTCAAGGAC
GCGCAGCCGG CGTGGCGCGA TCAGTTCAGG CGCTGA
 
Protein sequence
MKNLITDIAG VRVGHAHDER LASGVTAILF DTPAVASIDV RGGGPGIRDG VLLEPVNTVE 
QVDGFTLSGG SAFGLDSGGG VQAWLAEQGR GFSIGTATIP IVPGGVIFDM LNGGDKAWGQ
FSPYRELGYQ AAAAASDAFA LGSVGAGLGA TTANYKGGLG SASATTPGGI MVGAIAAVNA
IGSVTIGDGP WFWSAPYEVG DEFGGCGMPQ HFTDEMLAVK IKGAAAATAE NTTLVAVVTD
ALLTKTQVKR LAMMAQTGFA RAIYPVHAPL DGDVVFAAAT GNKPVDPLAG LTELGAIAAN
TVARAIARGV YEATALPFKD AQPAWRDQFR R