Gene RPB_4179 details

Gene Information

Locus tag: RPB_4179
Symbol:
ID: 3911987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4751319
End bp: 4752749
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 67%
IMG OID: 637886083
Product: peptidase M16-like
Protein accession: YP_487782
Protein GI: 86751286
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
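
The length and composition fields above are related by simple arithmetic. The following is a minimal Python sketch of how they could be derived. The function names are hypothetical, and inclusive 1-based coordinates are an assumption (one that agrees with the reported 1431 bp).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace in the sequence is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


length = gene_length(4751319, 4752749)
print(length)                  # 1431
print(protein_length(length))  # 476
```

Passing the gene sequence listed under Sequence to gc_percent gives a value that can be compared with the reported GC content of 67%.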


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.567421
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGCCC GATCCGATCT CTCCTACCGG GATGATTGCC TCGACATGAC CGTATCCGCC 
TCTCGTCCAC GCGCCGCGCT CGCCATCCTC GCGACGGCGC TGATGCTCGC AGGCCCGGCC
GCCGCCCAAA CCGCCCCCGC CAATCCACCC GCGACCTTCA CGCTCGCCAA CGGCCTCGAC
GTTGTGGTGA TTCCGGATCG TCGCACCCCG GTGGTGACGC AGATGATCTG GTACAAGGTC
GGCTCCGCCG ACGAGACGCC GGGCAAATCC GGACTCGCGC ATTTTCTCGA ACATCTGATG
TTCAAGGGCA CAGCCAAGCA TCCGGCCGGC GAGTTCTCGC AGACCGTGCT GAAGGTCGGC
GGCAACGAGA ACGCGTTCAC CTCGCTCGAC TACACCGGCT ACTATCAGCG CGTGCCGCGC
GATCAACTCG ACAAGATGAT GGCGTTCGAG GCCGACCGCA TGACCGGCCT GGTGCTGAAG
GACGAGAACG TGCTGCCGGA GCGCGACGTC GTGCTCGAGG AATACAACAT GCGCGTCGCC
AATAATCCCG ACGCGCGGCT GACCGAACAG ATCATGGCGG CGCTGTATCT CAATCACCCC
TATGGCCGCC CTGTGATCGG CTGGCTGCAG GAGATCCAGA AGCTCGACCG CGAGGACGCG
CTGGCGTTCT ATCGCCGCTT CTACGCGCCC AACAACGCCA CGCTGGTGAT CGCCGGCGAC
GTCGATGCCG AGGCGATCCG GCCGGCGATC GAGCGCACCT ACGGCGCGGT GCCGGCGCAG
CCGGCGATTG CGCCGCAGCG GGTGCGGCCG CAGGAGCCGG CGCCGGCCGG GCCGCGCACC
GTGACGCTGG CCGATCCGCG GGTCGAGCAG CCGAGCGTGC GGCGCTATTA TCTGGTGCCG
TCGGCCCACA CCGCGGCGAA GGGCGACAGC CCCGCGCTCG AAGTGCTGGC GCAATTGCTC
GGCGGCGGCA GCAACTCCTA TCTCTATCGC GCGCTGGTGA TCGACCGGCC GCTCGCGATC
AATGTCGGCG CCAACTATCA GGGCACCGCG CTCGACGACA CCCACTTCAT CGTCGCGGCG
ACACCGAAGC CGGGCGTCGA ATTCTCCGAG ATCGAGAAGG CGATCGACAA TGTGATCGCC
GACATCGTGC GCAATCCTGT TCGGTCCGAG GATCTCGAGC GGGTGAAGAC GCAGCTGATC
GCCCAGTCGA TCTACGCGCA GGACAACCAG ACCACGCTGG CGCGCTGGTA CGGCGCGGCG
CTGACCGCCG GTCTCACCGT GCAGGACATC CAGAGCTGGC CGCAGCGGAT CCGCGCCGTC
ACCTCGGATC AGGTCCGCGC CGTGGCGCAG CAATTCCTCG ACCGCAACCG CTCGGTGACC
GGCTATCTGG TCAAGGGCAC GCTGCCAAAG CCCGAGGAGA AACGCTCGTG A
 
Protein sequence
MPARSDLSYR DDCLDMTVSA SRPRAALAIL ATALMLAGPA AAQTAPANPP ATFTLANGLD 
VVVIPDRRTP VVTQMIWYKV GSADETPGKS GLAHFLEHLM FKGTAKHPAG EFSQTVLKVG
GNENAFTSLD YTGYYQRVPR DQLDKMMAFE ADRMTGLVLK DENVLPERDV VLEEYNMRVA
NNPDARLTEQ IMAALYLNHP YGRPVIGWLQ EIQKLDREDA LAFYRRFYAP NNATLVIAGD
VDAEAIRPAI ERTYGAVPAQ PAIAPQRVRP QEPAPAGPRT VTLADPRVEQ PSVRRYYLVP
SAHTAAKGDS PALEVLAQLL GGGSNSYLYR ALVIDRPLAI NVGANYQGTA LDDTHFIVAA
TPKPGVEFSE IEKAIDNVIA DIVRNPVRSE DLERVKTQLI AQSIYAQDNQ TTLARWYGAA
LTAGLTVQDI QSWPQRIRAV TSDQVRAVAQ QFLDRNRSVT GYLVKGTLPK PEEKRS
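
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a hedged Python sketch of that translation, not the database's own method. It assumes the codon assignments of table 11 match the standard genetic code and does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that simplification does not matter here). The names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, as listed in the record.
first_line = "ATGCCGGCCC GATCCGATCT CTCCTACCGG GATGATTGCC TCGACATGAC CGTATCCGCC"
print(translate(first_line))  # MPARSDLSYRDDCLDMTVSA
```

The printed residues match the first 20 residues of the protein sequence above. Translating the full 1431 bp sequence the same way can be used to check the remaining residues against the listing.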