Gene RPB_3959 details

Gene Information

Locus tag: RPB_3959
Symbol:
ID: 3911766
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4519260
End bp: 4521386
Gene Length: 2127 bp
Protein Length: 708 aa
Translation table: 11
GC content: 67%
IMG OID: 637885863
Product: oligopeptidase B
Protein accession: YP_487563
Protein GI: 86751067
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1770] Protease II
TIGRFAM ID:
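The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


gene_length = span_length(4519260, 4521386)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # prints "2127 708"
```

Both values agree with the Gene Length and Protein Length fields reported in the record.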


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACGAG ACCCGCTCAT GACCCTGTCC CCGCCCCGCG ACGCCACGCC TCCCGTCGCG 
CCGCGCCGTC CGCACGCCTT CACCACCCAC GGCATCACCA TCAACGACGA CTATGCCTGG
CTGAAGGACC CGAACTGGCA GGAGGTGCTG CGCGATCCGG CGCTGCTCGA TCCCGATATC
CGCGCCTATC TCGAAGCCGA GAACGGCTAC ACCGACAGCG TGCTCGGCCA CACCGAGGCG
TTGCAGAAGA CGCTGGTTGC GGAAATGCGC GGGCGGATCA AGGAAGACGA TTCCAGCGTG
CCGCAGCCGG ACGGGCCCTA CGCTTACTTG CGCAAATTCC GCGAAGGCGG CCAGCATCCG
CTGTATGGCC GCACGCCGCG CGACGGCGGC GAACTCGACA TCATCCTCGA CGGCGACGAA
CTGGCGAAGG GCACCGAGTA CTTCCAGTTC GGCGGACGGC GGCATTCGCT CGACCACAAG
CTGGAAGCCT GGAGCGCTGA CGCCAAGGGC TCGGAATACT ACACCATCCG CGTGCGCGAC
TGGACGACGA AGCAGGATTT GCCCGACCTC GTCGAGGAGA CCGACGGCGG CGTGGTCTGG
ACCGCGGACT CGAGCGCGTT CTTCTACGTC AAGCTCGACG ACAATCATCG GCCGATGCAG
ATCTGGCTGC ACAAGCTCGG CACCGCGCAG GCCGACGACC TGCTGGTGTA CGAGGAGAAA
GACGCCGGCT GGTTCACCCA CATCCACGAG AGCAGCAGCG GACGGTTCTG CGTGATCGCC
GGCGGCGACC ACGAGACGTC GGAGCAGCGG CTGATCGACG TCTCCGATCC GACCGCGCCG
CCGCGGCTCG TCGCCGCGCG CGAACTCGGC GTGCAATATT CGCTCGCAGA TCGCGGCGAC
GAGCTGTTCA TCCTCACCAA TGCGGACGGC GCGATCGACT TCAAGATCGT CACCGCGCCG
CTGGCTTCGC CTGTGCGCGA CAATTGGCGC GATCTGATCC CGCATCGCGA AGGCATCTAT
ATCATCGACT TCGACCTGTT CTCCGGCCAC ATGATGCGGC TGGAGCGCGC CAACGCGCTG
CCGTCGGTGA CGATCCGCGA TCTCGCGAGC GGCGATGAGC ATGCCATCGC CTTCGACGAG
GCGGCCTACT CGCTGTCCGC CTCCGGCGGC TGGGAATTCG ACACCACGGT GATGCGGTTC
TCCTACTCGT CGATGACGAC GCCGTCGGAA GTCTACGATT ACGACATGGC GACGCGCGAG
CGGAGCTTGC GCAAGCGCCA GGAAATTCCC TCGGGCCAAA ATCCGGCAGA CTACGTCACC
ACCCGGATCA TGGCGAAGGC CGACGACGGC GCCGAGGTGC CGGTGTCGCT GCTGCATCGC
AAGGGGCTCG CGCTCGACGG CGCGGCGCCG CTGCTTCTGT ACGGCTACGG CTCCTACGGC
CACGCGATGC CGGCGGGCTT CTCCGCCAAC GCGCTGTCGC TGGTCGATCG CGGCTTCGTT
TACGCGATCG CGCATATCCG CGGCGGCGCC GACAAGGGCT GGGGCTGGTA TCTCGACGGC
AAGCGCGAGA AGAAGACCAA CTCGTTCGAC GATTTCGCCG CCTGCGCGCA GGCGTTGATC
GACGCCAAGT ACACGTCGGC GAAACGCATT GTCGCCCATG GCGGCAGTGC CGGCGGCATG
CTGATGGGCG CGGTCGCCAA CCGCTCGGGC GAGTTGTTCG CCGGCATCGT CGCCGAAGTG
CCGTTCGTCG ACGTGCTCAA CACCATGCTC GACGATACGC TGCCGCTGAC GCCGCCGGAA
TGGCCTGAAT GGGGAAATCC GATCAGCAGC GAAGCCGACT TCAAAACCAT CCTGTCCTAC
TCGCCCTACG ACAACGTCGC GGCGACGACC TATCCGGCCA TCCTCGCGAT GGGCGGCCTC
ACCGATCCGC GCGTCACCTA TTGGGAGCCG GCGAAATGGG TGGCGCGGCT GCGCGCGACC
ATGACCGGCG GCGGCCCCGT GCTGCTCCGC ATCAACATGG GCGCCGGCCA CGGCGGCGCG
TCCGGCCGGT TCAGCCGGCT CGACGAGGTC GCGATCGTCT ACGCCTTCGC CTTGTGGGCG
GTCGGCCTCG CGGATTCCGC TGCCTGA
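
The gene sequence is printed in blocks of ten bases, so it has to be joined before any per-base statistic such as GC content can be computed. The sketch below shows one way to do that. The helper names are hypothetical, and the sample string is only the first line of the sequence above, so its result is not expected to match the whole-gene GC content field.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence only; paste all lines for the full gene.
first_line = "ATGCAACGAG ACCCGCTCAT GACCCTGTCC CCGCCCCGCG ACGCCACGCC TCCCGTCGCG"
dna = clean_sequence(first_line)
print(len(dna), f"{gc_fraction(dna):.0%}")
```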
 
Protein sequence
MQRDPLMTLS PPRDATPPVA PRRPHAFTTH GITINDDYAW LKDPNWQEVL RDPALLDPDI 
RAYLEAENGY TDSVLGHTEA LQKTLVAEMR GRIKEDDSSV PQPDGPYAYL RKFREGGQHP
LYGRTPRDGG ELDIILDGDE LAKGTEYFQF GGRRHSLDHK LEAWSADAKG SEYYTIRVRD
WTTKQDLPDL VEETDGGVVW TADSSAFFYV KLDDNHRPMQ IWLHKLGTAQ ADDLLVYEEK
DAGWFTHIHE SSSGRFCVIA GGDHETSEQR LIDVSDPTAP PRLVAARELG VQYSLADRGD
ELFILTNADG AIDFKIVTAP LASPVRDNWR DLIPHREGIY IIDFDLFSGH MMRLERANAL
PSVTIRDLAS GDEHAIAFDE AAYSLSASGG WEFDTTVMRF SYSSMTTPSE VYDYDMATRE
RSLRKRQEIP SGQNPADYVT TRIMAKADDG AEVPVSLLHR KGLALDGAAP LLLYGYGSYG
HAMPAGFSAN ALSLVDRGFV YAIAHIRGGA DKGWGWYLDG KREKKTNSFD DFAACAQALI
DAKYTSAKRI VAHGGSAGGM LMGAVANRSG ELFAGIVAEV PFVDVLNTML DDTLPLTPPE
WPEWGNPISS EADFKTILSY SPYDNVAATT YPAILAMGGL TDPRVTYWEP AKWVARLRAT
MTGGGPVLLR INMGAGHGGA SGRFSRLDEV AIVYAFALWA VGLADSAA
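
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library. It assumes the gene sequence is given 5' to 3' on the coding strand. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last codons of the gene sequence listed above.
print(translate("ATGCAACGAG ACCCGCTCAT GACCCTGTCC"))  # prints "MQRDPLMTLS"
print(translate("GTCGGCCTCG CGGATTCCGC TGCCTGA"))     # prints "VGLADSAA"
```

The two outputs match the first and last ten-residue and eight-residue stretches of the protein sequence above.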