Gene RPB_2249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2249 
Symbol 
ID3909032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2594943 
End bp2596109 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content69% 
IMG OID637884144 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_485865 
Protein GI86749369 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.637254 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCCAT CCGTCCCGCC GACCAGGCCT CCGCCCGATC CGCAGATCCG CCGGGCGCAG 
CGCACTGACC TGCTGCTCCG AGCCGCCGTG ATCTGGCTGT TGTTGCTGGC AACGGCCTGG
GTCGCGCAGC CTTATCTCTC GTCGTTGTGG TTCTCGGTGT CCGGGCCCCG GACCGTCACC
GCCCGGGGTG ATCTGGCCCC GGCGGAGACC GCGACCATCG AGCTGTTCAA ACGGGTGTCG
CCGTCGGTGG TGCACGTCTT TGCCCAGTCG AGCCGGCGCT CGCCGTCGCT GTTCGAGCAG
CAACAGGAGG GCGGGGTGCA GTCCGGCTCC GGTGTGATCT GGGACGCCGC GGGCCACGTC
ATCACCAACA ACCACGTCAT CCAGGGCGCA ACTGCACTGG GCGCGCGGCT GTCGACCGGC
GAGTTCGTCA CCGCGCGCGT GGTCGGCACC GCGCCGAACT ACGACCTCGC GGTGCTGCAG
CTCGAGCGAC CGCGCGCCGA GCTGCGCCCG ATTGCGATCG GCAGTTCGTC GGACCTGCAG
GTCGGGCAGT CGGCGTTCGC CATCGGCAGC CCCTATGGTC TGGAGCAGAC GCTGACCACC
GGCATCGTCA GCGCGCTGCA GCGGCGGCTA CCGACCGCGG CTGCCCATGA AGTCAGTGGC
GTGATCCAGA CCGATGCGGC GATCAATCCC GGCAATTCGG GCGGCCCCTT GCTCGACAGC
GCCGGCCGCT TGATCGGCTT GAACACCGCA ATCATTTCCG GATCGGGCGC TTCGGCCGGA
ATCGGCTTCG CGATCCCGGT CGATTCCGTC AATCGGATCG CCACCGCGCT GATCAAGACC
GGCACCGTGC CGGTGCCCGG CATCGGCATC ATCGCCGCCG ACGAGAACGA GGCCGCCCGC
CTCGGCATCG ACGGCGTCGT CGTCGTGCGT ACGCTGCCGG GCTCCCCGGC GGCACGCGCG
GGGCTCACCG GCGCCAGCGA GACCGGCATG GTCGAGGACG TCATCGTCGG TGCCAACGGC
CAGGAGATCC ACAGCATGTC GGATCTCGCC GCGACGCTCG AACGCGTCGG CATCGGCAAC
GAGGTCAAGC TGCAGGTGAT TCGCGACGGC CGCGCCCGGA CCATCAACGT CGAGGTCACC
GATATTGCCC GATTGCGGCG CAGCTAG
 
Protein sequence
MPPSVPPTRP PPDPQIRRAQ RTDLLLRAAV IWLLLLATAW VAQPYLSSLW FSVSGPRTVT 
ARGDLAPAET ATIELFKRVS PSVVHVFAQS SRRSPSLFEQ QQEGGVQSGS GVIWDAAGHV
ITNNHVIQGA TALGARLSTG EFVTARVVGT APNYDLAVLQ LERPRAELRP IAIGSSSDLQ
VGQSAFAIGS PYGLEQTLTT GIVSALQRRL PTAAAHEVSG VIQTDAAINP GNSGGPLLDS
AGRLIGLNTA IISGSGASAG IGFAIPVDSV NRIATALIKT GTVPVPGIGI IAADENEAAR
LGIDGVVVVR TLPGSPAARA GLTGASETGM VEDVIVGANG QEIHSMSDLA ATLERVGIGN
EVKLQVIRDG RARTINVEVT DIARLRRS