Gene RPB_4476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4476 
Symbol 
ID3912292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp5066110 
End bp5067087 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content67% 
IMG OID637886379 
Productpeptidase U32 
Protein accessionYP_488070 
Protein GI86751574 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACTGA TCTGTCCGGC GGGCACGCCC GCGGCGCTGC ACGATGCCGT CGCGGTGGGG 
GCCGATGCGA TCTATTGCGG CTTCAACGAC GAGACCAATG CGCGCAATTT CCCGGGGCTG
AATTTCAGCC GCGAGGAAAT GCGCGAGTCG ATCGCGCACG CGCATCGCTA CGGCGTCAAC
GTGCTGGTGG CGATTAACAC CTTCGCCCGC GCCGGCAATG TCGAGCTGTG GCAGCGCGCG
GTCGACGACG CGGTCGAGGC CGAAGCCGAT GCGGTGATCC TCGCCGATGT CGGCGTGATG
GATTATTGCG CCAGGACCCA TCCGCAGCAG CGCCTGCACG TCTCGGTGCA GGCCGCCGCG
GCGAACGCGG ATTCGATCCG GTTCTATGTC GACAGCTTCA ACGCCAAGCG CGTGGTGCTG
CCGCGCGTGC TCAGCGTGCA GGAGATCGCC GCGATCACGC GCGAGGTCAA GTGCGAGACC
GAAGTGTTCA TCTTCGGCGG GCTGTGCGTG ATGGAGGAGG GTCGCTGCTC GCTGTCGTCC
TACGCCACCG GCAAATCGCC GAACATGGAC GGGGTCTGCT CGCCGGCGGG CGCGATCCAG
TATCGCGAGG AGAACGGCGC GCTGATCTCG CGGCTCGGCG ATTTCACCAT CAACAAATTC
GCCAAGGGCG AGGCGGCGGC CTATCCGACG CTGTGCAAGG GGCGCTACCA GACCGACGAG
GGCTGCGGCT ATCTGTTCGA GGACCCGGCT TCGCTCGACG CCACCACGAT GCTGCCGGAG
CTGCGCGCCG CCGGCGTCGC GGCGCTGAAG ATCGAGGGCC GCCAGCGCGG CCGCGCCTAT
ATCGAGCGCG TGGTGAAGAC CTTCAAGGAG GTGCTGAGCG CGCTGGACGA CGGAAGGCCG
TTGCCGGTCG ACGCGCTGCG CGGGCTCAGC GAGGGCCAGT CCAACACCAC CGGCGCCTAC
AAGAAGACCT GGCGCTGA
 
Protein sequence
MELICPAGTP AALHDAVAVG ADAIYCGFND ETNARNFPGL NFSREEMRES IAHAHRYGVN 
VLVAINTFAR AGNVELWQRA VDDAVEAEAD AVILADVGVM DYCARTHPQQ RLHVSVQAAA
ANADSIRFYV DSFNAKRVVL PRVLSVQEIA AITREVKCET EVFIFGGLCV MEEGRCSLSS
YATGKSPNMD GVCSPAGAIQ YREENGALIS RLGDFTINKF AKGEAAAYPT LCKGRYQTDE
GCGYLFEDPA SLDATTMLPE LRAAGVAALK IEGRQRGRAY IERVVKTFKE VLSALDDGRP
LPVDALRGLS EGQSNTTGAY KKTWR