Gene P9303_14351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_14351 
Symbol 
ID4778933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1229884 
End bp1231245 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content46% 
IMG OID640086944 
Productcarboxyl-terminal processing protease 
Protein accessionYP_001017446 
Protein GI124023139 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTATCT CTAGAAGTCT TCGCTCATTA GGCCGCCAGC GCAGTTCATG GCTGCTCCTT 
CTGGGCGCGG GAGGAGCAGC TACTGCAATT GCACTTGCCT CTCCAAGTTT GGGTCTGCCA
AGCAGTTCTT CTTCAGCTAT GAGCGATAGC CCCAAAGAGG TGATTGATCA GGTCTGGCAG
ATTGTATATC GTGACTATCT TGATTCCACA GGTCAATATA ACCCTGAAGA CTGGAAGGGA
CTACGCAAGG ACTTATTAGC CAAGAACTAT TCAGCCACTT CAGAATCATA TGAAGCGATC
CGTGGCATGT TAGCGAGCTT GGATGATCCT TATACACGAT TCTTAGATCC AAAGGAATTT
AAGGAAATGC AGATCGACAC CTCAGGAGAA TTAACAGGTG TTGGCATCCA GCTTTCTCTA
GACAAGGATA CAAAAGAATT AGTTGTGGTT TCACCAATCG AAGGAACTCC TGCCTCTAAG
GCTGGAGTCC AGCCTAAGGA TGTGATCGTT TTCATTAATG GACAATCCAC TAAGGGCATG
TCTACAGCGG ATGCTGTTAA GTTGATTCGT GGTAAAGAAG GTAGTGAGGT CACCCTTGGT
TTGCGTCGCA AGGGTGACGT TATCCAGGTC CCGTTGATAC GTGCACGGAT TGAGATACAG
GCAGTAGATA TACAGTTAAA TACGACAGTT GATGGTACAA AGATTGGTTA TATCCGCCTC
AAACAGTTTA ATGCCCATGC AGCTAAAGGT ATGCGAAGTG CTATCAAGAA TTTAGAAAAA
GATGGTGTTC AAGGCTATGT ACTCGACTTG CGTAGTAATC CTGGAGGGTT GCTTGAGGCC
AGTGTTGATA TCGCACGCCA ATGGCTTGAT GAAGGTACAA TTGTTCGAAC AAAAACTCGT
GATGGCATTC AAGATGTACG CCGAGCGAAT GGTAGTGCCT TAACCAAACT ACCTGTTGTG
GTTTTGGTGA ATGAAGGCTC AGCAAGTGCA AGTGAGATCC TCTCAGGAGC ACTACAGGAC
AATGATCGTG GTGTACTTGT AGGACAGAAG ACCTTTGGCA AGGGCTTGGT TCAATCTGTT
CGAGGTCTTT CTGACGGCTC TGGTCTTACT GTCACGATTG CTAAGTATCT CACCCCAAGC
GGAACGGACA TTCATAAAAA TGGCATCAAG CCAGACATTA AGGCCGTGAT GTCCGAAAAA
GAAATCAATA ATCTTAAGCT TGAGGATCTT GGGTCTGGGA AGGATAGTCA ATATAAGGTG
GCCGAAACAA CACTAATTAA AGCCCTTAAA AAAGTACTTG ATGGGCCATC TTATAAGCCT
GTGGGAATTA ACCTTCCCCA GGCTATTCCG TCAACACTAT AA
 
Protein sequence
MPISRSLRSL GRQRSSWLLL LGAGGAATAI ALASPSLGLP SSSSSAMSDS PKEVIDQVWQ 
IVYRDYLDST GQYNPEDWKG LRKDLLAKNY SATSESYEAI RGMLASLDDP YTRFLDPKEF
KEMQIDTSGE LTGVGIQLSL DKDTKELVVV SPIEGTPASK AGVQPKDVIV FINGQSTKGM
STADAVKLIR GKEGSEVTLG LRRKGDVIQV PLIRARIEIQ AVDIQLNTTV DGTKIGYIRL
KQFNAHAAKG MRSAIKNLEK DGVQGYVLDL RSNPGGLLEA SVDIARQWLD EGTIVRTKTR
DGIQDVRRAN GSALTKLPVV VLVNEGSASA SEILSGALQD NDRGVLVGQK TFGKGLVQSV
RGLSDGSGLT VTIAKYLTPS GTDIHKNGIK PDIKAVMSEK EINNLKLEDL GSGKDSQYKV
AETTLIKALK KVLDGPSYKP VGINLPQAIP STL