Gene P9211_03601 details


Gene Information

Locus tag: P9211_03601
Symbol:
ID: 5731843
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 337571
End bp: 338881
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 41%
IMG OID: 641284709
Product: carboxyl-terminal protease
Protein accession: YP_001550245
Protein GI: 159902901
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
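
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
start_bp = 337571
end_bp = 338881
gene_length_bp = 1311
protein_length_aa = 436


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record: 1311 bp and 436 aa.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```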


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTTCAA CTGTTAAAAC CTTGTCAAAA TTTCTCCACA AAATTCTTTG CGCTTTTCTA 
AGCTTTTGCA TGATTTTTCT AGTCACTGCA AGGCCTCTCT ACGCATTGAG CGACGGTCAA
CAACTAGTAC TAGAGGCCTG GAATATCGTT AACGAAGGGT TTTTAAATCA AGAAAAATTC
AACGAGGTTC AATGGAAACG CCTTAGGAAA AAGGCACTGG AAGAAGAAAT TACGACATCA
ACTGAAGCTT ATAATGCTAT TGAAGGCATG CTTGCCCCAC TCGGAGATCC ATATACAAGA
CTCTTAAGGC CAAAAGATTA CGCAGCAATG AAGGAAAGTA ATCTTGGGAG TGAGATAAAT
GGTGTAGGTC TTCAGTTAGG CGCAAGAAAT ATCGATGGGA AGATTGTTGT AATTTGCCCG
CTTGAAGATT CCCCTGCAGC TGATGCCGAA ATTCTCAGTG GATCAATTCT TATAAAAGTC
GATAACGAAT CACCTCAAAG CCTTGGATTA GAAGCTACAG CAGCGAAGCT AAGAGGAGAG
AGTGGAAGCA AAGTGATTAT TGAATTAGAA ACTCCTGATG GAGAACAGAA AGAAATCAAC
CTTGAACGTC GCAGTGTTGA TTTAAGACCA GTAAGAAGCA AGAGAATACG CAATGAACTT
CATACACTTG GATACTTAAG AATTACTCAA TTTAGTGAAG GAGTGCCAGA TCAAGTCCGC
GAAGCCTTAG CAGAACTAAA AGAGAAAGGT GTAGAAGGTT TAATTTTAGA TTTAAGGAAT
AACTCTGGTG GTCTTGTAAG TTCAGGTCTT GCAGTCGCCG ATGCTTTCTT AAGCAATCAA
CCAGTTGTTG AAACTAAAAA TAGAAATGAA ATTAGTGAAC CAATCCCTTC CAATGAGGGA
ACCTTTTACG ATGGTCCAAT GGTAACTCTT GTAAATGCAG GGACCGCTAG TGCAAGTGAG
ATTCTTGCAG GAGCCCTTCA AGATAATTCA CGCTCAGAAT TGGTCGGCGG CAAAACCTTT
GGGAAAGGTC TAATCCAAAC TCTTACAAAC TTAAGCGATG GGAGCGGATT AGCTGTCACA
GTAGCAAGCT ATTTAACCCC AGCAGGCAGA GATATACAAA ACCTTGGCAT AGAACCAGAT
CGATATTTAG AAGCGCCTGA ACCTCTAAAT CCTGGCAGTA ATGAAGATAG ATGGTTGCAA
GATGCAGAGC TATTTATGGA GGCATTGCTA GACCGTGAAG AAGAAGAAGA AGAACCAATC
CAAACAAATG ATATAAATCC TGAAGAAAAG ATGATAGAAA CAAACACCTA A
 
Protein sequence
MPSTVKTLSK FLHKILCAFL SFCMIFLVTA RPLYALSDGQ QLVLEAWNIV NEGFLNQEKF 
NEVQWKRLRK KALEEEITTS TEAYNAIEGM LAPLGDPYTR LLRPKDYAAM KESNLGSEIN
GVGLQLGARN IDGKIVVICP LEDSPAADAE ILSGSILIKV DNESPQSLGL EATAAKLRGE
SGSKVIIELE TPDGEQKEIN LERRSVDLRP VRSKRIRNEL HTLGYLRITQ FSEGVPDQVR
EALAELKEKG VEGLILDLRN NSGGLVSSGL AVADAFLSNQ PVVETKNRNE ISEPIPSNEG
TFYDGPMVTL VNAGTASASE ILAGALQDNS RSELVGGKTF GKGLIQTLTN LSDGSGLAVT
VASYLTPAGR DIQNLGIEPD RYLEAPEPLN PGSNEDRWLQ DAELFMEALL DREEEEEEPI
QTNDINPEEK MIETNT
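
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they are used. The sketch below is a hedged illustration rather than the database's own method: it strips the spacing, computes GC content as a fraction, and translates a coding sequence using the codon assignments of translation table 11, which are the same as those of the standard code (the two tables differ only in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering.
# The amino-acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at a stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGCCTTCAA CTGTTAAAAC CTTGTCAAAA"))  # MPSTVKTLSK
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to confirm that the two listings agree; `gc_content` on the full sequence can likewise be compared with the GC content field of the record.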