Gene P9211_12671 details


Gene Information

Locus tag: P9211_12671
Symbol: sppA
ID: 5731537
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1143119
End bp: 1143928
Gene Length: 810 bp
Protein Length: 269 aa
Translation table: 11
GC content: 42%
IMG OID: 641285636
Product: signal peptide peptidase SppA (protease IV)
Protein accession: YP_001551152
Protein GI: 159903808
COG category: [O] Posttranslational modification, protein turnover, chaperones; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0616] Periplasmic serine proteases (ClpP class)
TIGRFAM ID: [TIGR00706] signal peptide peptidase SppA, 36K type
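
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record; the variable names are illustrative only.
start_bp = 1143119
end_bp = 1143928
gene_length_bp = 810
protein_length_aa = 269

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: one 3 bp codon per residue, plus one terminal stop codon.
expected_gene_length = (protein_length_aa + 1) * 3

print(span, expected_gene_length)  # prints "810 810"
```

Under these assumptions both values agree with the listed gene length of 810 bp.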


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000540358
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGATCTGGC CTTGGCGTCG TAAATCCAAA AAAAGAATGG CAAGAATCAT TATTGATGGA 
GCAATCAATG GTGATACAAG GAAGCTTTTC CTTAAAGCAG TGAAGCAAGT TGAGGAAAGA
GAGTTCCCTG CTTTACTTGT GAGAATTGAT AGTCCAGGAG GGACTGTCGG TGATAGTCAA
GAGATTCATG CAGCTCTTCT CAGACTTAGA GAAAGCGGGT GTCATGTAGT GGCAAGCTTT
GGAAACATAT CTGCTTCTGG AGGAGTATAT GTTGGCGTAG CTGCAGAAAA AATTGTCGCG
AACCCAGGAA CTATAACTGG CTCTATCGGA GTAATACTTC GTGGCAATAA CCTCTCAAAG
CTTCTCGAAA AAATAGGTAT TAAATTTGAG ACCGTCAAAA GTGGTCTTTA TAAAGACATT
CTTTCACCTG ACAGGGCCCT CTCTAAAGAA GAGAGAGAAC TACTTCAATC ACTTATAGAC
AGTAGTTATG GCCAATTTGT AGAAGCAGTT GCGAAAGGGA GAGGTCTAAG TGAAGAAGTG
GTACGTGGCT TTGCAGATGG CAGGGTATTT ACAGGAACTC AAGCCAGAGA GCTTGGCCTA
GTAGATGAAT TAGGAGATGA GAATCATGCA AAGCTTTTGG CTGCAAAGCT TGCAGACCTT
GATGAGAAGT TACAACCAAT TACTCTTGGT CGTCCCAAAA AGAAGTTATT AGGATTACTT
CCAGGAGGAA ATATTCTCAG AAATCTTGTA GAACAAGTAA CTATGGAGCT TTCAAATTCA
GGTCAAATCC TTTGGCTCTT CCGACCATAA
 
Protein sequence
MIWPWRRKSK KRMARIIIDG AINGDTRKLF LKAVKQVEER EFPALLVRID SPGGTVGDSQ 
EIHAALLRLR ESGCHVVASF GNISASGGVY VGVAAEKIVA NPGTITGSIG VILRGNNLSK
LLEKIGIKFE TVKSGLYKDI LSPDRALSKE ERELLQSLID SSYGQFVEAV AKGRGLSEEV
VRGFADGRVF TGTQARELGL VDELGDENHA KLLAAKLADL DEKLQPITLG RPKKKLLGLL
PGGNILRNLV EQVTMELSNS GQILWLFRP
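
The two sequences above can be checked against each other by translating the gene sequence and comparing the result with the listed protein. The sketch below is one way to do that in plain Python, without any bioinformatics library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which start codons are permitted). The helper names `clean` and `translate` are hypothetical and exist only for this illustration.

```python
# Sequences copied from the record, keeping the 10-character block layout.
GENE_TEXT = """
ATGATCTGGC CTTGGCGTCG TAAATCCAAA AAAAGAATGG CAAGAATCAT TATTGATGGA
GCAATCAATG GTGATACAAG GAAGCTTTTC CTTAAAGCAG TGAAGCAAGT TGAGGAAAGA
GAGTTCCCTG CTTTACTTGT GAGAATTGAT AGTCCAGGAG GGACTGTCGG TGATAGTCAA
GAGATTCATG CAGCTCTTCT CAGACTTAGA GAAAGCGGGT GTCATGTAGT GGCAAGCTTT
GGAAACATAT CTGCTTCTGG AGGAGTATAT GTTGGCGTAG CTGCAGAAAA AATTGTCGCG
AACCCAGGAA CTATAACTGG CTCTATCGGA GTAATACTTC GTGGCAATAA CCTCTCAAAG
CTTCTCGAAA AAATAGGTAT TAAATTTGAG ACCGTCAAAA GTGGTCTTTA TAAAGACATT
CTTTCACCTG ACAGGGCCCT CTCTAAAGAA GAGAGAGAAC TACTTCAATC ACTTATAGAC
AGTAGTTATG GCCAATTTGT AGAAGCAGTT GCGAAAGGGA GAGGTCTAAG TGAAGAAGTG
GTACGTGGCT TTGCAGATGG CAGGGTATTT ACAGGAACTC AAGCCAGAGA GCTTGGCCTA
GTAGATGAAT TAGGAGATGA GAATCATGCA AAGCTTTTGG CTGCAAAGCT TGCAGACCTT
GATGAGAAGT TACAACCAAT TACTCTTGGT CGTCCCAAAA AGAAGTTATT AGGATTACTT
CCAGGAGGAA ATATTCTCAG AAATCTTGTA GAACAAGTAA CTATGGAGCT TTCAAATTCA
GGTCAAATCC TTTGGCTCTT CCGACCATAA
"""

PROTEIN_TEXT = """
MIWPWRRKSK KRMARIIIDG AINGDTRKLF LKAVKQVEER EFPALLVRID SPGGTVGDSQ
EIHAALLRLR ESGCHVVASF GNISASGGVY VGVAAEKIVA NPGTITGSIG VILRGNNLSK
LLEKIGIKFE TVKSGLYKDI LSPDRALSKE ERELLQSLID SSYGQFVEAV AKGRGLSEEV
VRGFADGRVF TGTQARELGL VDELGDENHA KLLAAKLADL DEKLQPITLG RPKKKLLGLL
PGGNILRNLV EQVTMELSNS GQILWLFRP
"""


def clean(text):
    """Remove the spaces and line breaks used for display."""
    return "".join(text.split())


# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, starting at position 0."""
    codons = (dna[i:i + 3] for i in range(0, len(dna) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


gene = clean(GENE_TEXT)
protein = clean(PROTEIN_TEXT)
translated = translate(gene)

# Fraction of G and C bases in the gene sequence.
gc_fraction = (gene.count("G") + gene.count("C")) / len(gene)

print(len(gene), len(protein))
print(translated == protein + "*")
print(f"GC fraction: {gc_fraction:.3f}")
```

The comparison appends `*` to the protein because the gene sequence ends with a stop codon that the listed protein sequence omits.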