Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_06151 |
Symbol | sppA |
ID | 4777596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 578927 |
End bp | 579739 |
Gene Length | 813 bp |
Protein Length | 270 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640086122 |
Product | signal peptide peptidase SppA (protease IV) |
Protein accession | YP_001016632 |
Protein GI | 124022325 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0720976 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCTGGC TCTGGCGACG CAAATCTCGC CGAAGGATCG CACACATCAG CATTGACGGA GCAATCAGCG GTGCAACGCG CGAGCGGGTG CTGAAAGCGA TCAAGGAAGT TGAAGAGAGA GAGTTTCCCG CTCTTCTGCT GCGCATCGAC AGCCCAGGAG GAACTGTCGG TGACAGTCAG GAAATCCATG CGGCCCTGCT CAGGCTTAGA GAAAAGGGTT GTCATGTCGT AGCAAGCTTC GGCAATGTAT CCGCCTCCGG TGGTGTGTAT GTCGGGGTAG CTGCAGAAAA AATTGTTGCC AATCCCGGCA CAATCACTGG CTCAATCGGG GTCATCTTGC GGGGCAACAA CCTCTCCAAA TTGCTGGAGA GAATCGGCAT CCGCTTTGAG ACAGTAAAAA GCGGCACCTA CAAGGACATT CTCTCGCCTG ATCGCGCACT CACAGCAGAG GAGCGTCAGC TACTGCAATC ACTGATCGAT AGCAGCTATG AGCAATTTGT GAACGCGGTA GCAGAAGGCC GCCATCTCAG TGCCGAAGAG GTGCGCAACT TCGCCGACGG CCGCGTGTTC AGCGGAGCTC AGGCCCATGA ACTCGGACTC ATCGATGAGC TGGGAGATGA AGAACATGCT CGAAAACTTG CGGCCAAGCT GGCGGACCTT GATGAAGCCA ACACCCAAAC GCTCAAACTG GGACGCCCCA AAAAACGGCT AGCGGGATTT CTACCAGGCA GCAAACTCCT ATCCAAACTT GCAGAGCTTC TCAACCTTGA GCTTGGCAAT AGTGGTCAGG TGCTCTGGCT CTTTCTGCCA TGA
|
Protein sequence | MGWLWRRKSR RRIAHISIDG AISGATRERV LKAIKEVEER EFPALLLRID SPGGTVGDSQ EIHAALLRLR EKGCHVVASF GNVSASGGVY VGVAAEKIVA NPGTITGSIG VILRGNNLSK LLERIGIRFE TVKSGTYKDI LSPDRALTAE ERQLLQSLID SSYEQFVNAV AEGRHLSAEE VRNFADGRVF SGAQAHELGL IDELGDEEHA RKLAAKLADL DEANTQTLKL GRPKKRLAGF LPGSKLLSKL AELLNLELGN SGQVLWLFLP
|
| |