Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_13811 |
Symbol | sppA |
ID | 4912298 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 1150679 |
End bp | 1151488 |
Gene Length | 810 bp |
Protein Length | 269 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640160971 |
Product | signal peptide peptidase SppA (protease IV) |
Protein accession | YP_001091605 |
Protein GI | 126696719 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.99906 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTGGC CTTTTAGACG AAAGTCAAAA AAAAGAATGG CTCGCATAGT TATAGATGAG CCTATTACAA GTTCAACAAG AGTTTCGGTC CTTAAAGCTC TTAAACAAAT AGAGGATAGA GAATTTCCTG CTTTAATCGT GAGAATTGAT TCACCAGGTG GTACTGTCGG AGATAGCCAA GAAATATATT CTGCCATTAA AAGACTAAAA AATAAAGGAT GTAAAGTCAT TGCTAGCTTT GGTAATATTT CAGCATCTGG AGGTGTTTAC ATTGGTGTTG CATCTGACAA GATAGTTGCG AATCCAGGAA CAATTACAGG TTCTATTGGT GTGATTATAA GAGGAAATAA TTTATCTGAA TTATTAGATA AAGTTGGTAT TAAATTTGAA ACCGTTAAAA GCGGAGTATT TAAAGATATA CTTTCTCCTG ATAAGCCTTT AAGCGAGGAA GGGAGGAGAT TGCTTCAAGG CCTAATTGAT GAAAGCTACA AACAATTTAC TGAAGCTGTT GCTGATGGAA GAAATTTACC TGTTGAAGAA GTAAGAAAAT TTGCTGATGG GAGGATTTTT ACTGGCACTC AAGCGAAAGA ATTAGGATTA GTTGATGAGG TTGGAGATGA ATTTGTTGCA AGGGAACTAG CTGCAGAGAT GGTCAATATT GACCCTAAAA TTCAACCCCT AACATTTGGT AAGAAAAAGA AAAAAATACT TGGATTAATT CCTGGAAGTA AAATGATTGA GAAAATCATC AATAATATCT TTTTTGAGTT TGACTCATCT AATAAAGTAC TTTGGTTATA CAAGCCTTAA
|
Protein sequence | MIWPFRRKSK KRMARIVIDE PITSSTRVSV LKALKQIEDR EFPALIVRID SPGGTVGDSQ EIYSAIKRLK NKGCKVIASF GNISASGGVY IGVASDKIVA NPGTITGSIG VIIRGNNLSE LLDKVGIKFE TVKSGVFKDI LSPDKPLSEE GRRLLQGLID ESYKQFTEAV ADGRNLPVEE VRKFADGRIF TGTQAKELGL VDEVGDEFVA RELAAEMVNI DPKIQPLTFG KKKKKILGLI PGSKMIEKII NNIFFEFDSS NKVLWLYKP
|
| |