Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_13731 |
Symbol | sppA |
ID | 4718093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 1142422 |
End bp | 1143231 |
Gene Length | 810 bp |
Protein Length | 269 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640079093 |
Product | signal peptide peptidase SppA (protease IV) |
Protein accession | YP_001009764 |
Protein GI | 123968906 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTGGC CTTTTAGACG AAAGTCAAAA AAAAGAATGG CTCGTATAGT AATTGATGAG CCTATTACAA GTTCAACAAG AGTTTCTTTC CTTAAAGCAC TTAAACAAGT TGAGGATAGA GAATTTCCTG CTTTAATCGT GAGAATCGAT TCTCCCGGGG GCACTGTTGG TGATAGCCAA GAAATATACT CTGCTATTAA AAGACTAAAA GATAAAGGAT GTAAAGTCAT TGCTAGTTTT GGGAACATCT CAGCATCAGG AGGTGTTTAC ATTGGTGTTG CATCTGACAA AATAGTTGCG AATCCAGGCA CAATTACAGG GTCTATTGGT GTGATTATAA GAGGAAATAA TTTATCTGAA TTATTAGATA AGATCGGCAT TAAATTTGAG ACTGTTAAAA GTGGTGTATT TAAAGATATA CTTTCTCCAG ATAAACCTCT AAGTGAGGAA GGAAGAGGTC TACTTCAAGG CTTAATAGAT GAAAGTTACA AACAATTTAC TGAAGCTGTT GCTGAAGGAA GAAATTTACC TGTTGAAGAA GTAAGAAAAT TTGCTGATGG AAGAATTTTC ACTGGTACTC AAGCGAAAGA ATTAGGACTA GTTGATAAGA TTGGAGATGA ATTTGTTGCT AGGGAACTTG CAGCAGAAAT GGTTAATATT GATCCTAAGA TTCAGCCCTT GACATTTGGG AAGAAAAAGA AAAAAATACT TGGGCTAATT CCTGGGAGTA GAGTGATTGA GAAAATTATT AAAAATATCT TTTTTGAGTT TGACTCGTCA AATAAAGTAC TTTGGTTATA CAAACCTTAA
|
Protein sequence | MIWPFRRKSK KRMARIVIDE PITSSTRVSF LKALKQVEDR EFPALIVRID SPGGTVGDSQ EIYSAIKRLK DKGCKVIASF GNISASGGVY IGVASDKIVA NPGTITGSIG VIIRGNNLSE LLDKIGIKFE TVKSGVFKDI LSPDKPLSEE GRGLLQGLID ESYKQFTEAV AEGRNLPVEE VRKFADGRIF TGTQAKELGL VDKIGDEFVA RELAAEMVNI DPKIQPLTFG KKKKKILGLI PGSRVIEKII KNIFFEFDSS NKVLWLYKP
|
| |