Gene Information |
Locus tag | NATL1_16541 |
Symbol | sppA |
ID | 4780448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1347942 |
End bp | 1348751 |
Gene Length | 810 bp |
Protein Length | 269 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640084937 |
Product | signal peptide peptidase SppA (protease IV) |
Protein accession | YP_001015476 |
Protein GI | 124026360 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
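
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions agree with the values listed in this record. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1347942, 1348751)
print(length_bp)                  # 810
print(protein_length(length_bp))  # 269
```

Both results match the Gene Length (810 bp) and Protein Length (269 aa) fields.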
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTTGGC CCTTGAGACG AAAATCCAAA AAGAGGATGG CACGAATCTG TATTGAAGGT CCTATCAATT CAGAAACTCG GAAAATTGTT CTAAAAGCAT TAAAGCAAAT TGAGGAAAGA GAGTTTCCAG CACTATTACT TCGCATTGAT AGCCCTGGGG GAACAGTTGG TGATAGCCAA GAAATCCATA GTGCTCTCTT GAGGCTAAGA GAGAAAGGTT GTCATGTTGT AGCTAGTTTT GGCAATATCT CAGCTTCAGG TGGAGTCTAT ATAGGTGTAG GTGCTGAAAA AATTGTTGCA AACCCAGGAA CAATAACAGG ATCTATTGGT GTAATTTTAA GAGGAAACAA TCTATCAAAG TTATTAGAAA AGGTTGGTAT TAAATTCGAG ACTGTAAAAA GTGGAATCTA TAAAGACATT CTTTCCCCTG ATCGCCCTTT ATCAACTGAG GAGAGAGCTC TTCTACAATC TTTAATTGAT AGCAGTTACG AGCAATTTGT TTTAGCAGTT TCAAAAGGAA GAAATTTAAC CCCAGAAGTG GTTAAGAGTT TTGCCGATGG AAGAGTTTTT ACTGGAGAGC AAGCTAAAGA ATTTGGGTTG GTAGATGAAA TAGGTGATGA GAATGATGCA AAACTACTTG CTATAAAAAT TGCGAACCTA GATGAAAAAA CAAAACCCAT AACATTTGGT AAAACCAAAA AGAAATTATT AGGGTTTTTA CCTGGAGGGA AAATAATCCA CAATCTTGCA AATGCATTAA ACCTTGAGTT GGAGGGGAAT GGACAGATCC TTTGGCTCTT TAAGCCATGA
|
Protein sequence | MIWPLRRKSK KRMARICIEG PINSETRKIV LKALKQIEER EFPALLLRID SPGGTVGDSQ EIHSALLRLR EKGCHVVASF GNISASGGVY IGVGAEKIVA NPGTITGSIG VILRGNNLSK LLEKVGIKFE TVKSGIYKDI LSPDRPLSTE ERALLQSLID SSYEQFVLAV SKGRNLTPEV VKSFADGRVF TGEQAKEFGL VDEIGDENDA KLLAIKIANL DEKTKPITFG KTKKKLLGFL PGGKIIHNLA NALNLELEGN GQILWLFKP
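
The sequences above are printed in space-separated blocks of 10, so they need cleaning before use. The following is a minimal sketch, using only the Python standard library, of how one might strip the spacing, estimate GC content, and translate the gene. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which this sketch does not model). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. the 10-base block spacing) and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGATTTGGC CCTTGAGACG AAAATCCAAA"))  # MIWPLRRKSK
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` allows comparison against the GC content and Protein sequence fields.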