Gene Information |
Locus tag | P9211_12671 |
Symbol | sppA |
ID | 5731537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1143119 |
End bp | 1143928 |
Gene Length | 810 bp |
Protein Length | 269 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641285636 |
Product | signal peptide peptidase SppA (protease IV) |
Protein accession | YP_001551152 |
Protein GI | 159903808 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
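
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is only an illustrative check, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1143119, 1143928)
print(length)                  # 810
print(protein_length(length))  # 269
```

Both values match the "Gene Length" and "Protein Length" fields in the record.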
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.000540358 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGATCTGGC CTTGGCGTCG TAAATCCAAA AAAAGAATGG CAAGAATCAT TATTGATGGA GCAATCAATG GTGATACAAG GAAGCTTTTC CTTAAAGCAG TGAAGCAAGT TGAGGAAAGA GAGTTCCCTG CTTTACTTGT GAGAATTGAT AGTCCAGGAG GGACTGTCGG TGATAGTCAA GAGATTCATG CAGCTCTTCT CAGACTTAGA GAAAGCGGGT GTCATGTAGT GGCAAGCTTT GGAAACATAT CTGCTTCTGG AGGAGTATAT GTTGGCGTAG CTGCAGAAAA AATTGTCGCG AACCCAGGAA CTATAACTGG CTCTATCGGA GTAATACTTC GTGGCAATAA CCTCTCAAAG CTTCTCGAAA AAATAGGTAT TAAATTTGAG ACCGTCAAAA GTGGTCTTTA TAAAGACATT CTTTCACCTG ACAGGGCCCT CTCTAAAGAA GAGAGAGAAC TACTTCAATC ACTTATAGAC AGTAGTTATG GCCAATTTGT AGAAGCAGTT GCGAAAGGGA GAGGTCTAAG TGAAGAAGTG GTACGTGGCT TTGCAGATGG CAGGGTATTT ACAGGAACTC AAGCCAGAGA GCTTGGCCTA GTAGATGAAT TAGGAGATGA GAATCATGCA AAGCTTTTGG CTGCAAAGCT TGCAGACCTT GATGAGAAGT TACAACCAAT TACTCTTGGT CGTCCCAAAA AGAAGTTATT AGGATTACTT CCAGGAGGAA ATATTCTCAG AAATCTTGTA GAACAAGTAA CTATGGAGCT TTCAAATTCA GGTCAAATCC TTTGGCTCTT CCGACCATAA
|
Protein sequence | MIWPWRRKSK KRMARIIIDG AINGDTRKLF LKAVKQVEER EFPALLVRID SPGGTVGDSQ EIHAALLRLR ESGCHVVASF GNISASGGVY VGVAAEKIVA NPGTITGSIG VILRGNNLSK LLEKIGIKFE TVKSGLYKDI LSPDRALSKE ERELLQSLID SSYGQFVEAV AKGRGLSEEV VRGFADGRVF TGTQARELGL VDELGDENHA KLLAAKLADL DEKLQPITLG RPKKKLLGLL PGGNILRNLV EQVTMELSNS GQILWLFRP
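
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction and a basic CDS sanity check. The helper names (`clean_sequence`, `gc_fraction`, `looks_like_complete_cds`) are hypothetical, and the start-codon test is a simplification: translation table 11 also permits alternative start codons, which this check ignores.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the display whitespace between 10-character blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, stop codon at the end."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence shown in the record.
first_blocks = clean_sequence("ATGATCTGGC CTTGGCGTCG")
print(len(first_blocks))          # 20
print(gc_fraction(first_blocks))  # 0.6
```

Passing the full "Gene sequence" value through `clean_sequence` and `gc_fraction` gives a figure that can be compared against the "GC content" field, and `looks_like_complete_cds` can be applied to the same cleaned string.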