Gene Information |
Locus tag | Sbal223_2051 |
Symbol | |
ID | 7088344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | - |
Start bp | 2425043 |
End bp | 2426890 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643460954 |
Product | signal peptide peptidase SppA, 67K type |
Protein accession | YP_002357978 |
Protein GI | 217973227 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00705] signal peptide peptidase SppA, 67K type [TIGR00706] signal peptide peptidase SppA, 36K type |
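The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS ends
    with exactly one stop codon that is not translated.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # subtract the terminal stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(2425043, 2426890)
print(gene_len, protein_len)  # 1848 615
```

Under these assumptions the result agrees with the reported Gene Length (1848 bp) and Protein Length (615 aa).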
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00884416 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.0000000000873335 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCGCTA ACCCATCGTT TTTCAAGAGA ATATGTTTGT TTATTTGGCA CACCCTCAAC GGGACGCGTA AACTAATCCT CAATCTGATC TTTTTTGGCA TCTTAGCCGC CATTATCATC TCCATTGGCA GCAGCGAAGA TATCCAAGTA GAAGACAACT CGGCGCTGGT GCTTAATCTG GCCGGCTCGA TTGTCGATCA GAAACAACAG GTCGATCCCT TAGAAGCCGC ATTCAAGCAA GGCCAAAATG CCAACGCCGA TGGGGAAATC CTGCTAGCCG ATGTGCTGTA TGTGATTGAC AATGCCGCGC AAGATCAAAG GATTAGTACC CTAGTACTTG ATCTAGCCGA TTTGAAACGT GCCGGGATCA GCAAGCTGCA ATCTATCGGT GATGCACTTA ACCGCTTTAA AGAAAGCGGT AAAAAAGTGG TCGCCATCGG CAATTACTAC GAGCAAAATC AATATTTCCT CGCCAGCTTT GCCGACACCA TTTACCTCAA TCCCCAAGGC GGCGTGTCGC TTGATGGCCT GAGTATGTAC AACCAGTACT TCAAATCGGC CTTAGATAAA CTCAAAGTTA AAGCGCATAT TTTCCGCGTA GGGACTTTTA AATCTGCGGT TGAACCCTAC ATGCGTGACG ATATGTCCGA CGCCGCTCGC GAAGCCAGCA GTGCGCTTTT AGCGGATGTA TGGCAAAGCT ACACCCAGAC TGTCGCAGGC AATCGCAATA TTGAGCCTAA CTCGCTGGTG CCCGATGCAA CCACTTATTT GGCCGAGCTC GATAAAGCCA ATGGTGACTC TGCGGCGATG GCCATCAACA TGAAATGGGT TGATAGCTTA GCGACCACGG AAGATTTTCG TCAAACCATG TTAGAAACCG TGGGCAAGGC CAGCAGCGGT GACAGTTTCA AGCAAGTGAG TTTTTATGAT TATTTAACCT TAGTGACGCC ATTGCCAAGC TTTGTTGAGC AAGACAGTGT CGGCATAATT GTGGCGAGCG GTACGATTCT GAACGGCACT CAACCTGCGG GCCAAATCGG CGGCGAAAGC ACTGCCGAGC TGCTACGTAA GGCACGTTTC GATAAACACG TTAAAGCCTT AGTGCTGCGC GTCGATAGCC CTGGCGGTAG CGCCTTTGCC TCGGAGCAAA TTCGTCAAGA ATTACTGGCA CTTAAAGCGG CTGGCAAACC TGTTGTCGTC AGTATGGGCA GTTTAGCCGC ATCGGGTGGT TATTGGATTT CAGCCAGTGC CGACTATATC TTCGCAACGC CAACCACGCT TACGGGCTCA ATTGGGATCT TCGGCATGAT CACCACCTTC GAAGATTCTC TGGCCAGTAT AGGCGTGCAT ACCGATGGCG TATCAACGTC GGAATGGGCG GGATTGTCTG TGACACGCAC GCTGTCACCG CAAGTTGAAG CGATTATCCA ACGTCATATT GAACGTGGCT ATTTAGACTT TATCTCGCTG GTCGCTAAAG AACGCAAGAT GACCATAGAA CAAGTCGATA AGATTGCCCA AGGCCGTGTT TGGAGTGGTA AAAAAGCCCT TGAGCTTGGC TTAGTCGATG AACTGGGTGA TTTAGATGAG GCGATAGCTA AAGCCGCTAA ACTCGCGGAT ATGACACTAT TTGACACCCG CGTTATCGAA CAAGAGCTGA CACCCGAGCA ACGCTTTATG CAACAGATGT TTGCCTCAGT ATCGAGCTAC CTGCCCGCCT CACTGAGCCA ATCATCGATG TTAGAGCAAA TGCTGCAGCA ATGGACCAGC AGCTTGAAAA CGTTAACCTC TTTTGATGAT 
CCAAATAACG TCTACATCTA CTGCGATACT TGCAACATGA TGAACTAA
|
Protein sequence | MSANPSFFKR ICLFIWHTLN GTRKLILNLI FFGILAAIII SIGSSEDIQV EDNSALVLNL AGSIVDQKQQ VDPLEAAFKQ GQNANADGEI LLADVLYVID NAAQDQRIST LVLDLADLKR AGISKLQSIG DALNRFKESG KKVVAIGNYY EQNQYFLASF ADTIYLNPQG GVSLDGLSMY NQYFKSALDK LKVKAHIFRV GTFKSAVEPY MRDDMSDAAR EASSALLADV WQSYTQTVAG NRNIEPNSLV PDATTYLAEL DKANGDSAAM AINMKWVDSL ATTEDFRQTM LETVGKASSG DSFKQVSFYD YLTLVTPLPS FVEQDSVGII VASGTILNGT QPAGQIGGES TAELLRKARF DKHVKALVLR VDSPGGSAFA SEQIRQELLA LKAAGKPVVV SMGSLAASGG YWISASADYI FATPTTLTGS IGIFGMITTF EDSLASIGVH TDGVSTSEWA GLSVTRTLSP QVEAIIQRHI ERGYLDFISL VAKERKMTIE QVDKIAQGRV WSGKKALELG LVDELGDLDE AIAKAAKLAD MTLFDTRVIE QELTPEQRFM QQMFASVSSY LPASLSQSSM LEQMLQQWTS SLKTLTSFDD PNNVYIYCDT CNMMN