Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SO_3942 |
Symbol | |
ID | 1171580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella oneidensis MR-1 |
Kingdom | Bacteria |
Replicon accession | NC_004347 |
Strand | + |
Start bp | 4087110 |
End bp | 4088462 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637345701 |
Product | serine protease |
Protein accession | NP_719473 |
Protein GI | 24375430 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACGA AATTATCTGT ACTTTCAGCC GCAATGTTAG CCGCAACCCT GACAATGATG CCAGCTGTCT CACAGGCCGC TATTCCGCAA TCTGTTGAGG GACAATCCAT ACCCAGTCTT GCGCCCATGT TAGAGCGTAC GACTCCCGCC GTGGTTTCTG TGGCGGTTTC TGGAACCCAT GTTTCTAAAC AACGCGTGCC CGATGTGTTC CGTTATTTCT TTGGCCCTAA TGCGCCACAA GAACAAGTGC AAGAGCGTCC ATTTAGAGGC TTAGGCTCCG GCGTTATTAT CGATGCCGAC AAAGGCTATA TTGTCACCAA TAACCACGTG ATCGACGGTG CCGATGATAT CCAAGTGGGT TTACACGATG GCCGTGAAGT CAAAGCCAAA TTGATTGGTA CTGACTCCGA ATCCGACATT GCGTTATTGC AAATCGAGGC TAAAAATCTG GTCGCGATTA AAACCTCTGA TTCTGATGAA CTGCGCGTCG GTGACTTTGC CGTCGCTATC GGTAACCCCT TCGGTCTAGG ACAAACCGTA ACATCAGGGA TTGTCAGTGC CCTAGGCCGT AGCGGTTTAG GCATTGAAAT GCTTGAAAAC TTTATCCAAA CCGACGCAGC CATCAACAGT GGTAACTCGG GCGGCGCTCT AGTTAACCTT AAAGGCGAAC TGATCGGTAT TAACACGGCT ATCGTCGCGC CTAACGGCGG TAACGTAGGT ATCGGTTTTG CGATTCCAGC AAACATGGTG AAAAACCTCA TCGCACAGAT TGCTGAGCAT GGTGAAGTTC GCCGCGGCGT ATTGGGGATT GCTGGCCGTG ATTTAGATAG CCAACTTGCC CAAGGCTTTG GCTTAGACAC TCAACACGGT GGCTTTGTGA ATGAAGTTAG CGCGGGCAGT GCCGCCGAAA AAGCTGGTAT TAAGGCGGGC GATATTATCG TTAGCGTCGA TGGCCGTGCG ATCAAGTCGT TCCAAGAGCT GCGTGCTAAA GTCGCGACGA TGGGCGCTGG CGCTAAAGTC GAACTGGGCC TTATCCGTGA TGGCGATAAG AAAACTGTCA ACGTCACCTT AGGTGAAGCA AACCAAACGA CAGAAAAAGC AGCTGGTGCA GTGCATCCTA TGCTACAAGG TGCCTCATTA GAAAATGCCT CTAAAGGGGT CGAAATTACC GATGTTGCAC AAGGATCTCC AGCGGCAATG AGCGGTCTGC AAAAAGGCGA TTTGATTGTC GGTATCAACC GTACTGCGGT TAAAGATCTT AAATCGCTCA AAGAGCTGCT CAAAGATCAA GAAGGTGCTG TCGCCCTGAA GATTGTCCGT GGTAAGAGCA TGCTTTACTT AGTGCTTCGT TAA
|
Protein sequence | MKTKLSVLSA AMLAATLTMM PAVSQAAIPQ SVEGQSIPSL APMLERTTPA VVSVAVSGTH VSKQRVPDVF RYFFGPNAPQ EQVQERPFRG LGSGVIIDAD KGYIVTNNHV IDGADDIQVG LHDGREVKAK LIGTDSESDI ALLQIEAKNL VAIKTSDSDE LRVGDFAVAI GNPFGLGQTV TSGIVSALGR SGLGIEMLEN FIQTDAAINS GNSGGALVNL KGELIGINTA IVAPNGGNVG IGFAIPANMV KNLIAQIAEH GEVRRGVLGI AGRDLDSQLA QGFGLDTQHG GFVNEVSAGS AAEKAGIKAG DIIVSVDGRA IKSFQELRAK VATMGAGAKV ELGLIRDGDK KTVNVTLGEA NQTTEKAAGA VHPMLQGASL ENASKGVEIT DVAQGSPAAM SGLQKGDLIV GINRTAVKDL KSLKELLKDQ EGAVALKIVR GKSMLYLVLR
|
| |