Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SO_3797 |
Symbol | |
ID | 1171441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella oneidensis MR-1 |
Kingdom | Bacteria |
Replicon accession | NC_004347 |
Strand | - |
Start bp | 3948765 |
End bp | 3950681 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637345567 |
Product | U32 family peptidase |
Protein accession | NP_719334 |
Protein GI | 24375291 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGC CAGAGATTTA CACACCTGAG AAACTTTCTC ACGTTAATAA CCGTTTAGAG TTATTGGCGC CCGCTAAAAA TGCCGATTAT GGTATTGAAG CCATTCGCCA TGGTGCCGAT GCGGTTTATA TCGGTGGTCC TGCATTTGGG GCGCGTGCGA CGGCGGGTAA CAGTGTGGAG GATATCGCGC GCCTGTGTAC TTTTGCGCAT AAGTACCATG CTCAGGTGTT TGTCGCCCTT AATACCATTT TGATGGACGA TGAACTGGCA GTGGCTGAAA AACTGATTTG GGACGTGTAT AACGCTGGCG CCGATGCGCT GATTGTGCAG GATATGGGGG TTTTACAGCT CAACCTGCCG CCGATTGCAC TGCACGCCAG TACCCAAATG GATAACCGTA ATCCCGAGAA AGTCGCCTTT TTGGAGCAAG TGGGATTCTC CCAAGTGGTG TTAGCTCGCG AGCTTGGTTT AAGCCAAATT CGTGAGGTTG CGGCTCACAC CAATATGCAG ATTGAGTTCT TTATCCATGG CGCGCTTTGC GTTGCTTATA GTGGCCTATG TAACTTAAGT CATGCCTTTA GTAATCGCAG TGCTAACCGC GGTGAATGTT CGCAAATGTG CCGCTTACCG GGTAATCTTA AGACTCGTCA AGGGGATGTG TTAGCGAAAA ATGAGCACTT ACTTTCATTA AAAGACAATA ATCAAACCGA TAACCTTGAA GCCCTGATCG ATGCTGGCGT ACGTTCGTTC AAAATCGAAG GCCGTTTAAA AGATTTAAGT TACGTGAAGA ACGTGACTGC TCATTATCGC CAAAAACTCG ATGCTATTAT GGCGCGTCGC CCAGAGTTTG TGGCTTCATC CCATGGCCGT ACTGAGCATA CTTTTACCCC AGATCCAGAA AAAACCTTTA ACCGTGGCAG TACCGACTAC TTTGTCCATG AGCGTAGCCA AGGGATTAAA GATTTCCGCT CGCCTAAATA TATTGGCCAA GATGTCGGTA AAGTGGTCGC TATTGGTAAG GACTTTATTC AAGTCAGTTC AACCCACGAA TTTAATAACG GTGATGGTTT AGCTTATTTT CCACCCAATT ATGCGATGGC AAAGCAGTCC GACGATAAGT TGCAAGGTTT ACGGGTAAAC CGCGCTGAAG GCCATAAGCT GCATGTGTTA CAAGTCCCGC GGGATTTACG TGTTGGTATG ACCTTATACC GTAACCATAA TCAGGCATTC GAAGCCTTGT TAGCTAAAGA GTCGGCCAAG CGTATTATTG CTGTGGACAT GCGCTTAATC GATACGACGG CGGGTGTGGC ACTGACCTTA ACGGATATGT ATGGCTTGAG CGCCTCGGTT GAGTTAGCGG TAGAAAAAAC GCCGGCAACC GATGCTGAAA AAACCTTACA GACTATTCGT ACACAACTGT CAAAATTGGG TAGCACCGAC TTTGTTGCTC GTCAAATCAG CATTGAAACT GCCGAGCCTT GGTTCCTGCC TGCATCAACG CTCAATGGTC TTCGCCGTGA TGCGGTTGCT GCACTTGAAC TTGCCCGCAT AGAAGGCTAC CAACGGCCAA AACCTTGGAA ATATAACCAA GATGCCGTCT ATCCATTCAA ACACTTAAGT TACTTAGGTA ACGTGGCAAA CGAAAAGGCC AAAGATTTTT ATCAACGCCA TGGCGTGATT GAAATTCAAG ATACCTATGA GAAAAACGGC GTCACTGAAG ACGTGCCTTT GATGGTGACT AAGCACTGTC TGAGATTTAA CTTTAATCTT TGCCCTAAGG AAGTACCAGG TATCAAGGCT GACCCTATGG TGCTCGAAAT TGGTAACGAT GTACTTAAGT TAGTATTTGA TTGTCCTAAG TGCGAAATGC TTGTTGTCGG TGAAAACCGC CAGGTTCGCG GCCAAAAAGC ACTTTAA
|
Protein sequence | MSQPEIYTPE KLSHVNNRLE LLAPAKNADY GIEAIRHGAD AVYIGGPAFG ARATAGNSVE DIARLCTFAH KYHAQVFVAL NTILMDDELA VAEKLIWDVY NAGADALIVQ DMGVLQLNLP PIALHASTQM DNRNPEKVAF LEQVGFSQVV LARELGLSQI REVAAHTNMQ IEFFIHGALC VAYSGLCNLS HAFSNRSANR GECSQMCRLP GNLKTRQGDV LAKNEHLLSL KDNNQTDNLE ALIDAGVRSF KIEGRLKDLS YVKNVTAHYR QKLDAIMARR PEFVASSHGR TEHTFTPDPE KTFNRGSTDY FVHERSQGIK DFRSPKYIGQ DVGKVVAIGK DFIQVSSTHE FNNGDGLAYF PPNYAMAKQS DDKLQGLRVN RAEGHKLHVL QVPRDLRVGM TLYRNHNQAF EALLAKESAK RIIAVDMRLI DTTAGVALTL TDMYGLSASV ELAVEKTPAT DAEKTLQTIR TQLSKLGSTD FVARQISIET AEPWFLPAST LNGLRRDAVA ALELARIEGY QRPKPWKYNQ DAVYPFKHLS YLGNVANEKA KDFYQRHGVI EIQDTYEKNG VTEDVPLMVT KHCLRFNFNL CPKEVPGIKA DPMVLEIGND VLKLVFDCPK CEMLVVGENR QVRGQKAL
|
| |