Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_3514 |
Symbol | |
ID | 7088627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | - |
Start bp | 4175527 |
End bp | 4177443 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643462398 |
Product | peptidase U32 |
Protein accession | YP_002359419 |
Protein GI | 217974668 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.163924 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGC CAGAGATATA CACTCCCGAG AATCTTTCTC ACGTCCATAA CCGCCTTGAG TTATTGGCGC CTGCAAAAAA TGCTGACTAT GGTATCGAAG CCATACGCCA TGGTGCTGAT GCGGTATACA TAGGGGGGCC CGCATTCGGT GCGCGGGCAA CGGCAGGTAA CAGTGTTGAA GATATCGCGC GACTCTGTAC TTATGCTCAT CAATATCATG CTCAGGTATT CGTCGCCCTC AATACTATTT TGATGGATAA CGAACTGGCC GACGCCGAAA AACTCATTTG GCAATTGTAC GAAGCCGGTG CCGATGCACT GATTGTGCAG GACATGGGTG TATTGCAACT CAATTTGCCG CCGATTGCCC TGCATGCCAG TACCCAAATG GATAACCGCA GCCCAGAAAA AGTGGCGTTT TTAGAGCAAG TTGGTTTTTC ACAGGTGGTG CTCGCCCGTG AATTAGGCTT GAGCCAAATC CGTGATGTGG CCGCGCATAC TAACATGCAG CTCGAATTCT TTATCCACGG CGCTCTGTGT GTGGCTTACA GTGGTTTGTG TAACTTGAGT CATTCCTTTA GTAACCGCAG TGCTAACCGC GGTGAATGCT CGCAAATGTG CCGCCTGCCG GGCAATCTTA AAACCCGTCA AGGCGATGTG TTAGCGCAGA ATGAACATTT ATTGTCTTTA AAAGACAATA ACCAAACCGA AAACCTTGAA GCGCTCATCG ATGCGGGCAT TCGCTCCTTT AAGATTGAAG GTCGCTTAAA AGATTTAAGC TACGTTAAAA ACGTCACCGC GCATTATCGC CAAAAGCTAG ATGCGATTAT GGCGCGTCGC CCTGAGTTTA AGGCCTCGTC CCACGGCCGC TGTGAACATT CCTTCATCCC CGATCCTGAA AAGACCTTTA ACCGCGGCAG CACAGATTAC TTTGTTAACG AACGTAGCCA AGGCATTAAA GATTTCCGTT CGCCAAAATA CATAGGCGAA GAAGTCGGTA AAGTGGTCAG CATAGGCAAA GACTTTATCC AAGTGAGCTC AACCCATGAG TTTAATAACG GCGATGGCTT AGCGTTCTTC CCTGCGAATT TTGCCATGGC AAAACAATCG GATGACAAAC TGCAAGGCCT GCGAGTTAAC CGCGCCGAAG GCTTAAAACT GCATATATTG CAGGTGCCAA AGGATTTAAA AGTCGGCATG ACCTTATATC GTAACCATAA TCAAGCCTTC GAAGCCCTGT TATTGAAAGA ATCTTCTAAG CGGATTATCG GCGTCGATTT ACGTTTGAGC GACACTCAAG ACGGACTGGC GCTGACCTTG ACCGATATCT ATGGCCTAAG CGCCACAGTC AATTTAGTGG TCGAGAAAAC ACCTGCGACA GATGTAGACA AGACCTTGCA AAATACCCGC ACTCAGCTGG GTAAGCTCGG TAGTACCGAC TTTGTTGCCC GCAGTATCAG TATCGATACG GCGCAAGCCT GGTTCTTGCC AGCGTCAGTG CTCAATGGTC TGCGCCGTGA TGGCGTGGCC GCTTTAGAGG CTGCCCGTGT ACAAGGTTAT GTGCGTCCAC TGCCGTGGAA ACATAATCAA GATGCAGTGT ATCCCGTCAA GCATTTGAGT TACTTAGGTA ACGTAGCCAA CGAAAAGGCC AAGGATTTTT ATCAACGTCA CGGCGTGATT GAGATCCAAG ATACCTATGA GAAAAACGGC GTCACTGAAG ACGTGCCATT GATGATCACT AAGCATTGTC TGCGGTTTAA CTTTAATTTG TGTCCGAAGG AAGTACCAGG CATCAAGGCC GATCCTATGG TGCTTGAAAT CGGTAACGAT GTGCTCAAGT TGGTATTCGA CTGTCCTAAA TGTGAAATGA TGGTGGTCGG TGAAAACCGT CAAGTGCGAG GCCAGAAACA CCTTTAA
|
Protein sequence | MSQPEIYTPE NLSHVHNRLE LLAPAKNADY GIEAIRHGAD AVYIGGPAFG ARATAGNSVE DIARLCTYAH QYHAQVFVAL NTILMDNELA DAEKLIWQLY EAGADALIVQ DMGVLQLNLP PIALHASTQM DNRSPEKVAF LEQVGFSQVV LARELGLSQI RDVAAHTNMQ LEFFIHGALC VAYSGLCNLS HSFSNRSANR GECSQMCRLP GNLKTRQGDV LAQNEHLLSL KDNNQTENLE ALIDAGIRSF KIEGRLKDLS YVKNVTAHYR QKLDAIMARR PEFKASSHGR CEHSFIPDPE KTFNRGSTDY FVNERSQGIK DFRSPKYIGE EVGKVVSIGK DFIQVSSTHE FNNGDGLAFF PANFAMAKQS DDKLQGLRVN RAEGLKLHIL QVPKDLKVGM TLYRNHNQAF EALLLKESSK RIIGVDLRLS DTQDGLALTL TDIYGLSATV NLVVEKTPAT DVDKTLQNTR TQLGKLGSTD FVARSISIDT AQAWFLPASV LNGLRRDGVA ALEAARVQGY VRPLPWKHNQ DAVYPVKHLS YLGNVANEKA KDFYQRHGVI EIQDTYEKNG VTEDVPLMIT KHCLRFNFNL CPKEVPGIKA DPMVLEIGND VLKLVFDCPK CEMMVVGENR QVRGQKHL
|
| |