Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_3081 |
Symbol | |
ID | 7088991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | - |
Start bp | 3652973 |
End bp | 3656257 |
Gene Length | 3285 bp |
Protein Length | 1094 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643461965 |
Product | peptidase S41 |
Protein accession | YP_002358989 |
Protein GI | 217974238 |
COG category | [S] Function unknown |
COG ID | [COG4946] Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00024555 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00566406 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACTTC GCCATTCTGT CGCCTGCTGC ATGCTTGCCC TAGGCACGTT CACTGCGGCC CAAGCCATCG CTGCCGATGC GGTGAGCAAT CAAGGATATT ACCGCGCGCC AGCCCTGCAC GATCAAACCT TAGTCTTTAC CGCTGAAGGG GATTTATGGA CTCAGACCTT AGGTCAAAAA GCGGCGACTC GCTTAACGAG TTTACCTGCA GAAGAGCTAG GTGCGACCAT TTCCGCCGAT GGCAAGTGGG TTGCTTATGT GGCCAATTAT GAAGGCGCGA GCGAAGTGTA TGTGATCCCG GTTGCCGGCG GTGTGGCCAA GCGGGTGAGT TTTGAGAATA GCCGTGTGCG GGTTCAAGGT TGGACCCCAA AAGGCGAGAT TTTATATTCC ACCGACAGTG GTTTTGGCCC CGCGAATAAT TGGATGCTAC GCAGTGTCGA TCCTAAAACC TTCACGACCA CAGATTTACC CTTAGCCGAT GCGGTCGAAG GTGTGATTGA TGATCGCAAC GAATATGTGT ATTTCACCCG TTTTGGCCTG CAAGTCACTG GCGATAACGC CAAAGTGTAC CGTGGCGGCG CAAAGGGCGA ATTATGGCGC TTTAAACTCG GCAGCAAAAC CGAAGCCCAG TTGCTCAGCG GCCAGCACGA CGGCTCAGTG CGCCAACCTA TGCTATGGCA AGACAGACTC TATTTTATCA GCGACAGCGA TGGTAACGAC AACCTCTGGT CGATGGCGCT CGATGGTTCA GACGCAAAAC AGTTGACCCA ATTTAAAGAT TGGCAAGTCC GCGGCGCGCA GATGGATCAA GGCAAAGTGG TCTTCCAGCA GGGTGCTGAT ATCCATGTAT TTGATATCGC GGCCGCAAAA GACGCATTAG TCAATATTGA ACTCACGTCC GATTTTGCCC AGCGCCGTGA GCACTGGGTG AAAGATCCTA TGGATTACGC ATCCTCCACC GATTTAGCCC TTGCTGGCGA TAAAGTGGTG ATAACCGCCC GCAGCCATGT GGCAATTGCG GGTGTTGATG GCTCGCGCTT AGTTCAAGTT GCGCTGCCAG GGACTTATCG CGTGAGAGAA GCGATATTGA GTCAAGATGG CAAGTCTGTT TATGCGATTA GCGACATGAC AGGTCAGCAA GAAATTTGGC AATTCCCCGC CGATGGCAGC AGTGGTGCTA AGCAGCTGAC AAAAGACGGT CATACCTTAA GAATGAGCCT GAATCTGTCC AATGACGGTC GATTTATCGC CCACGACGAT AACGATGGCA ATGTTTGGCT GCTCGACTTA AAGAAAAATA CCAATCAAAA AATCATCACT AACGGTGAAG GACTTGGGCC TTATGCTGAT ATCCGCTGGT CCGGAGACAG CCGTTTTATT GCATTAACTA AGTCGGAAAT CGGTAAACAA AGACCGCAAA TTGTCCTTTA CTCTGTGGAT GAAAATAAGG CCGAGACGCT CACCAGCGAT AAGTACGAAT CCTATTCGCC GACCTTTAGC AGTGACGGCC AGTGGTTATA TTTCCTGTCA AATCGCCAAT TCAGCGCCAC GCCAAGTTCA CCTTGGGGCG ATAGAAACAT GGGCCCAGTG TTTGATAAAC GCAGCCAAAT TTTTGCCATC GCCTTAGTTA ACAATGCCAA ATTCCCCTTT AGTAAACCCA ATGAACTCAG CGTTAACCAA GCCGATAAAA CCGATGCTAA GGATAAACCG AATCCGGTGA AAATCGATTG GTCTGGTATT TCGCAGCGCC TATGGCAAGT GCCTATCGAT TCAGGCAATT ACAGTGAACT GCGCGCCGTC GATGGTAGGC TTTATGTGCT CGACCAAGCG ATGGGTGATG ACGCCGAACC GAGTCTGATG ACCATCAAGT TTGATCCATT AAGTCCGAAA GCCGAAGTTT TTGCCGAGGA TATTGGTCAT TATGCGGTTT CTGCCGATGG CAAAAAGCTG ATGCTGCGTA AGTTCAGCAA TGACAAGGCG CTGATGATTG TTGACGGTGG CGATAAGCTA GGCGATACGG ATAATGCCAA AGTGCAAACG GATCAATGGC AATTAGCGAT TTATCCCCAG CTTGAATGGC AACAAATGTT TGAAGATGCT TGGTTGATGC ACAGAGAATC TTTCTTCGAT AAGAAGATGC GCGGCCTCGA TTGGCAAGCG ACCAAAGCTA AGTATCAACC TTTGCTTGAC CGCCTGACTG ACCGTAACGA ACTGAACGAT ATCTTCATGC AAATGATGGG CGAATTGGAT TCTTTGCACT CACAGGTTCG CGGCGGTGAT TTACCCAAAG ATCCCGATGC GGCTAAAGCA GCTAGCCTAG GTGCGCGTCT ACAGCAAACC GCCGATGGCG TTAAAATTGC GCATATTTAC AGTAACGATC CTGAGTTGCC AGCTAACGCT TCACCGTTAA ATCGTATCGA AGTCGATGCC AAAGAAGGCG ATGTGCTGCT GGCCATCAAT GGCACGCCAG TGGCGAACGT GGCTGATGTC ACACGGCTGT TGCGCAATCA ACAGGACAAG CAAGTATTAC TGCAACTTAA GCGTGGCAGC CAAACCCATA AAACCATCGT TATGCCCGTC AGCGCTATGG TGGACGGCCA ATTACGTTAT TTAGATTGGG TGAGCCATAA CGCGACTGTG GTCGAGGATG CGAGTAAGGG CAAGATTGGT TATTTGCACT TATATGCCAT GGGCGGCGGC GATATTGAAA GTTTCGCCCG TGAGTTTTAT ACCAATTACG ATAAAGACGG TTTGATTATC GACGTTCGCC GTAATCGCGG CGGCAATATC GACAGTTGGA TTATCGAGAA GTTATTGCGC CGCGCTTGGG CTTTCTGGCA ACCGACCCAC GGCACGCCAA ACACCAATAT GCAGCAAACC TTCCGTGGCC ATTTAGTCGT GTTAACCGAT GAACTAACCT ATTCCGACGG TGAAACCTTC TCAGCGGGAA TTAAAGCGCT GGGTATTGCG CCGCTAATTG GTAAACAAAC GGCAGGCGCG GGCGTGTGGT TATCTGGGCG AAATTCGCTG ACCGATAAAG GTATGGCGCG AGTGGCCGAG TATCCACAGT ATGCAATGGA TGGTCGCTGG GTGCTGGAAG GACATGGCGT AACGCCGGAT ATTGAAGTCG ATAACTTACC TTTCGCCACC TTTAACGGTA AAGATGCACA GCTTGAGACT GCGATAAGCT ATTTGAAAGA TGAGTTGGTT AAGCAGCCCG TTCCAGCCTT AAAAGCGCAA CCTCTGCCTG CAAAAGGAAT GGCACAGGAT ATAAAAGCTA AGTAA
|
Protein sequence | MKLRHSVACC MLALGTFTAA QAIAADAVSN QGYYRAPALH DQTLVFTAEG DLWTQTLGQK AATRLTSLPA EELGATISAD GKWVAYVANY EGASEVYVIP VAGGVAKRVS FENSRVRVQG WTPKGEILYS TDSGFGPANN WMLRSVDPKT FTTTDLPLAD AVEGVIDDRN EYVYFTRFGL QVTGDNAKVY RGGAKGELWR FKLGSKTEAQ LLSGQHDGSV RQPMLWQDRL YFISDSDGND NLWSMALDGS DAKQLTQFKD WQVRGAQMDQ GKVVFQQGAD IHVFDIAAAK DALVNIELTS DFAQRREHWV KDPMDYASST DLALAGDKVV ITARSHVAIA GVDGSRLVQV ALPGTYRVRE AILSQDGKSV YAISDMTGQQ EIWQFPADGS SGAKQLTKDG HTLRMSLNLS NDGRFIAHDD NDGNVWLLDL KKNTNQKIIT NGEGLGPYAD IRWSGDSRFI ALTKSEIGKQ RPQIVLYSVD ENKAETLTSD KYESYSPTFS SDGQWLYFLS NRQFSATPSS PWGDRNMGPV FDKRSQIFAI ALVNNAKFPF SKPNELSVNQ ADKTDAKDKP NPVKIDWSGI SQRLWQVPID SGNYSELRAV DGRLYVLDQA MGDDAEPSLM TIKFDPLSPK AEVFAEDIGH YAVSADGKKL MLRKFSNDKA LMIVDGGDKL GDTDNAKVQT DQWQLAIYPQ LEWQQMFEDA WLMHRESFFD KKMRGLDWQA TKAKYQPLLD RLTDRNELND IFMQMMGELD SLHSQVRGGD LPKDPDAAKA ASLGARLQQT ADGVKIAHIY SNDPELPANA SPLNRIEVDA KEGDVLLAIN GTPVANVADV TRLLRNQQDK QVLLQLKRGS QTHKTIVMPV SAMVDGQLRY LDWVSHNATV VEDASKGKIG YLHLYAMGGG DIESFAREFY TNYDKDGLII DVRRNRGGNI DSWIIEKLLR RAWAFWQPTH GTPNTNMQQT FRGHLVVLTD ELTYSDGETF SAGIKALGIA PLIGKQTAGA GVWLSGRNSL TDKGMARVAE YPQYAMDGRW VLEGHGVTPD IEVDNLPFAT FNGKDAQLET AISYLKDELV KQPVPALKAQ PLPAKGMAQD IKAK
|
| |