Gene Information |
Locus tag | Sbal195_1309 |
Symbol | |
ID | 5753037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS195 |
Kingdom | Bacteria |
Replicon accession | NC_009997 |
Strand | + |
Start bp | 1550505 |
End bp | 1553789 |
Gene Length | 3285 bp |
Protein Length | 1094 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641287579 |
Product | peptidase S41 |
Protein accession | YP_001553744 |
Protein GI | 160874428 |
COG category | [S] Function unknown |
COG ID | [COG4946] Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system |
TIGRFAM ID | |
|
|
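The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. Neither assumption is stated in the record, but both agree with the reported values. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1550505
end_bp = 1553789

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; assume the last codon is a stop codon
# and therefore does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the "Gene Length" field
print(protein_length, "aa")   # compare with the "Protein Length" field
```

Under these assumptions the computed values match the Gene Length (3285 bp) and Protein Length (1094 aa) fields.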
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00413627 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.677768 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTTC GCCATTCTGT CGCCTGCTGC ATGCTTGCCC TAGGCACGTT CACTGCGGCC CAAGCCATCG CTGCCGATGC GGTGAGCAAT CAAGGATATT ACCGCGCGCC AGCCCTGCAC GATCAAACCT TAGTCTTTAC CGCTGAAGGG GATTTATGGA CTCAGACCTT AGGTCAAAAA GCGGCGACTC GCTTAACGAG TTTACCCGCA GAAGAGCTAG GTGCGACCAT TTCCGCCGAT GGCAAGTGGG TTGCTTATGT GGCCAATTAT GAAGGCGCGA GCGAAGTGTA TGTGATCCCG GTTGCCGGCG GTGTGGCCAA GCGGGTGAGT TTTGAGAATA GCCGTGTGCG GGTTCAAGGT TGGACCCCAA AAGGCGAGAT TTTATATTCC ACCGACAGTG GTTTTGGCCC CGCGAATAAT TGGATGCTAC GCAGCGTCGA TCCTAAAACC CTCACGACCA CAGATTTACC CTTAGCCGAT GCGGTCGAAG GTGTGATTGA TGAGCGCAAC GAATATGTGT ATTTCACCCG CTTTGGCCTG CAAGTCACTG GCGATAACGC CAAAGTGTAC CGTGGCGGCG CAAAGGGCGA ATTATGGCGC TTCAAGCTTG GTAGCAAAAC CGAAGCCCAG TTGCTCAGCG GCCAGCACGA CGGCTCAGTG CGCCAACCTA TGCTATGGCA AGACAGACTC TATTTTATCA GCGACAGTGA TGGTAATGAC AATCTCTGGT CGATGGCGCT CGATGGTTCA GACGCAAAAC AGTTGACCCA ATTTAAAGAT TGGCAAGTCC GCGGCGCGCG GATGGATCAA GGCAAAGTGG TCTTCCAGTT AGGTGCGGAT ATCCATGTAT TTGATGTTGC GGCCGCGAAA GACGCATTAC TCGATATCGA ACTCACGTCC GATTTTGCCC AGCGCCGTGA GCACTGGGTG AAAGATCCTA TGGATTATGC ATCCTCTACC GATTTAGCCC TTGCTGGCGA TAAAGTGGTG ATCACCGCCC GCAGCCATGT GGCTATTGCG GGTGTTGATG GTTCGCGCTT AGTGCAAGTT GCGCTGCCGG GGACTTATCG AGTTAGAAAC GCGATTATGA GTCAAGATGG CAAGTCGGTT TATGCGCTTA GCGATATGAC AGGCCAACAG GAAATTTGGC AATTCCCCGC CGATGGCAGC AGTGGCGCTA AGCAGCTGAC GAAGGACGGC CATACCTTAA GAATGAGTTT GAATCTGTCC AATGACGGTC GATTTATCGC CCACGACGAT AACGATGGCA ATGTTTGGTT GCTCGACTTA AAGAAAAATA CCAATCAGAA AATCATCACT AACGGCGAGG GACTTGGACC TTATGCTGAT ATTCGCTGGT CCGGAGACAG CCGTTTTATC GCATTAACTA AGTCAGAAAT CGGTAAACAA AGACCGCAAA TCGTCCTTTA CTCTGTGGAT GAAAATAAGG TCGAGACGCT CACCAGTGAT AAGTACGAAT CCTATTCGCC AACCTTTAGC AGTGACGGTC AGTGGCTATA TTTCCTGTCA AATCGCCAAT TTAGTGCTAC GCCAAGTTCA CCTTGGGGCG ATAGAAACAT GGGCCCAGTG TTTGATAAAC GCAGCCAAAT TTTTGCCATC GCCTTAGTTA GCAATGCCAA ATTTCCCTTT AGTAAGCCCA ATGAACTCAG TGTTAACCAA GCCGACAAAA CCGATGCTAA AGATAAGCCG GAACCGGTGA AAATTGATTG GTCAGGTATT TCGCAGCGCT TATGGCAAGT ACCTGTCGAT TCAGGCAATT TCAGTGAACT GCGCGCCGTC 
GATGGCAGAC TTTATGTGCT TGATCAAGCA ATGGGTGATG ACGCGGAACC AAGCCTGATG ACCATCAAGT TTGATCCATT GAGACCTAAG GCCGAGGTTT TTGCCGAAGA TATTGGTCAT TATGCGGTTT CTGCCGATGG CAAAAAGCTG ATGCTAAGAA AGTTCAGCAA TGACAAGGCG CTGATGATTG TTGACTGTGG CGATAAGCTT GGCGATACCG ATAATGCCAA AGTGCAAACG GATCAATGGC AATTAGCGAT TTCTCCCCAG CTTGAATGGC AACAAATGTT TGAAGATGCT TGGTTGATGC ACAGGGAATC TTTCTTCGAT AAGAAGATGC GCGGCCTCGA TTGGCAAGCG ACCAAAGCTA AGTATCAACC TTTGCTCGAC CGCCTGACTG ACCGTAATGA ACTGAACGAT ATCTTTATGC AAATGATGGG CGAATTAGAT TCATTGCATT CACAGGTTCG CGGCGGTGAT TTACCTAAAG ATCCCGATGC GGCTAAAGCG GCAAGTCTTG GTGCGCGCCT ACAGCAAACT GCAGATGGCG TTAAAATTGC GCACATTTAC AGTAACGATC CTGAGTTGCC CGCTAACGCA TCACCGTTAA ATCGTATCGA AGTCGATGCC AAAGAAGGCG ATGTGTTGTT AGCTATCAAT GGCACGCCAG TGGCGAACGT GGCCGATGTC ACGCGTCTGT TACGTAATCA ACAGGACAAG CAAGTATTGC TGCAACTTAA GCGTGGCAGC AAAACCCACA AAACTATCGT CATGCCTGTT AGCGCTATGG TGGATGGCCA GTTACGTTAC TTAGATTGGG TGAGCCATAA CGCGGCTGTG GTCGAGGATG CGAGTAAGGG CAAGATTGGC TATTTGCACT TATATGCGAT GGGCGGCGGC GATATAGAAA GTTTTGCCCG TGAGTTTTAT ACCAATTATG ATAAAGACGG TTTGATTATC GACGTTCGCC GTAACCGCGG CGGCAATATC GACAGTTGGA TTATTGAGAA GTTATTGCGC CGCGCTTGGG CGTTTTGGCA ACCGACCCAC GGCACGCCAA ACACCAATAT GCAGCAAACC TTCCGCGGCC ATTTAGTCGT GTTAACCGAT GAGCTGACTT ATTCCGACGG TGAGACCTTC TCCGCTGGGA TTAAAGCGCT GGGTATTGCA CCGCTAATTG GCAAGCAAAC GGCAGGGGCG GGCGTGTGGT TATCTGGGCG AAACTCGCTG ACCGATAAAG GAATGGCGCG GGTGGCCGAG TATCCACAGC ATGCAATGGA TGGTCGTTGG GTGCTAGAAG GGCATGGCGT AACGCCGGAT ATCGAAGTTG ATAACTTACC TTTTGCGACC TTTAACGGTA AAGATGCACA GCTTGAGACG GCAATAAGCT ATTTAAAGGA TGAATTGGTT AAGCAGCCCG TTCCAGCCTT AAAAGCGCAG CCTATGCCGG CAAAAGGAAT GGCACAGGAT ATAAAAGCTA AGTAA
|
Protein sequence | MKLRHSVACC MLALGTFTAA QAIAADAVSN QGYYRAPALH DQTLVFTAEG DLWTQTLGQK AATRLTSLPA EELGATISAD GKWVAYVANY EGASEVYVIP VAGGVAKRVS FENSRVRVQG WTPKGEILYS TDSGFGPANN WMLRSVDPKT LTTTDLPLAD AVEGVIDERN EYVYFTRFGL QVTGDNAKVY RGGAKGELWR FKLGSKTEAQ LLSGQHDGSV RQPMLWQDRL YFISDSDGND NLWSMALDGS DAKQLTQFKD WQVRGARMDQ GKVVFQLGAD IHVFDVAAAK DALLDIELTS DFAQRREHWV KDPMDYASST DLALAGDKVV ITARSHVAIA GVDGSRLVQV ALPGTYRVRN AIMSQDGKSV YALSDMTGQQ EIWQFPADGS SGAKQLTKDG HTLRMSLNLS NDGRFIAHDD NDGNVWLLDL KKNTNQKIIT NGEGLGPYAD IRWSGDSRFI ALTKSEIGKQ RPQIVLYSVD ENKVETLTSD KYESYSPTFS SDGQWLYFLS NRQFSATPSS PWGDRNMGPV FDKRSQIFAI ALVSNAKFPF SKPNELSVNQ ADKTDAKDKP EPVKIDWSGI SQRLWQVPVD SGNFSELRAV DGRLYVLDQA MGDDAEPSLM TIKFDPLRPK AEVFAEDIGH YAVSADGKKL MLRKFSNDKA LMIVDCGDKL GDTDNAKVQT DQWQLAISPQ LEWQQMFEDA WLMHRESFFD KKMRGLDWQA TKAKYQPLLD RLTDRNELND IFMQMMGELD SLHSQVRGGD LPKDPDAAKA ASLGARLQQT ADGVKIAHIY SNDPELPANA SPLNRIEVDA KEGDVLLAIN GTPVANVADV TRLLRNQQDK QVLLQLKRGS KTHKTIVMPV SAMVDGQLRY LDWVSHNAAV VEDASKGKIG YLHLYAMGGG DIESFAREFY TNYDKDGLII DVRRNRGGNI DSWIIEKLLR RAWAFWQPTH GTPNTNMQQT FRGHLVVLTD ELTYSDGETF SAGIKALGIA PLIGKQTAGA GVWLSGRNSL TDKGMARVAE YPQHAMDGRW VLEGHGVTPD IEVDNLPFAT FNGKDAQLET AISYLKDELV KQPVPALKAQ PMPAKGMAQD IKAK
|
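The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard NCBI amino-acid string, which translation table 11 shares with table 1 for codon-to-amino-acid assignments (the two tables differ in permitted start codons). The helper names `translate` and `gc_fraction` are hypothetical. Spaces in the ten-base groups are stripped before translation, and ambiguous bases such as N are not handled (they would raise a `KeyError`).

```python
from itertools import product

# Codon order follows the NCBI convention: each position cycles through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base groups of the gene sequence above.
print(translate("ATGAAACTTC GCCATTCTGT CGCCTGCTGC"))  # MKLRHSVACC
```

Passing the full Gene sequence field to `translate` should yield the Protein sequence field, and `gc_fraction` on the same input can be compared with the GC content field.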
| |