Gene Sbal223_3081 details


Gene Information

Locus tag: Sbal223_3081
Symbol:
ID: 7088991
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 3652973
End bp: 3656257
Gene Length: 3285 bp
Protein Length: 1094 aa
Translation table: 11
GC content: 50%
IMG OID: 643461965
Product: peptidase S41
Protein accession: YP_002358989
Protein GI: 217974238
COG category: [S] Function unknown
COG ID: [COG4946] Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system
TIGRFAM ID:
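
The Start bp, End bp, Gene Length, and Protein Length values above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes one stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3652973
end_bp = 3656257
protein_length_aa = 1094

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus a single terminal stop codon.
expected_length_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)      # 3285
print(expected_length_bp)  # 3285
```

Both figures agree with the listed gene length of 3285 bp.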


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00024555
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00566406
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAACTTC GCCATTCTGT CGCCTGCTGC ATGCTTGCCC TAGGCACGTT CACTGCGGCC 
CAAGCCATCG CTGCCGATGC GGTGAGCAAT CAAGGATATT ACCGCGCGCC AGCCCTGCAC
GATCAAACCT TAGTCTTTAC CGCTGAAGGG GATTTATGGA CTCAGACCTT AGGTCAAAAA
GCGGCGACTC GCTTAACGAG TTTACCTGCA GAAGAGCTAG GTGCGACCAT TTCCGCCGAT
GGCAAGTGGG TTGCTTATGT GGCCAATTAT GAAGGCGCGA GCGAAGTGTA TGTGATCCCG
GTTGCCGGCG GTGTGGCCAA GCGGGTGAGT TTTGAGAATA GCCGTGTGCG GGTTCAAGGT
TGGACCCCAA AAGGCGAGAT TTTATATTCC ACCGACAGTG GTTTTGGCCC CGCGAATAAT
TGGATGCTAC GCAGTGTCGA TCCTAAAACC TTCACGACCA CAGATTTACC CTTAGCCGAT
GCGGTCGAAG GTGTGATTGA TGATCGCAAC GAATATGTGT ATTTCACCCG TTTTGGCCTG
CAAGTCACTG GCGATAACGC CAAAGTGTAC CGTGGCGGCG CAAAGGGCGA ATTATGGCGC
TTTAAACTCG GCAGCAAAAC CGAAGCCCAG TTGCTCAGCG GCCAGCACGA CGGCTCAGTG
CGCCAACCTA TGCTATGGCA AGACAGACTC TATTTTATCA GCGACAGCGA TGGTAACGAC
AACCTCTGGT CGATGGCGCT CGATGGTTCA GACGCAAAAC AGTTGACCCA ATTTAAAGAT
TGGCAAGTCC GCGGCGCGCA GATGGATCAA GGCAAAGTGG TCTTCCAGCA GGGTGCTGAT
ATCCATGTAT TTGATATCGC GGCCGCAAAA GACGCATTAG TCAATATTGA ACTCACGTCC
GATTTTGCCC AGCGCCGTGA GCACTGGGTG AAAGATCCTA TGGATTACGC ATCCTCCACC
GATTTAGCCC TTGCTGGCGA TAAAGTGGTG ATAACCGCCC GCAGCCATGT GGCAATTGCG
GGTGTTGATG GCTCGCGCTT AGTTCAAGTT GCGCTGCCAG GGACTTATCG CGTGAGAGAA
GCGATATTGA GTCAAGATGG CAAGTCTGTT TATGCGATTA GCGACATGAC AGGTCAGCAA
GAAATTTGGC AATTCCCCGC CGATGGCAGC AGTGGTGCTA AGCAGCTGAC AAAAGACGGT
CATACCTTAA GAATGAGCCT GAATCTGTCC AATGACGGTC GATTTATCGC CCACGACGAT
AACGATGGCA ATGTTTGGCT GCTCGACTTA AAGAAAAATA CCAATCAAAA AATCATCACT
AACGGTGAAG GACTTGGGCC TTATGCTGAT ATCCGCTGGT CCGGAGACAG CCGTTTTATT
GCATTAACTA AGTCGGAAAT CGGTAAACAA AGACCGCAAA TTGTCCTTTA CTCTGTGGAT
GAAAATAAGG CCGAGACGCT CACCAGCGAT AAGTACGAAT CCTATTCGCC GACCTTTAGC
AGTGACGGCC AGTGGTTATA TTTCCTGTCA AATCGCCAAT TCAGCGCCAC GCCAAGTTCA
CCTTGGGGCG ATAGAAACAT GGGCCCAGTG TTTGATAAAC GCAGCCAAAT TTTTGCCATC
GCCTTAGTTA ACAATGCCAA ATTCCCCTTT AGTAAACCCA ATGAACTCAG CGTTAACCAA
GCCGATAAAA CCGATGCTAA GGATAAACCG AATCCGGTGA AAATCGATTG GTCTGGTATT
TCGCAGCGCC TATGGCAAGT GCCTATCGAT TCAGGCAATT ACAGTGAACT GCGCGCCGTC
GATGGTAGGC TTTATGTGCT CGACCAAGCG ATGGGTGATG ACGCCGAACC GAGTCTGATG
ACCATCAAGT TTGATCCATT AAGTCCGAAA GCCGAAGTTT TTGCCGAGGA TATTGGTCAT
TATGCGGTTT CTGCCGATGG CAAAAAGCTG ATGCTGCGTA AGTTCAGCAA TGACAAGGCG
CTGATGATTG TTGACGGTGG CGATAAGCTA GGCGATACGG ATAATGCCAA AGTGCAAACG
GATCAATGGC AATTAGCGAT TTATCCCCAG CTTGAATGGC AACAAATGTT TGAAGATGCT
TGGTTGATGC ACAGAGAATC TTTCTTCGAT AAGAAGATGC GCGGCCTCGA TTGGCAAGCG
ACCAAAGCTA AGTATCAACC TTTGCTTGAC CGCCTGACTG ACCGTAACGA ACTGAACGAT
ATCTTCATGC AAATGATGGG CGAATTGGAT TCTTTGCACT CACAGGTTCG CGGCGGTGAT
TTACCCAAAG ATCCCGATGC GGCTAAAGCA GCTAGCCTAG GTGCGCGTCT ACAGCAAACC
GCCGATGGCG TTAAAATTGC GCATATTTAC AGTAACGATC CTGAGTTGCC AGCTAACGCT
TCACCGTTAA ATCGTATCGA AGTCGATGCC AAAGAAGGCG ATGTGCTGCT GGCCATCAAT
GGCACGCCAG TGGCGAACGT GGCTGATGTC ACACGGCTGT TGCGCAATCA ACAGGACAAG
CAAGTATTAC TGCAACTTAA GCGTGGCAGC CAAACCCATA AAACCATCGT TATGCCCGTC
AGCGCTATGG TGGACGGCCA ATTACGTTAT TTAGATTGGG TGAGCCATAA CGCGACTGTG
GTCGAGGATG CGAGTAAGGG CAAGATTGGT TATTTGCACT TATATGCCAT GGGCGGCGGC
GATATTGAAA GTTTCGCCCG TGAGTTTTAT ACCAATTACG ATAAAGACGG TTTGATTATC
GACGTTCGCC GTAATCGCGG CGGCAATATC GACAGTTGGA TTATCGAGAA GTTATTGCGC
CGCGCTTGGG CTTTCTGGCA ACCGACCCAC GGCACGCCAA ACACCAATAT GCAGCAAACC
TTCCGTGGCC ATTTAGTCGT GTTAACCGAT GAACTAACCT ATTCCGACGG TGAAACCTTC
TCAGCGGGAA TTAAAGCGCT GGGTATTGCG CCGCTAATTG GTAAACAAAC GGCAGGCGCG
GGCGTGTGGT TATCTGGGCG AAATTCGCTG ACCGATAAAG GTATGGCGCG AGTGGCCGAG
TATCCACAGT ATGCAATGGA TGGTCGCTGG GTGCTGGAAG GACATGGCGT AACGCCGGAT
ATTGAAGTCG ATAACTTACC TTTCGCCACC TTTAACGGTA AAGATGCACA GCTTGAGACT
GCGATAAGCT ATTTGAAAGA TGAGTTGGTT AAGCAGCCCG TTCCAGCCTT AAAAGCGCAA
CCTCTGCCTG CAAAAGGAAT GGCACAGGAT ATAAAAGCTA AGTAA
 
Protein sequence
MKLRHSVACC MLALGTFTAA QAIAADAVSN QGYYRAPALH DQTLVFTAEG DLWTQTLGQK 
AATRLTSLPA EELGATISAD GKWVAYVANY EGASEVYVIP VAGGVAKRVS FENSRVRVQG
WTPKGEILYS TDSGFGPANN WMLRSVDPKT FTTTDLPLAD AVEGVIDDRN EYVYFTRFGL
QVTGDNAKVY RGGAKGELWR FKLGSKTEAQ LLSGQHDGSV RQPMLWQDRL YFISDSDGND
NLWSMALDGS DAKQLTQFKD WQVRGAQMDQ GKVVFQQGAD IHVFDIAAAK DALVNIELTS
DFAQRREHWV KDPMDYASST DLALAGDKVV ITARSHVAIA GVDGSRLVQV ALPGTYRVRE
AILSQDGKSV YAISDMTGQQ EIWQFPADGS SGAKQLTKDG HTLRMSLNLS NDGRFIAHDD
NDGNVWLLDL KKNTNQKIIT NGEGLGPYAD IRWSGDSRFI ALTKSEIGKQ RPQIVLYSVD
ENKAETLTSD KYESYSPTFS SDGQWLYFLS NRQFSATPSS PWGDRNMGPV FDKRSQIFAI
ALVNNAKFPF SKPNELSVNQ ADKTDAKDKP NPVKIDWSGI SQRLWQVPID SGNYSELRAV
DGRLYVLDQA MGDDAEPSLM TIKFDPLSPK AEVFAEDIGH YAVSADGKKL MLRKFSNDKA
LMIVDGGDKL GDTDNAKVQT DQWQLAIYPQ LEWQQMFEDA WLMHRESFFD KKMRGLDWQA
TKAKYQPLLD RLTDRNELND IFMQMMGELD SLHSQVRGGD LPKDPDAAKA ASLGARLQQT
ADGVKIAHIY SNDPELPANA SPLNRIEVDA KEGDVLLAIN GTPVANVADV TRLLRNQQDK
QVLLQLKRGS QTHKTIVMPV SAMVDGQLRY LDWVSHNATV VEDASKGKIG YLHLYAMGGG
DIESFAREFY TNYDKDGLII DVRRNRGGNI DSWIIEKLLR RAWAFWQPTH GTPNTNMQQT
FRGHLVVLTD ELTYSDGETF SAGIKALGIA PLIGKQTAGA GVWLSGRNSL TDKGMARVAE
YPQYAMDGRW VLEGHGVTPD IEVDNLPFAT FNGKDAQLET AISYLKDELV KQPVPALKAQ
PLPAKGMAQD IKAK
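
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is an illustrative example, not the tool used to produce the record: the helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the codon table is built from the standard NCBI amino-acid string, which translation table 11 shares for internal codons (table 11 differs only in its permitted start codons). The functions accept the 10-base blocks as printed above, because whitespace is stripped first.

```python
from itertools import product

# Codon order follows the NCBI convention: first base varies slowest, T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G, T.
    """
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, as printed in this record.
print(translate("ATGAAACTTC GCCATTCTGT CGCCTGCTGC"))  # MKLRHSVACC
# Last 18 bases of the gene sequence, ending in the TAA stop codon.
print(translate("GATATAAAAGCTAAGTAA"))  # DIKAK
```

Passing the full gene sequence to `translate` and `gc_percent` allows the result to be compared against the listed protein sequence and GC content.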