Gene Information |
Locus tag | SO_3411 |
Symbol | |
ID | 1171086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella oneidensis MR-1 |
Kingdom | Bacteria |
Replicon accession | NC_004347 |
Strand | - |
Start bp | 3547932 |
End bp | 3551216 |
Gene Length | 3285 bp |
Protein Length | 1094 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637345212 |
Product | protease, putative |
Protein accession | NP_718964 |
Protein GI | 24374921 |
COG category | [S] Function unknown |
COG ID | [COG4946] Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system |
TIGRFAM ID | |
|
|
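The coordinate and length fields in the table above are consistent with each other, which can be checked with a short calculation. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3547932
END_BP = 3551216
PROTEIN_LENGTH_AA = 1094


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 3285, matching "Gene Length"
print(expected_protein_length(length_bp))  # 1094, matching "Protein Length"
```

Under these assumptions, 3285 bp corresponds to 1095 codons, of which 1094 encode amino acids.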
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTTC GTCACTCAGT TGCCTGCTGC ATGCTCGCCC TCGGCACTTT GTCTGCAGCT CAAGTGTTTG CAGCACAATC GACTAGCAAT CAAGGTTATT ACCGTGCTCC GGCTTTACAT GACCAGACCT TAGTGTTTAC GGCTGAAGGG GATCTTTGGA CGCAAACCCT CGGCCAAAAG GCGGCGACGC GGTTGACCAC CTTACCAGCT GAAGAACTCG GCGCGGCAAT TTCTGCCGAT GGTAAATGGG TGGCCTATGT GGCGAATTAC GAAGGTGCGA GTGAGGTTTA TGTTATTCCC GTGACTGGCG GTGTCGCAAA ACGGGTGAGT TTTGAGAATA GCCGAGTGCG AGTGCAAGGG TGGACCTCTA AGGGAGAAGT GCTTTATTCC ACCGACGCTG GTTTTGGCCC CGCGAATAAT TGGATGTTGC GTCTTGTTAA TCCCGAGAGT TTAACCACAA CCGATCTGCC CTTGGCCGAT GCGGTGGAAG GGGTGATCGA TGCCAATCAT CAGTATGTTT ATTTTACCCG TTTCGGCCTG CAGGTGACGG GTGACAATGC GAAGGTGTAT CGCGGCGGGG CTAAGGGCGA ATTATGGCGC TTTAAACTGG GTGGTAAAGA CGAGGCGCAG TTACTCAGTG GACAGCACCA AGGCTCTGTC CGCCAGCCGA TGCTATGGCA GGACCGACTC TATTTTATTA GCGATAGCGA TGGTAACGAC AATCTCTGGT CAATGGCCCT TGATGGTAGT GATGCTAAAC AGCTAACCCA ATATAAAGAT TGGCAAATGC GCGGGGCGCG CATGGACCAA GGTAACGTTG TCTTTCAACT AGGGGCGGAT ATTCATGTTT TTGATATCGC CTCTGCTAAG GATTCATTAT TAAACATTGA ATTAATTTCA GATTTTGCCC AGCGCCGCGA ACATTGGGTA AAAGATCCTA TGGATTATGC TACCTCCGCT AATCTTGCCC TTGCTGGCGA TAAAGTCGTT ATTACTGCCC GTAGCCATGT GGCTATTACT GGGATTGATG GCTCGCGGTT AGTGCAAGTG GCGCTTCCTG GTACTTACAG AGTACGAAAT GCCATCCTGA GTCAGGATGG CAAATCGGTT TATGCCATCA GCGATATGAC TGGCCAGCAG GAAATTTGGC AATTTCCTGC TGATGGAACC AGTGGTGCTA AACAGTTGAC CAAGGATGGC CATACCTTAA GAATGTCGCT GAGTTTATCT AACGATGGTC GCTATCTTGC CCATGACGAT AACGACGGTA GTGTGTGGTT ATTGGATCTA AAGAAAAATA CCAATCAAAA AATCATTAGT AATGGTGAAG GCCTTGGTCC CTATGCCGAT ATCCGTTGGT CCGGCGACAG CCGTTTTATC GCACTAACTA AATCTGAAAT CGGTAAGCAA AGGTCGCAAG TCGTGCTTTA TTCGGTGGAT GAAAATAAGG CGCAAGCGCT CACCAGCGAT AAATATGAAT CCTATTCGCC GACCTTTAGC AGCGATGGCC AGTGGTTATA TTTCCTTTCT AATCGCCAGT TTACCGCAAC CCCAAGCTCG CCTTGGGGGG ATCGCAATAT GGGGCCCGTT TTTGATAAAC GTAGCCAGAT TTTTGCGATT TCTTTAGTGA AAAATGCCAA GTTCCCCTTC AGCAAACCAA CTGAGTTGAC GGCAAAAACG GCTGAAAAAG CCGAATCGAA AGATAAACCG ACTCCCGTAA AAATTGATTG GGCGGGGATC AGTGAGCGTC TATGGCAAGT CCCAGTCGAT TCAGGCAATT ATAGCCAGCT ATCTGCTATT 
GAAGATCGTT TGTATGTACT TGACCAAGCT ATTGGTGACG AAAGCCAACC TAACCTCATG ACGGTTAAGT TTACCGAGCA GCGTCCTAAA GCTGAGGTGT TTGCTGAAGA TGTGGCTAAC TACAGTGTCT CAGCTGACGG TAAGAAGTTG TTACTGCGCA AAAAAAGCAA TGAAAAGTCG CTGATAATTG TTGATGTGGG CGATAAGTTG GGTGATACCG ACAATGCCAA GGTACAGACT GACCAGTGGC AATTAGTAAT TTCGCCAACC TTAGAGTGGC AACAAATGTT TGAAGATGCA TGGTTAATGC ATCGAGATTC TTTCTTTGAC AAAAATATGC GCGGTCTCGA TTGGCTGGCG ACTAAGGCCA AGTATCAACC ACTGCTCGAT CGCTTAACTG ACCGTAATGA ATTAAACGAC ATCTTTATGC AGATGATGGG CGAGTTAGAT TCACTGCACT CGCAAGTGCG TGGCGGTGAT TTGCCTAAAG ATCCTGATGC TGCGAAAGGG GCGAGTTTAG GCGCGCGGCT GCAACAAACC AGCGATGGGG TGAAAATTGC CCATATTTAT CGTAATGATC CCGAACTGCC GAGCCAAGCT TCACCCCTAA GCCGTATCGA AGTCGATGCC AAAAAGGGCG ATCAGTTACT TGCCATTAAT GGCACGCCTG TGACCAATGT TGCCGACGTG ACGCGTTTAT TACGTAATCA GCAGGATAAA CAGGTTCTGC TAGAGCTTAA GCGCGGCGGG CAAAGCCATA AAACTGTGGT AATGCCAGTT AGCACTATGG TCGATAGTCA GTTACGTTAT TTAGATTGGG TCAACCATAA TGCCAGCGTT GTGACTGAGG CGAGTAAGGG CAAGATTGGT TATCTGCACT TATATGCCAT GGGCGGCGGC GATATTGAGA GCTTCGCCCG TGAGTTTTAC ACCAACTATG ACAAAGACGG TTTGATTATC GACGTACGTC GTAACCGCGG CGGTAATATT GATAGCTGGA TCATCGAAAA ATTATTACGC CGAGCTTGGG CTTTCTGGCA GCCAACCCAT GGCACGCCCA ATACCAATAT GCAGCAAACC TTCCGTGGGC ATTTAGTGGT GTTAACGGAC GAGCTGACCT ACTCCGATGG CGAAACTTTC TCTGCGGGGA TCAAGGCACT AGGAATTGCA CCGCTGATCG GTAAACAAAC TGCCGGGGCG GGTGTATGGC TATCGGGTCG TAATACCTTA ACCGATAAAG GGATGGCGCG AGTTGCTGAA TATCCACAAT ATGCCATGGA TGGACGCTGG ATTCTTGAGG GACATGGTGT TACGCCGGAT ATTGAGGTCG ATAACTTACC CTTTGCCACT TTTAATGGCC AAGATGCACA GCTTGAAACC GCCATTAGTT ATCTGAAGGA TGAGTTAATA AAGCAGCCGA TTCCTGCATT GAAGGCACAA CCAATGCCAG CTAAAGGTAT GGCAGAAGAT ATAAAAGCTA AGTAG
|
Protein sequence | MKLRHSVACC MLALGTLSAA QVFAAQSTSN QGYYRAPALH DQTLVFTAEG DLWTQTLGQK AATRLTTLPA EELGAAISAD GKWVAYVANY EGASEVYVIP VTGGVAKRVS FENSRVRVQG WTSKGEVLYS TDAGFGPANN WMLRLVNPES LTTTDLPLAD AVEGVIDANH QYVYFTRFGL QVTGDNAKVY RGGAKGELWR FKLGGKDEAQ LLSGQHQGSV RQPMLWQDRL YFISDSDGND NLWSMALDGS DAKQLTQYKD WQMRGARMDQ GNVVFQLGAD IHVFDIASAK DSLLNIELIS DFAQRREHWV KDPMDYATSA NLALAGDKVV ITARSHVAIT GIDGSRLVQV ALPGTYRVRN AILSQDGKSV YAISDMTGQQ EIWQFPADGT SGAKQLTKDG HTLRMSLSLS NDGRYLAHDD NDGSVWLLDL KKNTNQKIIS NGEGLGPYAD IRWSGDSRFI ALTKSEIGKQ RSQVVLYSVD ENKAQALTSD KYESYSPTFS SDGQWLYFLS NRQFTATPSS PWGDRNMGPV FDKRSQIFAI SLVKNAKFPF SKPTELTAKT AEKAESKDKP TPVKIDWAGI SERLWQVPVD SGNYSQLSAI EDRLYVLDQA IGDESQPNLM TVKFTEQRPK AEVFAEDVAN YSVSADGKKL LLRKKSNEKS LIIVDVGDKL GDTDNAKVQT DQWQLVISPT LEWQQMFEDA WLMHRDSFFD KNMRGLDWLA TKAKYQPLLD RLTDRNELND IFMQMMGELD SLHSQVRGGD LPKDPDAAKG ASLGARLQQT SDGVKIAHIY RNDPELPSQA SPLSRIEVDA KKGDQLLAIN GTPVTNVADV TRLLRNQQDK QVLLELKRGG QSHKTVVMPV STMVDSQLRY LDWVNHNASV VTEASKGKIG YLHLYAMGGG DIESFAREFY TNYDKDGLII DVRRNRGGNI DSWIIEKLLR RAWAFWQPTH GTPNTNMQQT FRGHLVVLTD ELTYSDGETF SAGIKALGIA PLIGKQTAGA GVWLSGRNTL TDKGMARVAE YPQYAMDGRW ILEGHGVTPD IEVDNLPFAT FNGQDAQLET AISYLKDELI KQPIPALKAQ PMPAKGMAED IKAK
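The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the pipeline used to produce this record: it builds the standard codon-to-amino-acid assignments (which translation table 11 shares with the standard code; table 11 differs in its permitted start codons) and translates the first and last blocks of the gene sequence as printed. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (first, second, third base cycling T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence above.
print(translate("ATGAAACTTC GTCACTCAGT TGCCTGCTGC"))  # MKLRHSVACC
print(translate("ATAAAAGCTA AGTAG"))                  # IKAK
```

Both outputs agree with the start and end of the listed protein sequence. The sketch raises `KeyError` on ambiguous bases such as `N`, which do not occur in the excerpts used here.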