Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_0644 |
Symbol | |
ID | 7089775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | + |
Start bp | 769237 |
End bp | 772044 |
Gene Length | 2808 bp |
Protein Length | 935 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643459558 |
Product | peptidase M16 domain protein |
Protein accession | YP_002356588 |
Protein GI | 217971837 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.273767 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCAGT GGTTACTGCT GGCATTATTA AGTGTGGCGT TGCCGCTACA GGCGAGCGAA GAGCCGCTGT GGATGGGGAC GGCTGACTTG CCCATGAGTG GGCGAATCCA TACCGGTGAA CTTGCCAATG GCATGCGCTA TTTATTAGTG AGCAATAAGA CGCCAGAGCA GGCTGTCATA GTGCGCATGC GGGTGGACGT GGGCTCAGTG GTGGAGTCTG ATACCGAGCA GGGGTTAGTG CATTTTCTTG AGCACATGGC CTTTAATGGC TCGACGGGTT TGGCTGCGGG GGAGATGATC CCGACATTAC AGCGTCTTGG CCTGAGTTTT GGCGCCGATA CCAATGCTGT GACTGAGTTC CAGCAGACGG TTTATCAATT CAACTTGCCC AGTAATAGCC AAGATAAAGT CGATACCGCT TTGTTTTTAA TGCGAGAAAT TGGCAGTAAT CTATTACTCG ACCCAGCGCT AATTGAACGT GAAAAAGCTG TGGTATTGGC TGAACTGCGT GAGCGTAGCG GTGCGAATCT GGAGAATTAC CGCAATCAAT TACAGTTCTT GATGCCGCAA ACATTGCTGT CAAAGCGCTT ACCTGTGGGT GAGGCGAACA GCATCAAGAA TGCTACTCGC GAGACGCTGC TGTCTTTATA TCAGCGTTTT TATACGCCCT CACGTACTAC TTTAATTGTG GTCGGTGATA TCGAGGTGGC CGCAGTTGAA CAAAAGATAA AACAACAGTT TACCAGTTGG CAGGCGGCGC CTTTAGCGGC GAAGGTGAAG CCGCAAGCGA TTGGCACTGT CGCTGAGCGT CAACGCGTCG AGGCAGCGGC ATTTTTCGAT CCTAGCCTAT CGACCTCGGT TTCACTCGGC ATGCTCAAAC CTATGGCATA CCCTACCGAC AGCCCTGCCG TGCGCGAGCA GGAAATACTG CTCGAACTCG CCCACGGCAT CTTGTATCGC CGAATGGAGT CGCAGTTACT GCATAGCCAA GGTCTATCTG GCGTTAGCCT GCAGGTTGGG GAGCAATTTG ATCTCGCCTA TGGCACTCAA ATGAGCTTAG GTACGCAGGA GAATAACTGG CAAGAGGGCA TAGCAATATT GGAGCAAACC CTGCGCCAAG CACAAGAGTT TGGTTTTAGT CAGCAGGAAA TTGACCAACA AATCAAACGT ATGCACAAAG GATATCAGCT CAGCGCAGCG GGGAGCAGTA CCATTCACAG TGTGGATATT GCCGAGGGGT TAGTGTATTC AGTCGCGGAA AAGCGCGTGC CTGTAGAGCC TGAGTGGCAG CTAGCATTTT TTGAAAAAAT ACTGCCGACA GTGACGCCGC AAAAGCTAAA GCAAGTGTTT AATCAGACAT GGAATGCTAC GTCGTATTTG TACCTGACGA GCAATAAGCC CATCGAAAAT GTTGAAAAAC AGCTTATTGC TAGCTATGAG AAAAGCCGCA AACAAGTTGT GAGTGCGCCA GCAACTAAGG CGATTGACGA GTTTGCATAC ACTCAATTTG GCGATCAAGG CAAGTTAGTG GCCGATAGTC GCGATGCCGA AACGGGCATT CGTCAGTTGC AGTTTGCCAA CGGTGTGCGC CTTAATCTTA AACCTACTGA CTTTAATAAA GGCACTACTT TGGTCAGCCT CAACATAGGC TTTGGTGAAG TGCCATTCCC TGAGTTAGAT GGCTTATCTT ATCTGTTTAA CAGTGCGTTT GTGCAGGGTG GACTCAAGGC CCATGATTAT GAAAGCCTGC AGGAGATTTT TGCTGGGCAA GATATTTCCA TCAACCTAGG CGTGCGTGAG CAGAGTTTTG GCGGTGAGAT TAGTACCAAT GCCGCAGAAC TGCGCACGCA GCTTAGCTTA ATGACGGCCT TTTTGATTGA ACCTGGCATG AATAAGCAGG CCGAGCAATT GTTCCGCGAG CAAGTGATTG CCGAGCAGCA AAGTCTCCAT AGCAACCCGC AAACTGAGTT TTCTAATCAG TTTGACCGTA TCTCCCACAG TGGTGATAAA CGTTATGGTT ATGGGGAACC AGAAGAAATT TTAAAACGTC AGTTTGCAGA GCTGGCGCCG AGTTTCCATT CAGCCGTCGA GCAAGGGGCA ATTGAAATCG CCATAGTGGG TGATTTTGAC GAAGCCAGCG CCATCGCTGC AGTGGCTGAA ACTCTCGGGG CAATCAAACG CAGCCCCACT AAAAATAGCC AGTCGTTAGT GCCTATGTTC CCGAAAGTGC CTGCTAAGAT GACGTTAACA CACTATGGTC ATCCAGATTC AGCGGCGCTG GCCATGGTGT GGCCAACAAC GGATATGACT CACCTAAGCC AACACGCTGG ACTGGGATTG TTGGAGCAAG TGCTCAGCAT CCTATTAACG GAAAATGTGA GAGAAAAAGC GGGCGCGAGT TATTCGCCAT CGGCCTTTTC TTACAATGAT CTCAATGCCA GCGGTTATGG TTATCTCGGT CTATTTAGCG CGACGACTCA AGCCATGTTG CCAACGGTTT CCGAGTATTT CACCGCTGCA GTTAACCAAG TGAAGCAGCC GCAGGGGATT AGCGAGGACT TACTTAACCG AGCCCGCCAA CCTGTGCTTG AATGGATGCA AGCCGCGCCG CAGAGCAATG GTTTTTGGTT AGATTTGGCA TCAAATGCCC AGAGTTATCC TGGACGCTTT GCTGCTTTCA AACAAAGGCA AGTGTTGGCA CAGAAGATGA CACCGGCTGA GCTCAGTAAA CTGGCACAGC AATATTTGCC AGCTGATGGC AGTTTAACCA TCCAAGTCCT TCCTGCACCA CTTGAACCCC AGCAATGA
|
Protein sequence | MRQWLLLALL SVALPLQASE EPLWMGTADL PMSGRIHTGE LANGMRYLLV SNKTPEQAVI VRMRVDVGSV VESDTEQGLV HFLEHMAFNG STGLAAGEMI PTLQRLGLSF GADTNAVTEF QQTVYQFNLP SNSQDKVDTA LFLMREIGSN LLLDPALIER EKAVVLAELR ERSGANLENY RNQLQFLMPQ TLLSKRLPVG EANSIKNATR ETLLSLYQRF YTPSRTTLIV VGDIEVAAVE QKIKQQFTSW QAAPLAAKVK PQAIGTVAER QRVEAAAFFD PSLSTSVSLG MLKPMAYPTD SPAVREQEIL LELAHGILYR RMESQLLHSQ GLSGVSLQVG EQFDLAYGTQ MSLGTQENNW QEGIAILEQT LRQAQEFGFS QQEIDQQIKR MHKGYQLSAA GSSTIHSVDI AEGLVYSVAE KRVPVEPEWQ LAFFEKILPT VTPQKLKQVF NQTWNATSYL YLTSNKPIEN VEKQLIASYE KSRKQVVSAP ATKAIDEFAY TQFGDQGKLV ADSRDAETGI RQLQFANGVR LNLKPTDFNK GTTLVSLNIG FGEVPFPELD GLSYLFNSAF VQGGLKAHDY ESLQEIFAGQ DISINLGVRE QSFGGEISTN AAELRTQLSL MTAFLIEPGM NKQAEQLFRE QVIAEQQSLH SNPQTEFSNQ FDRISHSGDK RYGYGEPEEI LKRQFAELAP SFHSAVEQGA IEIAIVGDFD EASAIAAVAE TLGAIKRSPT KNSQSLVPMF PKVPAKMTLT HYGHPDSAAL AMVWPTTDMT HLSQHAGLGL LEQVLSILLT ENVREKAGAS YSPSAFSYND LNASGYGYLG LFSATTQAML PTVSEYFTAA VNQVKQPQGI SEDLLNRARQ PVLEWMQAAP QSNGFWLDLA SNAQSYPGRF AAFKQRQVLA QKMTPAELSK LAQQYLPADG SLTIQVLPAP LEPQQ
|
| |