Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal_0917 |
Symbol | |
ID | 4844783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS155 |
Kingdom | Bacteria |
Replicon accession | NC_009052 |
Strand | - |
Start bp | 1051851 |
End bp | 1055063 |
Gene Length | 3213 bp |
Protein Length | 1070 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640118136 |
Product | hypothetical protein |
Protein accession | YP_001049311 |
Protein GI | 126173162 |
COG category | [S] Function unknown |
COG ID | [COG5373] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTTAA AGGATGACGT CGCCCAACTA AAAGCTGAGC TGGCACAATT ACAGTCGCTG CATTTATCGC AGCAATCTTC GCTCAGCCGT CAATTGGCAG AATTTTCAAC TAAGCTCGAT ACCTTAAGTC AGCAAATTGC GACTGAAGAC GCCTCTGATA CCAGTCTTAG TATGGCGGCA GATAGTATGA CAGCGGGTGC TGCATCGATT GCGGCAGTGG TCCCCGCCGC CGATAACGCG CCCACATTAA CCTATGCGAT ACACACACCA ATCCTTGAGT CCACTCCCGT AGAGCCAGTT CCTGTAGAAC CCAGCCCATG GCAGCAAAAC GCTGTGCAAG GAGACCCTTG GCAAAGAAAC ACAAAAAATA CCTCAGCAGA ACAAGTCGCT AAAACCGAAT ACCAGGCGCA GGGCCAACAA CTGAGTGATG AAGTCAAATT ACAGGCAAGC GTGCAAGTGG CGAGTCAATT TGATGATCTG CTATCCCAAG GATTAGCGGC CATCATGGCG CCCTTTGGCG CAATTACCGA ACAAATCAAA TCCTTCTACC ATCATTATCA AGCTAAGGGA TTAGGCCCGG TCTTTTTGAT GACAGTTGCA GGGATCATCA CCCTGACCTT AGGCTTTGGT TATTTACTGC AATATTCCAT CAACCATTGG TTCTCTGAAC TGGGCAAAGC CCTACTCGGG TTTGCCAGTG CGAACGCTAT CATAGCCGGC GGGATTTTTA TTCGCCAAAA ACGCGCGGGC ATGGCGGACT TTGGCTCAGG GATAGTCGGA TTAGGCCTTA TCCTAAACTT TCTCTGCGCT TACTTTATTG GGCCATATTT TGAAATCATC CCCAATAGCG CGAGTTTTAT CTTACTGTTA TTGATCACTC TGGCGGGTTA TGGCCTGTCA ATGCGCTTAG ATGCCAAAGT CATTGCCGTC ATCGCCTTAG TCGGCGGCTC GACCGCGCCT ATGATGCTAT TATCCCAAAG CTACGCGCCG CTACTGTATC TTCCTTATTT ACTGCTTATT GGTGCCGGCG CCCTTGCCCA GAGTCGTAAG TTAAAGTGGC CGCTATTAGT GGAAATCACC GCCCTGCTGC ACATTGGCTG CATCGAAGCC TTTAGTTATT TTGTGCCATT GCCACTCAGC GACTTTGGCG GCGGCAGTCT GCTGGCGCTT ATCAGTATTA ATGCGACCTT TTATCTCTAC GGCATAACCG GCCTAATATT TCCCCATCAA TTTACCCATC AAGATAAACA ACAAAGCAAT ACGCTGAGCC ATCGCATACT CGCCCTGCCA ATAGCATTGC TCGCCTTTGT GCTATTCGAA TTAACGCAAT TTACTGAGTT CGCAGGGGAA ATCTTTGCGG TCAACGCGCT GATCTGTGCA GCCCTCTATT GGCAGCTAAA AAGCCGCTTA GAAAAAGCCC GCAGCGGCTT ATTGTTAGTC TTTGCAGGAA GTTTTGCCGG ATTCGCGGCG CTGTATTTAC TTAGCCATGA TTTCCTCGGC TTAGTCTTAC TGCTCGAAGC CTTATTACTG TTATGGATTG GCACCAAAGA AGAACTAATT TCAGTACGCG CCGAAGCCTA TGTATTACTG CTGATGGGAT TAGGGCTAAA TGCGTTTAGC GTGCTAGATA GCATGGCACT GTTAGAATCC AGCATGTTAG ATGCCTTAGC GATTTCACCT TTATCGGCCT TTGGCTTTTC GCTGATAGCG CTGGCTTTAA GTTGCGCCGC CTTAGTCTTT GCCATTCGAC TATTAACATT CACAAATGCG CCGCTTTCTG CGTTAGAACA TCAGCTTTGC AGAATACTAA AAGAACTATT AAGCGGTTTT TATGTGGCGA CAATTATCCT CGCGGCTTAC CTTTTGAGCA GCGATTACTA CCTCGCCATT TTGCCACTGG TCAGCCTATT ACTGCTGTAT TTAAGCGCAA AGGACAAACT CGTTATCAGC GAATTTGCCG CTTGGATCTT GCTATTACCG CTGCTCTTTA AGGTCGTTGA AGGCATAACA CTCGCGGACA GTTTTAGCTT TAGCGCCCAG CCGCTCATGG CAAAACTGGC TCGGATTGAG TTATTTACCG CCTTACTCTT GGCCCATTAT TGGTATCGCC GCCACTATAA AGATGCCCTA TTTGCTAAAG CCGCTTACAG CGTGCAAATC TGCTGTTACT TGATGCTGCC GCTTATTTTA CTGCCCAAAG TTATCCGCAA CTATTGGGAA TATACCGCTA TCGCCCTTTG GCTGAGCACC TTTATGAGCC TAGGTTTAGC CTATTTTGTT AAGCATAAGA GTTTGAACAT AGAGGCCAAA ATCCTCACTT GGCTAGCGGT TATGATGACC GCCTCGCTAT GTCTTATTCA CGTTTGGCAA GGACTCGCGG CGCTGGTTAT TGGCGCCCTG TTTATGGGCT TCACCCTGCT TCGTTATCGC CAATTACCCG AGACATGGCG CCCCTTGCTG CAGCTACAAT GGCAACTGAG TCCTTATTAC TTTGCACTGG TATTGGCCGT GATCGTTTAT GGCTTTAATC ATTCTGAGCT CGTTGGAATA GCGATGACAG CGTTAGCCTT AAGTGGTTAT TTTGCATTGC TTATCCAAAA AAGTTTAAGC AAAAGAGCTG AAGGCCAAGC CTCACATAGC ATGAAGTTAA CCCTAGTTGC CGAAATCCAA GCCGCAATAA AGGAAAGCTA CCATCTCGCC TATGGCCTCA CTTTAGGCTT AGCCCTGTTG CCCATCATGT TGCATTTTGA GATCACGCTT GGGCTTAATC GTGATAATGC CTCGTTTGTG TTAATCGAGT TTTTATCACT AGCACTACTA GCAAGGCTTA TCTTGCAGCA CGGCGCAGCA ATACGCTTGC ATAGACGTAT ATTGCCATTA CAAGGACTTA AGTGGGGTTG GCATCTGTTA CTCGCCTTGA GTTACTTCAT GTGGAGTTAT AGTTTTGATA GCATGATTGC CGCCCCACTC AGCGCGATTT TATTGGTTAT CCATGGCAGT GTGTTGATGT TTATCAGCTT AAAACCACAA AATGCCGATA TGATCCGCCT CGCCGCTGGG CTATTTATCC TATCGACGCT CAAGGTGCTA TTACTGGATA TGGCGTCCTT CGAGCTAGTG CAAAAGGTCA TTGCCTTTAT GCTGATCGGG GTTATTTTGC TTACCGTTTC TTACTTCTAC CAGAAGGCGA GAAATCGATT GCAGCAAGAT TAA
|
Protein sequence | MPLKDDVAQL KAELAQLQSL HLSQQSSLSR QLAEFSTKLD TLSQQIATED ASDTSLSMAA DSMTAGAASI AAVVPAADNA PTLTYAIHTP ILESTPVEPV PVEPSPWQQN AVQGDPWQRN TKNTSAEQVA KTEYQAQGQQ LSDEVKLQAS VQVASQFDDL LSQGLAAIMA PFGAITEQIK SFYHHYQAKG LGPVFLMTVA GIITLTLGFG YLLQYSINHW FSELGKALLG FASANAIIAG GIFIRQKRAG MADFGSGIVG LGLILNFLCA YFIGPYFEII PNSASFILLL LITLAGYGLS MRLDAKVIAV IALVGGSTAP MMLLSQSYAP LLYLPYLLLI GAGALAQSRK LKWPLLVEIT ALLHIGCIEA FSYFVPLPLS DFGGGSLLAL ISINATFYLY GITGLIFPHQ FTHQDKQQSN TLSHRILALP IALLAFVLFE LTQFTEFAGE IFAVNALICA ALYWQLKSRL EKARSGLLLV FAGSFAGFAA LYLLSHDFLG LVLLLEALLL LWIGTKEELI SVRAEAYVLL LMGLGLNAFS VLDSMALLES SMLDALAISP LSAFGFSLIA LALSCAALVF AIRLLTFTNA PLSALEHQLC RILKELLSGF YVATIILAAY LLSSDYYLAI LPLVSLLLLY LSAKDKLVIS EFAAWILLLP LLFKVVEGIT LADSFSFSAQ PLMAKLARIE LFTALLLAHY WYRRHYKDAL FAKAAYSVQI CCYLMLPLIL LPKVIRNYWE YTAIALWLST FMSLGLAYFV KHKSLNIEAK ILTWLAVMMT ASLCLIHVWQ GLAALVIGAL FMGFTLLRYR QLPETWRPLL QLQWQLSPYY FALVLAVIVY GFNHSELVGI AMTALALSGY FALLIQKSLS KRAEGQASHS MKLTLVAEIQ AAIKESYHLA YGLTLGLALL PIMLHFEITL GLNRDNASFV LIEFLSLALL ARLILQHGAA IRLHRRILPL QGLKWGWHLL LALSYFMWSY SFDSMIAAPL SAILLVIHGS VLMFISLKPQ NADMIRLAAG LFILSTLKVL LLDMASFELV QKVIAFMLIG VILLTVSYFY QKARNRLQQD
|
| |