Gene Information |
Locus tag | Sama_0456 |
Symbol | |
ID | 4602711 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella amazonensis SB2B |
Kingdom | Bacteria |
Replicon accession | NC_008700 |
Strand | + |
Start bp | 562057 |
End bp | 563256 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639779792 |
Product | MSHA biogenesis protein MshN, putative |
Protein accession | YP_926336 |
Protein GI | 119773596 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
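The length fields above follow from the coordinates. As a minimal sketch, assuming Start bp and End bp are 1-based and inclusive (an assumption that is consistent with the listed 1200 bp) and that the unspliced CDS ends in a single stop codon, the arithmetic can be checked as follows. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(562057, 563256)
print(length_bp)                  # 1200
print(protein_length(length_bp))  # 399
```

Both values agree with the Gene Length and Protein Length rows of this record.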
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCGTCA TCAATCAAAT GCTGAAAGAT TTGGATAAGC GTGCCGAGCC GCATCAGCTG CAGCAGCTGC CATCCACTGT GGCCATTCCA GTGAACAAAT CTCCCGGGCT GCCCTGGCGT TGGCTTATCC TGCTGTCGCT GCTCCTTGTT GCCATCGCGC TGGTGCTTTG GAACCTGAAA GCATCCAATG CCAAAGACGC TATGGTATTA AGCCCTATGG TTTCAGGACA GCAAGCTGAG TTAAACACAG CAAGTGCTGC GCCGGTAAAA ATGGAACCTG CGAGCAATGC TCAGGTGTCA CATGTTCAGG AGCCCGCAGA TGCCAGCAGC GACACAGCGT CGGGCTTGGC GAAAGAGCCC GCGGGCGAGT CAGATAAATC AATCAAAGAC ACGACGGGGA ATGGCGAGTC TGAAACCTTA GCCGAGTCAG TGGCAGTATC ATTCGCAGGA TCAGCAAACG CGCCTGCCCC GTCAGAAACG TCAGTTGCAG GCGAGGCAAC TGTACTGGCT TTGCAAGTCA ATACCTCGAG CGAGCCGCCG GTGATTGACT CTTCCAGCGT CGCCATGAAT ACCAATACGC CATCCGCAGG CAGCATGGCT GTCACCGAAG TGGTGCTGCC CCCGGCAGAG CAAGCCGATC GCGCCATGCT AAAGGCCAAT GGAGCTAGGG ATGCAGGCAA GCTGGATGAG GCGATGCGCC AATACGCCAT GGCGCTTTCT TATGAGCCCG CGCGCCATGA AGCAAGACGC CAATTGGCGG CGCTTCACTA TGGTCAGGGG CAAGCAGGCG AAGCCATCAA ACTGCTCGAG CGTGGGTTGG TGCAATTCCC TGAGCAGTCG TCTTTTGCAC TACTTCTGGG CCGTTTGTGG CGGGAACAGG GCAATAAATC GCAGGCGCTG GCAGCCCTTG ATGTCATTGG GGATACAGAT TCTCTTTCTC GGGATAAGTG GCTATTGGTG GCCGATATTG CCCGTGAACA AGACGATCAC GCGCTGGCCG AAGCCGCCTA CCAAAAGTTG TTGGGCACCG GTATGGAAAA AGCACAATGG TGGTTGGGGC TTGCCTACGC ACAGGATGCA CAGGGCAAGA TGGCAGATGC CCGTTATCAC TATCAGCGTG CCCTTGGCAC TGCCGGGTTA TCATCCGACG CCCGCGCCTA CATAGAAAAC AGATTGATGC AGTTGGGAGA TAACCAATGA
Protein sequence | MSVINQMLKD LDKRAEPHQL QQLPSTVAIP VNKSPGLPWR WLILLSLLLV AIALVLWNLK ASNAKDAMVL SPMVSGQQAE LNTASAAPVK MEPASNAQVS HVQEPADASS DTASGLAKEP AGESDKSIKD TTGNGESETL AESVAVSFAG SANAPAPSET SVAGEATVLA LQVNTSSEPP VIDSSSVAMN TNTPSAGSMA VTEVVLPPAE QADRAMLKAN GARDAGKLDE AMRQYAMALS YEPARHEARR QLAALHYGQG QAGEAIKLLE RGLVQFPEQS SFALLLGRLW REQGNKSQAL AALDVIGDTD SLSRDKWLLV ADIAREQDDH ALAEAAYQKL LGTGMEKAQW WLGLAYAQDA QGKMADARYH YQRALGTAGL SSDARAYIEN RLMQLGDNQ
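The two sequences above can be cross-checked against each other and against the GC content field. The following is a rough sketch, not the database's own method: it strips the 10-character grouping spaces, computes the GC fraction, and translates with the amino acid assignments of translation table 11, which match those of the standard code. Alternative start codons allowed by table 11 are not handled; this gene starts with ATG, so that simplification does not matter here. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove grouping whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three groups of the gene sequence in this record.
prefix = "ATGAGCGTCA TCAATCAAAT GCTGAAAGAT"
print(translate(prefix))  # MSVINQMLKD
```

Passing the full Gene sequence value to `translate` should give a string that can be compared with the Protein sequence value once its spaces are removed, and `gc_fraction` on the same input can be compared with the rounded GC content field.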