Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A2817 |
Symbol | mshI |
ID | 5136897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 2967382 |
End bp | 2968821 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640534261 |
Product | MSHA biogenesis protein MshI |
Protein accession | YP_001218667 |
Protein GI | 147674439 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3166] Tfp pilus assembly protein PilN |
TIGRFAM ID | [TIGR01709] general secretion pathway protein L |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 53 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAC CGAGTTGGAT AGAGAAACTG ATTGCCCCTA AGGTTGCCTC GCAACAGTTA TATGTTGTGG TGCAGCCAGA GCACCTGCAC TTCACATCCG ATGATTTATC GCCTATTCCT CCCCAGCCGT TACAGCAACA AAGTTGGCAA GCGGTATTGG TGCAAACGCT GCAAAAGCAC GCTGTCCATG ATGTACAAAT CCACCTTGTA CTGCATTCAC AGCTTTATCA AACTTATCAG ATTGAACAGC CGAGTATTCC GCGTGAAGAG TGGTCGGCAG CCTTGCCTTT CTTGCTCAAA GATATGTTGA GTGAGAAAGT GACGGATGTA GTGGCGGATG CTCACCCTCT TCCCGGCAGT GGTAAAGTAC AAGCCTATGT GATCAGCAAG CGTACTATTC TTGAGTTGCA AAGCATGGCG GTGTCGGCGG GATTAACGCT AGGACGAGTG ATCCCCGAGC AAGCGATTTG GGGATTGGTG GGCGGAGAAT TGAGCCACTT CCTGTTGCTG CATCGCAGCA TGGGTGGCAG CTTTAAGCTG GATGCTTTCG TTGATCGTCA GTGCAGTTTT CAACGCACTT TACGTGGGAT CACTGCGCCT GTGACTGATA ACGCAGCCAG TGCCCTTCAG TTAGATAGCT TGGCGTTAGA GCTACAGCGC TCGATTGATT ACTTATCCGC CCAGTTAAAG GGCGGCTCTT TACAACAGCT AAAAGTGTGT TGTGATGGTG AAGATCAACA GGCTTTGATC ACCGGACTCA ATGAGCGCTT AAGTGTGCGA GCGTCAGGAC TGGATGGTGA AGCGACCATC TGTGGTGAAC AACTGGCACG TTATGCGCGC AATATCCCGC AAGAAACCAT CAATTTCTAT CAAGATCACC TCAAGCCGAA GCGTGAAAAG TTCACCTTAA CCAATCTCTT GTTAGCGTGG TTGGCCTTGA GCGTTGTGTT ATTGCTTGGG TACGCAGGGG TGGGTTATCA AAACTGGGTG ATCCAACAGC AGTGGCAAGA GCAGCAACAA CATAATCAAT CGTTAACAGA ACAAGCGGCT CACTTACGTC AGCAGGTGGC GGTTCATCTT CCTTCGCCCG CTAAACAGGC GGCGATAGGG CGCATAAAGC AAGAGATCTC TAGCAAACAG CAAGCATTAG ACGCGATTGG GCAGTTTGAT GTGGCTCAGC AAACGGGCTA TTCCGGCGTA TTGAACTCTT TGGCTCAATT GGCGCGTAGC GATATCTCTT TAAGCAGTAT TACTTTGGAT TCCTCGCAAT TAAATGTGCA GGGACTCGCT CGTGATCCTG CCGCGATTCC AAACTGGATC AGTCAATTTA AACAAGAACT GCATCTGATG GGCAGAAGCT TTGAGCAACT GAAAATTGGC CGTAATGATC AAGACATGAT CACCTTTGAA CTCAACACTC AGCGAGGAGA ACAAAGATGA
|
Protein sequence | MKKPSWIEKL IAPKVASQQL YVVVQPEHLH FTSDDLSPIP PQPLQQQSWQ AVLVQTLQKH AVHDVQIHLV LHSQLYQTYQ IEQPSIPREE WSAALPFLLK DMLSEKVTDV VADAHPLPGS GKVQAYVISK RTILELQSMA VSAGLTLGRV IPEQAIWGLV GGELSHFLLL HRSMGGSFKL DAFVDRQCSF QRTLRGITAP VTDNAASALQ LDSLALELQR SIDYLSAQLK GGSLQQLKVC CDGEDQQALI TGLNERLSVR ASGLDGEATI CGEQLARYAR NIPQETINFY QDHLKPKREK FTLTNLLLAW LALSVVLLLG YAGVGYQNWV IQQQWQEQQQ HNQSLTEQAA HLRQQVAVHL PSPAKQAAIG RIKQEISSKQ QALDAIGQFD VAQQTGYSGV LNSLAQLARS DISLSSITLD SSQLNVQGLA RDPAAIPNWI SQFKQELHLM GRSFEQLKIG RNDQDMITFE LNTQRGEQR
|
| |