Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A2824 |
Symbol | mshG |
ID | 5135595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 2975196 |
End bp | 2976419 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640534268 |
Product | MSHA biogenesis protein MshG |
Protein accession | YP_001218674 |
Protein GI | 147674016 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1459] Type II secretory pathway, component PulF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACGT TTTATTATCA GGGGCGTAAT GCCGATGGCA GCAAAGCCTC TGGGTTAGTC GAGGCTGCCA CTGAGGAATT AGCCGCAGAA ATGCTGCTCA ACAAAGGTAT TGTGCCCACT TCGATTGCGC AGGGGGCGGC GGAAAAAAGT GCCTTCGATT TTAACTGGAA AGCGCTACTG ACTCCCTCCG TGCCGCTGGA AGTGTTGGTG ATTTTTTGCC GACAAATGTT CAGCTTAACC AAAGCAGGGG TGCCTTTACT GCGCTCTATG CGCGGCTTAG CCCAGAACTG CCACAATAAG CAGCTCAAAG CAGCGCTTGA TTCAGTCTGT AATGAGCTGA CCAATGGCCG CAACTTGTCG GCTTCCATGC AGTTGCATCC CGCGATTTTT AGTCCTTTGT TTGTTTCCAT GATTCAAGTG GGAGAAAACA CAGGGCGATT AGATCAGGCT TTGTTGCAAT TGGCTGGCTA TTACGAACAA GAAGTGGAAA CGCGCAAAAG AATCAAAACG GCGATGCGCT ACCCGACCTT CGTGATTACG TTTGTGTTGT TGGCGATGTT TATTTTGAAC GTCAAAGTGA TCCCACAATT TACCAGCATG TTTAGCCGCT TTGGGGTCGA CTTACCCTTA CCAACGCGCA TTTTGATTAC CACGTCCGAT TTCTTTGTGA ACTACTGGGG CTTACTGCTT GGCATCATAG TCGGTTTATT GTTTGCGTTT CGGGCTTGGG TTAATACCAC GAATGGCCGC ATTCGGTGGG ATCATTTGCG TCTGCGTATG CCGATTGTGG GAGACATAGT GAATCGTGCG CAGCTCTCAC GTTTTGCGCG TACTTTTTCC TTGATGCTTT CGGCCGGCGT GCCGCTCAAC CAATCGCTAG CGCTGTCGGC AGAAGCGATA GACAACAAGT TTCTAGAGCA GCGTATTTTA GAGATGAAAA GCCAGATTGA ATCTGGGGTG GCGGTTTCTG CTACGGCGAT CAATGCCAAC ATTTTTACCC CTCTAGTGAT TCAGATGATG TCGGTAGGTG AAGAAACCGG GCGTATCGAT GAACTTCTGT TGGAAGTGTC CGATTTTTAT GATCGTGAAG TCGACTATGA TTTAAAAACA CTCACGGCAC GTATTGAACC TATTTTATTG GTGTTTGTCG CGGCCATGGT ACTGGTATTG GCGCTGGGCA TCTTCCTTCC TATGTGGGGC ATGATGGATG CACTCAAGGG CTGA
|
Protein sequence | MATFYYQGRN ADGSKASGLV EAATEELAAE MLLNKGIVPT SIAQGAAEKS AFDFNWKALL TPSVPLEVLV IFCRQMFSLT KAGVPLLRSM RGLAQNCHNK QLKAALDSVC NELTNGRNLS ASMQLHPAIF SPLFVSMIQV GENTGRLDQA LLQLAGYYEQ EVETRKRIKT AMRYPTFVIT FVLLAMFILN VKVIPQFTSM FSRFGVDLPL PTRILITTSD FFVNYWGLLL GIIVGLLFAF RAWVNTTNGR IRWDHLRLRM PIVGDIVNRA QLSRFARTFS LMLSAGVPLN QSLALSAEAI DNKFLEQRIL EMKSQIESGV AVSATAINAN IFTPLVIQMM SVGEETGRID ELLLEVSDFY DREVDYDLKT LTARIEPILL VFVAAMVLVL ALGIFLPMWG MMDALKG
|
| |