Gene Information |
Locus tag | VC0395_A2820 |
Symbol | mshL |
ID | 5137651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 2969793 |
End bp | 2971472 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640534264 |
Product | MSHA biogenesis protein MshL |
Protein accession | YP_001218670 |
Protein GI | 147674773 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1450] Type II secretory pathway, component PulD |
TIGRFAM ID | [TIGR02519] pilus (MSHA type) biogenesis protein MshL |
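
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no residue; the variable names are illustrative only and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2969793
end_bp = 2971472
protein_length_aa = 559

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the last codon is assumed to be a stop
# codon, so it does not add an amino acid to the protein.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(gene_length_bp)        # 1680
print(expected_protein_aa)   # 559
```

Under those assumptions the computed values agree with the Gene Length (1680 bp) and Protein Length (559 aa) fields.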
|
|
Plasmid Coverage information |
Num covering plasmid clones | 49 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTAAAA TCGTACTCGC TTCAGTGGTG ACTTCTTTGG TGGGATGCTC AATGGGACAT CGTGATCCTG TTGAAGCTAA ACAAGCCCTG AACCAAGCTA TTAACGAGAC GAACAGTCGT CAAATTGACC AATTACCGCC TTCGGTAGAG GCTGATTTGA TGCCTGATAT GGATACTCTT ACTGCCAGTG AGCCGAAAAC TTTGCAGCGT TTTCGAATCC AAGCTGAGGA TGTCGAAGCC AAGGCCTTTT TTGCCAGTTT AGTGCAGGGA ACCGAGTACA GTGCGGCAAT CCACCCTGCG GTGACTGGGC GAATTACTCT CAACTTGACC GACGTTACCC TAGATGAAGC CTTAGGTGTC GTGCGTGATT TGTACGGTTT TGAGGTGGTA AAAGAGGGTA AAGTGATCCA AGTCTATCCG GCTGGATTGC GTACGGTCAC GATTCCGGTT GATTATCTGC AATTTAAACG CACAGGGCGT TCGTTAACGT CGATTACGAC GGGCACCATC ACCAATACGG ACACCAATAA CTCAAATTCA AGTAGTAGCT CCTCGTCCAG CATCAGTAGT AATAGCTCTT CGGATGGTTC TTCGAGCAAT TCTAATTCCA ACAGAAGCGA TGCTCGTGGC GGAACGGAAA TTGAAACCAC GAACGAGAGT GATTTCTGGC CTTTGTTAGA AAAGGCGGTG GCTCAGTTGC TTGGCGGTAG CGGTGGCCAA ACGGTCATTG TCAATCCACA GGCGGGAGTA TTAACCCTGC GCGCTTATCC CGATGAAATT CGTCAAGTAA ACGAGTTTCT GGGGATCTCG CAACAGCGAA TGCATCGACA AGTGATCCTC GAAGCTAAGA TTCTTGAAGT GACCCTCAGT GATGGTTACC AGCAGGGGAT TAATTGGAGT AAAGCCTTCT CCTCCAATGG TGCCAATTAC AAGATAGGTT CCGGATCCAT TACTCAAGAC AGTAATGGCA ATCCTATCAC TTCTGTATTA CCTGGCTTAG ATGCGATAGG TAATTTGTTA GGTGGTCAAT CCAATGTGGT GATCTCCAGT GGCAGCTTTG ATGCCGTGAT CAGTTTTATG GCGACGCAAG GTGATTTAAA TGTTCTGTCT AGCCCGCGAG TAACTGCGTC CAACAACCAG AAAGCGGTGA TCAAAGTCGG GACGGATGAA TACTATGTGA CCGACTTATC CAGTGTGGTT GGAACTGGGG ATAACGCGCA AGCGTCGCCA GATATTACGC TTACGCCTTT CTTCTCTGGG ATTTCATTGG ATGTCACGCC GCAAATTGAC GATCAAGGCA ACGTATTACT GCATGTGCAT CCTGCGGTGA TTGAAGTCGA GCAGCAAACC AAGAAAATTT TATACCGAAG TGAAGAGATT GAGCTGCCAT TGGCGAGAAG TTCAATTAGA GAGTCGGATT CGGTTATTCG GGCGAAAGAC GGCGATGTAG TGGTGATCGG TGGTTTGATG AAGTCAAATA CCGTTGACCA AGTGTCAAAA GTGCCATTTT TAGGCGATGT TCCCGCGTTA GGGCATCTGT TCCGTAACAC CACAAAACTG ACACAGAAAA CAGAACTGGT TATTTTGCTC AAGCCGACGG TCGTTGGGGT AAATACTTGG CAAAAAGAGC TGGAGCGCTC GCGCAGTTTG CTACAGGAAT GGTTCCCGGA TAGTCAATAA
|
Protein sequence | MRKIVLASVV TSLVGCSMGH RDPVEAKQAL NQAINETNSR QIDQLPPSVE ADLMPDMDTL TASEPKTLQR FRIQAEDVEA KAFFASLVQG TEYSAAIHPA VTGRITLNLT DVTLDEALGV VRDLYGFEVV KEGKVIQVYP AGLRTVTIPV DYLQFKRTGR SLTSITTGTI TNTDTNNSNS SSSSSSSISS NSSSDGSSSN SNSNRSDARG GTEIETTNES DFWPLLEKAV AQLLGGSGGQ TVIVNPQAGV LTLRAYPDEI RQVNEFLGIS QQRMHRQVIL EAKILEVTLS DGYQQGINWS KAFSSNGANY KIGSGSITQD SNGNPITSVL PGLDAIGNLL GGQSNVVISS GSFDAVISFM ATQGDLNVLS SPRVTASNNQ KAVIKVGTDE YYVTDLSSVV GTGDNAQASP DITLTPFFSG ISLDVTPQID DQGNVLLHVH PAVIEVEQQT KKILYRSEEI ELPLARSSIR ESDSVIRAKD GDVVVIGGLM KSNTVDQVSK VPFLGDVPAL GHLFRNTTKL TQKTELVILL KPTVVGVNTW QKELERSRSL LQEWFPDSQ
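
The relationship between the two sequences above can be illustrated by translating the gene sequence codon by codon. This is a minimal sketch, not the pipeline used to produce the record: the `translate` function and the constant names are hypothetical, and the amino acid assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons, and this sketch does not model that). Only the first and last 30 bases of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# '*' marks a stop codon. Table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in the record
    can be pasted directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence above (both are in frame).
gene_head = "ATGCGTAAAA TCGTACTCGC TTCAGTGGTG"
gene_tail = "CTACAGGAAT GGTTCCCGGA TAGTCAATAA"

print(translate(gene_head))  # MRKIVLASVV
print(translate(gene_tail))  # LQEWFPDSQ
```

Both outputs match the corresponding ends of the protein sequence in the record; the tail ends at the TAA stop codon, which is why the 1680 bp gene yields 559 residues rather than 560.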