Gene VC0395_A2820 details


Gene Information

Locus tag: VC0395_A2820
Symbol: mshL
ID: 5137651
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 2969793
End bp: 2971472
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 48%
IMG OID: 640534264
Product: MSHA biogenesis protein MshL
Protein accession: YP_001218670
Protein GI: 147674773
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1450] Type II secretory pathway, component PulD
TIGRFAM ID: [TIGR02519] pilus (MSHA type) biogenesis protein MshL
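
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the record does not state either convention, and the function names are hypothetical, introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp / End bp fields of this record.
length_bp = gene_length(2969793, 2971472)
print(length_bp, protein_length(length_bp))
```

Under those assumptions the result agrees with the Gene Length (1680 bp) and Protein Length (559 aa) fields.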


Plasmid Coverage information

Num covering plasmid clones: 49
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTAAAA TCGTACTCGC TTCAGTGGTG ACTTCTTTGG TGGGATGCTC AATGGGACAT 
CGTGATCCTG TTGAAGCTAA ACAAGCCCTG AACCAAGCTA TTAACGAGAC GAACAGTCGT
CAAATTGACC AATTACCGCC TTCGGTAGAG GCTGATTTGA TGCCTGATAT GGATACTCTT
ACTGCCAGTG AGCCGAAAAC TTTGCAGCGT TTTCGAATCC AAGCTGAGGA TGTCGAAGCC
AAGGCCTTTT TTGCCAGTTT AGTGCAGGGA ACCGAGTACA GTGCGGCAAT CCACCCTGCG
GTGACTGGGC GAATTACTCT CAACTTGACC GACGTTACCC TAGATGAAGC CTTAGGTGTC
GTGCGTGATT TGTACGGTTT TGAGGTGGTA AAAGAGGGTA AAGTGATCCA AGTCTATCCG
GCTGGATTGC GTACGGTCAC GATTCCGGTT GATTATCTGC AATTTAAACG CACAGGGCGT
TCGTTAACGT CGATTACGAC GGGCACCATC ACCAATACGG ACACCAATAA CTCAAATTCA
AGTAGTAGCT CCTCGTCCAG CATCAGTAGT AATAGCTCTT CGGATGGTTC TTCGAGCAAT
TCTAATTCCA ACAGAAGCGA TGCTCGTGGC GGAACGGAAA TTGAAACCAC GAACGAGAGT
GATTTCTGGC CTTTGTTAGA AAAGGCGGTG GCTCAGTTGC TTGGCGGTAG CGGTGGCCAA
ACGGTCATTG TCAATCCACA GGCGGGAGTA TTAACCCTGC GCGCTTATCC CGATGAAATT
CGTCAAGTAA ACGAGTTTCT GGGGATCTCG CAACAGCGAA TGCATCGACA AGTGATCCTC
GAAGCTAAGA TTCTTGAAGT GACCCTCAGT GATGGTTACC AGCAGGGGAT TAATTGGAGT
AAAGCCTTCT CCTCCAATGG TGCCAATTAC AAGATAGGTT CCGGATCCAT TACTCAAGAC
AGTAATGGCA ATCCTATCAC TTCTGTATTA CCTGGCTTAG ATGCGATAGG TAATTTGTTA
GGTGGTCAAT CCAATGTGGT GATCTCCAGT GGCAGCTTTG ATGCCGTGAT CAGTTTTATG
GCGACGCAAG GTGATTTAAA TGTTCTGTCT AGCCCGCGAG TAACTGCGTC CAACAACCAG
AAAGCGGTGA TCAAAGTCGG GACGGATGAA TACTATGTGA CCGACTTATC CAGTGTGGTT
GGAACTGGGG ATAACGCGCA AGCGTCGCCA GATATTACGC TTACGCCTTT CTTCTCTGGG
ATTTCATTGG ATGTCACGCC GCAAATTGAC GATCAAGGCA ACGTATTACT GCATGTGCAT
CCTGCGGTGA TTGAAGTCGA GCAGCAAACC AAGAAAATTT TATACCGAAG TGAAGAGATT
GAGCTGCCAT TGGCGAGAAG TTCAATTAGA GAGTCGGATT CGGTTATTCG GGCGAAAGAC
GGCGATGTAG TGGTGATCGG TGGTTTGATG AAGTCAAATA CCGTTGACCA AGTGTCAAAA
GTGCCATTTT TAGGCGATGT TCCCGCGTTA GGGCATCTGT TCCGTAACAC CACAAAACTG
ACACAGAAAA CAGAACTGGT TATTTTGCTC AAGCCGACGG TCGTTGGGGT AAATACTTGG
CAAAAAGAGC TGGAGCGCTC GCGCAGTTTG CTACAGGAAT GGTTCCCGGA TAGTCAATAA
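
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be checked against the fields in the record. A minimal sketch, assuming the sequence has been pasted into a string; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Example using only the first two blocks of the gene sequence.
first_blocks = "ATGCGTAAAA TCGTACTCGC"
print(len(clean_sequence(first_blocks)), gc_fraction(first_blocks))
```

Applied to the full sequence, `len(clean_sequence(...))` can be compared with the Gene Length field and `gc_fraction(...)` with the GC content field.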
 
Protein sequence
MRKIVLASVV TSLVGCSMGH RDPVEAKQAL NQAINETNSR QIDQLPPSVE ADLMPDMDTL 
TASEPKTLQR FRIQAEDVEA KAFFASLVQG TEYSAAIHPA VTGRITLNLT DVTLDEALGV
VRDLYGFEVV KEGKVIQVYP AGLRTVTIPV DYLQFKRTGR SLTSITTGTI TNTDTNNSNS
SSSSSSSISS NSSSDGSSSN SNSNRSDARG GTEIETTNES DFWPLLEKAV AQLLGGSGGQ
TVIVNPQAGV LTLRAYPDEI RQVNEFLGIS QQRMHRQVIL EAKILEVTLS DGYQQGINWS
KAFSSNGANY KIGSGSITQD SNGNPITSVL PGLDAIGNLL GGQSNVVISS GSFDAVISFM
ATQGDLNVLS SPRVTASNNQ KAVIKVGTDE YYVTDLSSVV GTGDNAQASP DITLTPFFSG
ISLDVTPQID DQGNVLLHVH PAVIEVEQQT KKILYRSEEI ELPLARSSIR ESDSVIRAKD
GDVVVIGGLM KSNTVDQVSK VPFLGDVPAL GHLFRNTTKL TQKTELVILL KPTVVGVNTW
QKELERSRSL LQEWFPDSQ
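
The protein sequence can be reproduced from the gene sequence by translating it with translation table 11 (the bacterial, archaeal, and plant plastid code). The sketch below is a minimal illustration, not the pipeline the database used: it relies on the fact that table 11 assigns the same amino acids to codons as the standard code, it does not handle alternative start codons (this gene starts with ATG, so none is needed here), and `translate` is a hypothetical helper name.

```python
# Codon table in NCBI order: first, second, and third base each cycle
# through T, C, A, G. The amino-acid string is the one listed for table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(raw_dna: str) -> str:
    """Translate a block-formatted DNA string up to the first stop codon."""
    seq = "".join(raw_dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First seven codons of the gene sequence above.
print(translate("ATGCGTAAAA TCGTACTCGC T"))  # MRKIVLA
```

The printed residues match the first seven residues of the protein sequence (MRKIVLA); translating the full gene sequence and comparing it with the space-stripped protein sequence is a quick consistency check on the record.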