Gene VC0395_A2824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A2824 
SymbolmshG 
ID5135595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp2975196 
End bp2976419 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content49% 
IMG OID640534268 
ProductMSHA biogenesis protein MshG 
Protein accessionYP_001218674 
Protein GI147674016 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGACGT TTTATTATCA GGGGCGTAAT GCCGATGGCA GCAAAGCCTC TGGGTTAGTC 
GAGGCTGCCA CTGAGGAATT AGCCGCAGAA ATGCTGCTCA ACAAAGGTAT TGTGCCCACT
TCGATTGCGC AGGGGGCGGC GGAAAAAAGT GCCTTCGATT TTAACTGGAA AGCGCTACTG
ACTCCCTCCG TGCCGCTGGA AGTGTTGGTG ATTTTTTGCC GACAAATGTT CAGCTTAACC
AAAGCAGGGG TGCCTTTACT GCGCTCTATG CGCGGCTTAG CCCAGAACTG CCACAATAAG
CAGCTCAAAG CAGCGCTTGA TTCAGTCTGT AATGAGCTGA CCAATGGCCG CAACTTGTCG
GCTTCCATGC AGTTGCATCC CGCGATTTTT AGTCCTTTGT TTGTTTCCAT GATTCAAGTG
GGAGAAAACA CAGGGCGATT AGATCAGGCT TTGTTGCAAT TGGCTGGCTA TTACGAACAA
GAAGTGGAAA CGCGCAAAAG AATCAAAACG GCGATGCGCT ACCCGACCTT CGTGATTACG
TTTGTGTTGT TGGCGATGTT TATTTTGAAC GTCAAAGTGA TCCCACAATT TACCAGCATG
TTTAGCCGCT TTGGGGTCGA CTTACCCTTA CCAACGCGCA TTTTGATTAC CACGTCCGAT
TTCTTTGTGA ACTACTGGGG CTTACTGCTT GGCATCATAG TCGGTTTATT GTTTGCGTTT
CGGGCTTGGG TTAATACCAC GAATGGCCGC ATTCGGTGGG ATCATTTGCG TCTGCGTATG
CCGATTGTGG GAGACATAGT GAATCGTGCG CAGCTCTCAC GTTTTGCGCG TACTTTTTCC
TTGATGCTTT CGGCCGGCGT GCCGCTCAAC CAATCGCTAG CGCTGTCGGC AGAAGCGATA
GACAACAAGT TTCTAGAGCA GCGTATTTTA GAGATGAAAA GCCAGATTGA ATCTGGGGTG
GCGGTTTCTG CTACGGCGAT CAATGCCAAC ATTTTTACCC CTCTAGTGAT TCAGATGATG
TCGGTAGGTG AAGAAACCGG GCGTATCGAT GAACTTCTGT TGGAAGTGTC CGATTTTTAT
GATCGTGAAG TCGACTATGA TTTAAAAACA CTCACGGCAC GTATTGAACC TATTTTATTG
GTGTTTGTCG CGGCCATGGT ACTGGTATTG GCGCTGGGCA TCTTCCTTCC TATGTGGGGC
ATGATGGATG CACTCAAGGG CTGA
 
Protein sequence
MATFYYQGRN ADGSKASGLV EAATEELAAE MLLNKGIVPT SIAQGAAEKS AFDFNWKALL 
TPSVPLEVLV IFCRQMFSLT KAGVPLLRSM RGLAQNCHNK QLKAALDSVC NELTNGRNLS
ASMQLHPAIF SPLFVSMIQV GENTGRLDQA LLQLAGYYEQ EVETRKRIKT AMRYPTFVIT
FVLLAMFILN VKVIPQFTSM FSRFGVDLPL PTRILITTSD FFVNYWGLLL GIIVGLLFAF
RAWVNTTNGR IRWDHLRLRM PIVGDIVNRA QLSRFARTFS LMLSAGVPLN QSLALSAEAI
DNKFLEQRIL EMKSQIESGV AVSATAINAN IFTPLVIQMM SVGEETGRID ELLLEVSDFY
DREVDYDLKT LTARIEPILL VFVAAMVLVL ALGIFLPMWG MMDALKG