Gene VC0395_A0906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0906 
SymbolmdoH 
ID5137385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp920845 
End bp923010 
Gene Length2166 bp 
Protein Length721 aa 
Translation table11 
GC content51% 
IMG OID640532364 
Productglucosyltransferase MdoH 
Protein accessionYP_001216852 
Protein GI147675546 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2943] Membrane glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00127362 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAACC CAATGGTTGA ACAAGGAGCC TCCCAGTTAA TGGGAGGCTC TGCCATGCCA 
CCAGAACAAC ACGGCGAAAT GCCGGAACAA AATCTTAAAC GACTGAGTGA AGGTTTTCCG
CGTGACGCCA TTCAAACGGG TGGGGTTAAA TCCTGCTCGT GGCGACGTGT GTTTGTGGTT
GGTTTTGCTT TACTCATTTC TGCTTTTGCC ATCTTTGAAA TGCGCGGCGT ATTTTTGGTC
GGTGGACTAA CGCCTATCGA ATACGCGGTA CTGGTTCTCT TTGCTATCAA CTTCTGCTGG
ATTGCCTTAG CCTTCTCAAG CTCGATTGCC GGATTTTTTG TTCTCGCGAG TCGCAAACCT
GCCCCCAATA CAGAACAACC CCTAACGACA CGTACCGCAA TTTTGATGCC GACTTACAAC
GAGGCACCCG ATCGAGTCTT TGCCGCAGTA GAAACCATGG CTTTGGCTCT GGCGAAAACT
GAACACGGAC ATGCCTTCGA TTGGTTTATT CTCAGTGACA CCACCGATCC TGAAGTGGCG
TTATCTGAAG AGCAAGCCTT CTGGTTGCTG CGTCAACAAA CCGCAGGTAA AGCTAACGTT
TATTACCGCC GCCGTCGTAA AAACATTGCG CGCAAAGCGG GGAATATTGC CGATTTCTGC
CGTCGTTGGG GCTCGGGGTA CGACCACTTA TTAGTATTGG ATGCCGATAG TGTGATGCAG
CCAAGTACTA TGATTTCATT GGCTCAACGG ATGCAAAGTG ACCCAGATGC TGGATTGATT
CAAACCATTC CGGCGCTGAT CAATGGCACG ACGCTCATGG CTCGTGTGCA GCAGTTTGCC
GCTCGCATCT ATGGTCCAGT GGTCGGTACT GGTCTGGCTT GGTGGGTACA AAAGGAAGGT
AACTTCTGGG GTCACAACGC GATTATTCGT ACGGAAGCCT TTATGAGTGC CGCTGGCCTA
CCGCATTTAT CTGGTAGACC GCCTTTTGGT GGACATATCC TCAGTCACGA TTTCGTGGAA
GCTGCTTTAA TTCGTCGCGC AGGTTGGAGT GTTACCATCG CCGCGGATCT GAGTGGCTCT
TTTGAAGAGT GCCCTCCTTC GATTATCGAT TTAGCCGTGC GCGATCGCCG TTGGTGTCAA
GGCAACCTGC AACACAGCCG CATCATAGGC ACCAAAGGTC TGCACTGGAT CAGTCGTTTG
CACCTGACTA CGGGCATTAT GTCCTACCTC TCTTCGCCAT TTTGGCTGCT GTTGATTTTG
TCGGGTTTGT TACTGGCACT GCAAGCGCAC TTTATTCGTC CTGAGTATTT TACCGAGCAG
TTTTCACTGT TCCCGACTTG GCCGGTCATG GACTCCGCAC GCGCATTGCA ACTATTTTAC
ATCACCATGG GTATTCTATT TAGCCCGAAA ATTTTCGGAT TACTGCTGCT GATGTTTGAT
GGTGAAATGT GTCGTACCTT AGGGGGACGA CTGAGAGTCA TACTCAGCGC GGTGACAGAG
ATCCTACTGT CTGCACTGGT CGCTCCCATC ATGATGCTCA TCCACTGTGG CGCGGTTGTG
TCGATTTTGT TTGGACGTGA TAGTGGTTGG GCTCCTCAAC GACGTGATGA CGGTAGCTTG
CCAATCAAAG ACTTGCTGTA CCGCCACCGT TGGCATATGA CAGCGGGAGT ACTCCTTGGT
TATGCAGCGA TGCTCGACTC ATGGACCCTA CTGGCGTGGA TGTCTCCAGC CTTAATCGGT
TTGTGGTTCT CTGTACCACT CTCTGGGATC ACGGCGTCAT ACACCATCGG AGCTTGGTTT
AAACAAAAAC GCATTCTTGC GACTCCAGAA GAAATTGAAA CACCGGCTAT TGTGCTTGCT
GCGCAAGCTC GTCGTGACGA GTATGTGGTG GATCTTCAGG AAGTGTGGAA CGCACGCATG
GTACTTGCTG ATCACAACCT GATCGCTCTA CATATTGCGA TGATGGATAA ACTGCCGTCG
CGCCAACCCG GCACCGCGAT TGAGCCTTTG GATGCGGTAG CACGCATTAA AGTACAGGAA
GCGGAGAGTC AGGAAAGCCT ATTGGCGCTA CTGACCAAAG TCGAACTGAG TTATGTTCTT
GGCAATCCCC TGCTGATCCA GCAGGTAGCC AAACTGCCGC CCAGCTTAGC GAACCAGACT
GTCTAA
 
Protein sequence
MTNPMVEQGA SQLMGGSAMP PEQHGEMPEQ NLKRLSEGFP RDAIQTGGVK SCSWRRVFVV 
GFALLISAFA IFEMRGVFLV GGLTPIEYAV LVLFAINFCW IALAFSSSIA GFFVLASRKP
APNTEQPLTT RTAILMPTYN EAPDRVFAAV ETMALALAKT EHGHAFDWFI LSDTTDPEVA
LSEEQAFWLL RQQTAGKANV YYRRRRKNIA RKAGNIADFC RRWGSGYDHL LVLDADSVMQ
PSTMISLAQR MQSDPDAGLI QTIPALINGT TLMARVQQFA ARIYGPVVGT GLAWWVQKEG
NFWGHNAIIR TEAFMSAAGL PHLSGRPPFG GHILSHDFVE AALIRRAGWS VTIAADLSGS
FEECPPSIID LAVRDRRWCQ GNLQHSRIIG TKGLHWISRL HLTTGIMSYL SSPFWLLLIL
SGLLLALQAH FIRPEYFTEQ FSLFPTWPVM DSARALQLFY ITMGILFSPK IFGLLLLMFD
GEMCRTLGGR LRVILSAVTE ILLSALVAPI MMLIHCGAVV SILFGRDSGW APQRRDDGSL
PIKDLLYRHR WHMTAGVLLG YAAMLDSWTL LAWMSPALIG LWFSVPLSGI TASYTIGAWF
KQKRILATPE EIETPAIVLA AQARRDEYVV DLQEVWNARM VLADHNLIAL HIAMMDKLPS
RQPGTAIEPL DAVARIKVQE AESQESLLAL LTKVELSYVL GNPLLIQQVA KLPPSLANQT
V