Gene VC0395_A0458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0458 
Symbol 
ID5135940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp489981 
End bp491378 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content45% 
IMG OID640531916 
Productputative capsular polysaccharide biosynthesis glycosyltransferase 
Protein accessionYP_001216409 
Protein GI147673705 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATGAAGG AAAAAAGCAG AATACGCATT ACTAACTATC ACGGTAAATT TTTCTACCGA 
TTAATTGATA GCATTTTTAT ATTAGTAATT ATGTATATAT CGATTAAGTC GAATCAGCAT
GTTGTTACCA TGTCTTATAT TTCGGTTGCC TTGCTAGGGG TGCTTTTTTA CTCCTACATT
GCAGAAAGTT TAGATATTTA CAGTGGTTGG CGAACAGCAA AGCTGCGTTC ACTTTCTTTA
CATACGGCAT TCTGTTGGGC TGCGACCATC GCTAGCTTGA CCCTATTGGC TTACTTCTCC
AAAACGGGGA TCGAGTTTTC TCGCATGGTG ATGGGCGCTT GGTTTGTCGG TTCTTTTATT
GGCTTGATTG GTTGGCGTGT ATTGGCATTT GCGACCATTC ACTACATGCA CAAAAAGGGT
CTGCATACCA AGAACGCGGT CATTATCGGT ATGACTACTC AAGGGCAAGA GTTATCAAAT
AATCTGCTCA AAAACCCTGA ACTAGGGATT GTGATGCAAG GTTTTTATGA TGACCGTGCG
CCTGAGCGCC TTAAAGAGGG GGCGCCTGTG CTGGGCAATA TCAATGATGC ATTAAGCCTT
GCGAAAACTG GGCAGGTGCA AAATGTCTAT ATTGCTTTGC CGATGCAAGC GCAGCGCCGT
ATCAACCAAA TTTTGGATGC GTTTTCGGAC AGTACGGTCA ATACCTACAT AGTGCCTGAC
TTTTTTACTT TTAATTTACT GCACTCACGT TGGTACACGA TTGGTGATGT CAACGCGTTT
AGCATTTTTG ATACCCCCTT TAATGGTTTG CTCAACTGGG TGAAACGCTT TGAAGATCTG
GTGCTGAGCA GCTTGATTTT GCTCTTGATC AGCCCAGTGT TATTGGCCGT CGCTATTGGC
GTTAAGCTGA GCTCGCCTGG CCCCATCATT TTTAAACAAA ACCGCTATGG TTTAGATGGC
AAGCCAATCC AAGTTTGGAA ATTTCGCAGT ATGCGAGTGA TGGATAATGG CAGTCATGTG
CAGCAAGCCA CCAAAGGCGA TCCACGAGTT ACACGATTTG GTGCGTTCAT TCGTCGAACA
TCGCTGGATG AGTTACCGCA GTTTTTCAAT GTGTTGCAAG GAAGTATGTC GATCGTGGGG
CCAAGGCCAC ATGCCGTTGC GCACAACGAG CAATATCGCA CCATAGTGAA TCGCTACATG
CTGCGTCACA AAGTGAAGCC CGGCATTACT GGATGGGCAC AGATCAATGG TTGGCGTGGT
GAAACCGATA CGCTCGACAA GATGGAGAAG CGAGTGCAGT TCGATCTGGA CTACATCCAT
CGCTGGTCGC TGTGGTTTGA TCTGAAAATC GTCTTCCTGA CTATTTTCAA AGGATTTGTT
GGAAAAAACG CGTATTAA
 
Protein sequence
MMKEKSRIRI TNYHGKFFYR LIDSIFILVI MYISIKSNQH VVTMSYISVA LLGVLFYSYI 
AESLDIYSGW RTAKLRSLSL HTAFCWAATI ASLTLLAYFS KTGIEFSRMV MGAWFVGSFI
GLIGWRVLAF ATIHYMHKKG LHTKNAVIIG MTTQGQELSN NLLKNPELGI VMQGFYDDRA
PERLKEGAPV LGNINDALSL AKTGQVQNVY IALPMQAQRR INQILDAFSD STVNTYIVPD
FFTFNLLHSR WYTIGDVNAF SIFDTPFNGL LNWVKRFEDL VLSSLILLLI SPVLLAVAIG
VKLSSPGPII FKQNRYGLDG KPIQVWKFRS MRVMDNGSHV QQATKGDPRV TRFGAFIRRT
SLDELPQFFN VLQGSMSIVG PRPHAVAHNE QYRTIVNRYM LRHKVKPGIT GWAQINGWRG
ETDTLDKMEK RVQFDLDYIH RWSLWFDLKI VFLTIFKGFV GKNAY