Gene VC0395_A0444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0444 
Symbol 
ID5135689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp473126 
End bp474274 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content43% 
IMG OID640531902 
Productputative exopolysaccharide biosynthesis protein EpsF 
Protein accessionYP_001216395 
Protein GI147673642 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAA TATTGCTTAT CATGCCACTC TCTACGCTCA ATTGGGGGGA GAAAAATCTT 
GGTGGTGTGG ATTCAGTATG CCAAATGTTG GTACGTCAAC TCGCTCAGCA AGAGCGTCCA
TACCATTACC GAGTGTTAGC GTTTGATCCG CTCAACAATC ACTCTTATTC AGGCGAGATT
ATCCAGTTGT CAAAACATCT GGAAGTGGTG ATTTGTCCGT TGCGGGAAAA GCGATTTGGG
CTGCCTTTGC CTAGCTTATT ATCCAATTGG CTCCGCATCC AAGAGCAACT GAAAGACTAT
CAGCCAGATT TGGTTCACTC TCATCTCAAC AGTTGGATGA TGGGACTTGG GCAGAAAACT
CGTAATGTGT TGACCTTGCA TTCTTATCGC AAGATTGGGC GTAAGCCAGT TTCAAAACTC
AATGATTTTG TTTATGAACA AATCATTCCA TGGGTAAGTC ATTTTTCCGT CGATTTTTAT
ACTTGTGTTG GCGAAGAGTT GCGTCAAGCG TTATCTCTGG AGACGAATAA GTCAATTCAA
GTGATTGGCA ATCCAGTCGA TCCTGACTAT TTCTCTGCCA ACTCTGCGAA CCAGAATCTG
CCCCAGAATG AAGTCAATTT AGTCACTTGT GCATTGATTA CTCGACGCAA GCGTATTGAT
CGAGCCATTG TATTACTGCG TGAGCTAAAA CAGCGAGGGC AAGCGGCTAC TTTACGTATT
ATCGGACTCA ATATGGATTC TGCTTATTAC GCGCAATTGC AGCAATTGAT CAAAGAGTAT
GAGCTTGAAC AAGATGTCAT TTTTCTTGGC AAACTTAATC AGCGTGAAAT TGTACAACAG
TACCAACAAG CGAATATTGG AATATTTACG TCACAACAAG AAACGTTCGG CTTAGCGCCA
TTGGAAATGA TGGCGGCGGG CTTGCCATTA ATTAGTACTC CTGTCGGGAT TTTAGGTGAA
CGGCAAGCAA CGTTTGACCA GTTAGGGGTG GTTTTTATGC AAGAAGGGCA GGAAGCGATG
ATTGCTGAGC GAATCAGTCA GATAAAAATA ACGGATACTC AAGCGATTCA AACCTATTTG
CGCGATCAAT TCGCAGTAGA AAATGTGATT GAACATTACC AGAATCTATA TCGAGAGGTA
CTGAGTTGA
 
Protein sequence
MSKILLIMPL STLNWGEKNL GGVDSVCQML VRQLAQQERP YHYRVLAFDP LNNHSYSGEI 
IQLSKHLEVV ICPLREKRFG LPLPSLLSNW LRIQEQLKDY QPDLVHSHLN SWMMGLGQKT
RNVLTLHSYR KIGRKPVSKL NDFVYEQIIP WVSHFSVDFY TCVGEELRQA LSLETNKSIQ
VIGNPVDPDY FSANSANQNL PQNEVNLVTC ALITRRKRID RAIVLLRELK QRGQAATLRI
IGLNMDSAYY AQLQQLIKEY ELEQDVIFLG KLNQREIVQQ YQQANIGIFT SQQETFGLAP
LEMMAAGLPL ISTPVGILGE RQATFDQLGV VFMQEGQEAM IAERISQIKI TDTQAIQTYL
RDQFAVENVI EHYQNLYREV LS