Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A0444 |
Symbol | |
ID | 5135689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 473126 |
End bp | 474274 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640531902 |
Product | putative exopolysaccharide biosynthesis protein EpsF |
Protein accession | YP_001216395 |
Protein GI | 147673642 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 45 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAA TATTGCTTAT CATGCCACTC TCTACGCTCA ATTGGGGGGA GAAAAATCTT GGTGGTGTGG ATTCAGTATG CCAAATGTTG GTACGTCAAC TCGCTCAGCA AGAGCGTCCA TACCATTACC GAGTGTTAGC GTTTGATCCG CTCAACAATC ACTCTTATTC AGGCGAGATT ATCCAGTTGT CAAAACATCT GGAAGTGGTG ATTTGTCCGT TGCGGGAAAA GCGATTTGGG CTGCCTTTGC CTAGCTTATT ATCCAATTGG CTCCGCATCC AAGAGCAACT GAAAGACTAT CAGCCAGATT TGGTTCACTC TCATCTCAAC AGTTGGATGA TGGGACTTGG GCAGAAAACT CGTAATGTGT TGACCTTGCA TTCTTATCGC AAGATTGGGC GTAAGCCAGT TTCAAAACTC AATGATTTTG TTTATGAACA AATCATTCCA TGGGTAAGTC ATTTTTCCGT CGATTTTTAT ACTTGTGTTG GCGAAGAGTT GCGTCAAGCG TTATCTCTGG AGACGAATAA GTCAATTCAA GTGATTGGCA ATCCAGTCGA TCCTGACTAT TTCTCTGCCA ACTCTGCGAA CCAGAATCTG CCCCAGAATG AAGTCAATTT AGTCACTTGT GCATTGATTA CTCGACGCAA GCGTATTGAT CGAGCCATTG TATTACTGCG TGAGCTAAAA CAGCGAGGGC AAGCGGCTAC TTTACGTATT ATCGGACTCA ATATGGATTC TGCTTATTAC GCGCAATTGC AGCAATTGAT CAAAGAGTAT GAGCTTGAAC AAGATGTCAT TTTTCTTGGC AAACTTAATC AGCGTGAAAT TGTACAACAG TACCAACAAG CGAATATTGG AATATTTACG TCACAACAAG AAACGTTCGG CTTAGCGCCA TTGGAAATGA TGGCGGCGGG CTTGCCATTA ATTAGTACTC CTGTCGGGAT TTTAGGTGAA CGGCAAGCAA CGTTTGACCA GTTAGGGGTG GTTTTTATGC AAGAAGGGCA GGAAGCGATG ATTGCTGAGC GAATCAGTCA GATAAAAATA ACGGATACTC AAGCGATTCA AACCTATTTG CGCGATCAAT TCGCAGTAGA AAATGTGATT GAACATTACC AGAATCTATA TCGAGAGGTA CTGAGTTGA
|
Protein sequence | MSKILLIMPL STLNWGEKNL GGVDSVCQML VRQLAQQERP YHYRVLAFDP LNNHSYSGEI IQLSKHLEVV ICPLREKRFG LPLPSLLSNW LRIQEQLKDY QPDLVHSHLN SWMMGLGQKT RNVLTLHSYR KIGRKPVSKL NDFVYEQIIP WVSHFSVDFY TCVGEELRQA LSLETNKSIQ VIGNPVDPDY FSANSANQNL PQNEVNLVTC ALITRRKRID RAIVLLRELK QRGQAATLRI IGLNMDSAYY AQLQQLIKEY ELEQDVIFLG KLNQREIVQQ YQQANIGIFT SQQETFGLAP LEMMAAGLPL ISTPVGILGE RQATFDQLGV VFMQEGQEAM IAERISQIKI TDTQAIQTYL RDQFAVENVI EHYQNLYREV LS
|
| |