Gene VC0395_A0157 details


Gene Information

Locus tag: VC0395_A0157
Symbol:
ID: 5136002
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 157072
End bp: 158172
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 49%
IMG OID: 640531617
Product: hypothetical protein
Protein accession: YP_001216122
Protein GI: 147674168
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0845] Membrane-fusion protein
TIGRFAM ID: [TIGR01730] RND family efflux transporter, MFP subunit
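
The coordinates and lengths listed in this record are mutually consistent. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon (the record does not state either convention explicitly). The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 157072
end_bp = 158172

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, gene_length % 3, protein_length)  # prints: 1101 0 366
```

Under these assumptions the result matches the reported gene length of 1101 bp and protein length of 366 aa.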


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTCCGC TCAAAACTTC TTCACGTCAA GCGCGCGACT ACTTTCTGCA TCAACCTTGG 
CTGATTTCTT TTTCCTTCAT TATTCTGTTA GCACTCTGGT TAGGCTTAGG CGATTCCAAC
GCCGAAGAGC CGCAAACGGC GATGGAAATA CAAGTCCCTT TAGCCAAAGT CGGTTATCAA
ACGTTTAGCG CAACTCAAAC CGATAAAGTT ATCGAGCTGT ACGGACGCAC TGCGCCGGAT
CGGCAGGCTA AAATTGGTGC TGAAATTGCT GGGCGGATCG CCGAGGTGAA GATTGCGAAA
GGGCAAATGG TCACCAAAAA CCAGATAATC GCCTTGATTG ATAAAGGCGA TCTGGATGCC
CAGTTAGAGC GAGCGAAAGC GCAGCTCAAA GTGCGACAAC AAGAGTTTAA TGCGGCTAGT
GCACTGAAAA ATAAAGGCTT ACAAGGCGAA GTTGCTTTTA CCAATGCTGC TGCTGCTCTC
ACAGATGCAC AATCCTCGCT CAGTACTGTG CAACGTTTAC TGGATAATAC CCAAGTTCGC
GCCCCCTTTG ATGGTGTCGT GGAGACTTTA CCTATCGAAA AAGGCGATTT TGTTGGGATT
GGCGATCCGG TTGCCTCCAT TATCGATCTG CACAAATTGG TGATAGAGGC TGATGTCAGT
GAGCGACACA TTCAACATGT ACAGCTTGAA CAAGCGGCGC GGATCCGCTT TATCGATGGC
ACGCAAACAC AAGGTAAAGT CCGCTATATT TCAAGGCTCT CATCGCCTGC AACCAATACC
TTTGCCATTG AAGTTGAGGT CGATAACCCA CAACAAGCCA TTCCTGCAGG GGTAAGCGCA
GAAGTTGAGT TGAGCCTAAT GTCTCAAGCC GCGATCAAAA TCACCCCCGC CATGCTCGCG
CTCGATGAAG AGGGCAACCT TGGCGTGAAA ACGTTGCAAG GTGAACACGT CAAGTTTGTG
CCAATCCAAC TGGTGAAAGC AGAACAAGAT GGCGTCTGGT TAACCGGACT TGGCGAGCAA
GTGGATATTA TTACCCGTGG ACAAGGATTT GTTCGCGATG GAGATAAGGT TCTCGCCACT
CAACTCAGCG CAACCCACTA G
 
Protein sequence
MFPLKTSSRQ ARDYFLHQPW LISFSFIILL ALWLGLGDSN AEEPQTAMEI QVPLAKVGYQ 
TFSATQTDKV IELYGRTAPD RQAKIGAEIA GRIAEVKIAK GQMVTKNQII ALIDKGDLDA
QLERAKAQLK VRQQEFNAAS ALKNKGLQGE VAFTNAAAAL TDAQSSLSTV QRLLDNTQVR
APFDGVVETL PIEKGDFVGI GDPVASIIDL HKLVIEADVS ERHIQHVQLE QAARIRFIDG
TQTQGKVRYI SRLSSPATNT FAIEVEVDNP QQAIPAGVSA EVELSLMSQA AIKITPAMLA
LDEEGNLGVK TLQGEHVKFV PIQLVKAEQD GVWLTGLGEQ VDIITRGQGF VRDGDKVLAT
QLSATH
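
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is illustrative and not the database's own method: the helper names (`clean`, `translate`, `gc_content`) are hypothetical, and the codon table uses the standard amino-acid assignments, which translation table 11 shares for internal codons. Table 11 also allows alternative start codons, but that does not matter here because the gene begins with ATG. Only the first and last sequence fragments are used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order; '*' marks a stop codon.
# Assumption: standard amino-acid assignments (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna):
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 21 bases of the gene sequence, as listed in the record.
first_fragment = "ATGTTTCCGC TCAAAACTTC TTCACGTCAA"
last_fragment = "CAACTCAGCG CAACCCACTA G"

print(translate(first_fragment))  # MFPLKTSSRQ
print(translate(last_fragment))   # QLSATH
```

Both fragments reproduce the corresponding ends of the listed protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the reported GC content of 49%.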