Gene VC0395_0301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_0301 
Symbol 
ID5134133 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009456 
Strand
Start bp334908 
End bp335933 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content48% 
IMG OID640530624 
Producthypothetical protein 
Protein accessionYP_001215142 
Protein GI147672292 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGTGT CGAGTGCGTT TTTTCAATCT TGTCTTCCTA TCGCTCGCAA TGTCTTTTTA 
GAGATAGTTG TAATTATAAA GACCTGCCAC ACTATAACCC AAGGAACAAA TTGCGACGAG
GCGCACATGA TCTATCTTCT GCCATTTTTT ACTGTGCTGA TTTGGGGTGG CAACTCGATT
GTCAATAAAC TCGCGGCCTC GACCATCGAA CCTAGTGCGA TGAGCTTTTA TCGCTGGCTA
TTGGCTATGG CGATCCTCAC TCCTTTCTGT CTACCTAGCG CTATCCGCCA ATGGTCAACG
GTGAAACGCC ACTTGAGTAA ACTCGCATTT TTGGCTTTAC TCGGCATGGT GCTTAACCAA
TCCCTAGGCT ACTACGCGGG GCTCACCACC ACGGCGACCA ATATGTCACT CATCACTTCT
TTTGTTCCTT TAATGAGTGT TTTCATTAGC TTGCCGTTAC TTAACAAACC CATCTCAGCC
CTTAGCGTGG TGGGTGGCGT ACTGTCGCTG AGCGGGCTTG CCTACATGCT CGGAGAAGGA
AATCCGCTGT TTTTCCTCCA TCAAAGCGTG ACCGAAGGGG ATGCCTTAAT GGTGATGGCA
GCACTGGTGT ACGCCTTGTA CTGCGTGCTT TTGAAACGCT GGAAAATGCC GTTTAGCAAC
TGGACTTTAA TTTATCTGCA AGGGGTATGC GCCGTATTCA TGCTGATCCC TTTATGGCTC
ACCAGCGATA CGCTATTACC GACCGAAGGT TCACTCTCTC TGATCGCTTA TGCTGGCATT
GCTGCTTCTC TATTAGCACC TTGGATGTGG GTAAAAGCCA TTGATGCGAT TGGCGCAGAC
TCCACCGCCA TGTTTATGAA TCTGCTACCC GTTTTTTCTG TTTCTTTGGC CGCCACTTTA
TTGGGCGAAA AAGTTCATCC CTACCATTTA ATTGGTGATC TATTAGTCAT CAGCGGTGTC
GCGCTGTCAC AATTGAAAAT TCAACGTCGA AACGATGACG GCGTCGAAAA AGTGACGCAG
GTATAA
 
Protein sequence
MQVSSAFFQS CLPIARNVFL EIVVIIKTCH TITQGTNCDE AHMIYLLPFF TVLIWGGNSI 
VNKLAASTIE PSAMSFYRWL LAMAILTPFC LPSAIRQWST VKRHLSKLAF LALLGMVLNQ
SLGYYAGLTT TATNMSLITS FVPLMSVFIS LPLLNKPISA LSVVGGVLSL SGLAYMLGEG
NPLFFLHQSV TEGDALMVMA ALVYALYCVL LKRWKMPFSN WTLIYLQGVC AVFMLIPLWL
TSDTLLPTEG SLSLIAYAGI AASLLAPWMW VKAIDAIGAD STAMFMNLLP VFSVSLAATL
LGEKVHPYHL IGDLLVISGV ALSQLKIQRR NDDGVEKVTQ V