Gene VC0395_0610 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_0610 
Symbol 
ID5134610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009456 
Strand
Start bp665959 
End bp667254 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content47% 
IMG OID640530932 
Productsugar transporter family protein 
Protein accessionYP_001215449 
Protein GI147671938 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value0.861991 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACCA AAAAACACAT GAGCGAAGTT CCTATGATTC AAAGAGTAGC CGCCTATCTG 
GCGATCTTGG TTGGTTACTT TTTCTATTGT TATAACTTTG TAATTATTGA CTACGTACGC
CCTTACATCG TTGAGGCTTA TGAGGGCATT AGCCTTTCAG ATACTGCTCA ATTCTATACA
TGGCAATCCG TTGGGGCGCT GATTGGCGCA TTAAGTTGCG CTTGGTTTGC TGGCCGCTTT
GGCAAGAAAT ACACCTTGAT CACGATTACT GCACTCAACG GCGGTGCAAC CATAGTGAAC
ATGATGTTTA CTGACTACGC GACATGGGCA GCGATGCGTT TCATCATAGG TTTGTCACTG
GGCGGCTACT TTACCGTCGC TGTCAGCTTA ATGATTGGCC TCTTTACCCC AACGGTTCGC
GGCAAACTGA CCGCATTTGC CTCATCCATG TTCTCTGTAG CTTTAATGGT AATGGGGGCA
TACGCGGCCT TTATCTCTAG CATAGATGCC CCCTGGGAAA GTTTGATGTG GGTTGGTGGA
ATTCCGCCTT TAGCTGCCGC ATTTGCCATG GTATTTGTCT TACCTAGCGA TAAAAACGTC
ATCGCTTACG GCGAAGAAGA TTCCTCAGCT AATACAGGGC AAAACACACC TGCGAAAAAA
GGCTCTTGGG GTGAAATGCT CAGCAAACCT TATCGGCTCC TTACCATCAC CTGTTTGCTT
CTAGCTGGCC TTAATTTTTA CGGTTTTCAA TTCTTTAGCG GTTTCGTCAC CACGTATTTG
AAAGAAGTTC GTCAGTTTGA TGGTGCAACA ATCGGTGTGA TCTTCTCAAT CTCCGCTTTT
GGTTCTCTAT TTGGAGCTTG GGTTTGGGGC GCGGTCGCCG ACAAATTTGG ACGTAAAGTG
AATGCGTTTG GCTTCATTCT TGCGGGCATC ATGGCTTCTA TCTTTTTCAT CGCACCGAGC
GATCTGATGA TTGGCAGCCT CAATATGCTG GCAATCTTGG GCTTAATTTA TAACTTTGGC
CTCTCGTCTT CAGCGGTTTG GGGTGGCTAC TTCTCAGAAT TATTCCCTGC TCATTTGCGC
AGTTATGGTG CTGCACTCTT CCACGGCGGG CGAATTATTG GAATGTGGGC ACCTATGGTT
CTCATTTTTA TCAAAGAGCG CACCGACTTA CAGACTGCAA TGTGGGGCTC ACCGATTGTG
TGGATAGTGG CTGGCTTATT ATGGCTATCG CTTCCAGAAA CATTAAAAGG CGGTTTATTC
GATAAACGTA AGAGCAACCA ACCGGCTAAC GCGTAA
 
Protein sequence
MTTKKHMSEV PMIQRVAAYL AILVGYFFYC YNFVIIDYVR PYIVEAYEGI SLSDTAQFYT 
WQSVGALIGA LSCAWFAGRF GKKYTLITIT ALNGGATIVN MMFTDYATWA AMRFIIGLSL
GGYFTVAVSL MIGLFTPTVR GKLTAFASSM FSVALMVMGA YAAFISSIDA PWESLMWVGG
IPPLAAAFAM VFVLPSDKNV IAYGEEDSSA NTGQNTPAKK GSWGEMLSKP YRLLTITCLL
LAGLNFYGFQ FFSGFVTTYL KEVRQFDGAT IGVIFSISAF GSLFGAWVWG AVADKFGRKV
NAFGFILAGI MASIFFIAPS DLMIGSLNML AILGLIYNFG LSSSAVWGGY FSELFPAHLR
SYGAALFHGG RIIGMWAPMV LIFIKERTDL QTAMWGSPIV WIVAGLLWLS LPETLKGGLF
DKRKSNQPAN A