Gene VC0395_0208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_0208 
Symbol 
ID5135048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009456 
Strand
Start bp241051 
End bp242427 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content49% 
IMG OID640530531 
Productputative extracellular solute-binding protein 
Protein accessionYP_001215049 
Protein GI147672363 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0085185 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAACA AGTGGATTGG AGCTCTAGGT CTCCTACTAT CGGGTCAGTT GATGGCCAGC 
GAATTGGTGA TTGAAAGTTG GCGCGCCGAT GATAAGGCGC TGTGGGAGCA AAAAATCATC
CCCGCTTTTG AAGCTGCGAA TCCTGGCATC AAAGTGAAAT TTAACCCAGT GCCGAATGTG
AACTATACGC CAACGTTGTG GGAAAACTTA AAAGCTGGAA AAGCGGGAGA TTTGATCACT
TGCCGCCCGT TTGATGATTC TTTGGCACTT TTCAAAGCCG GGCACTTGGC AGAAATCACC
GAAATGGCTG GGATGGAAAA TTTCCCGAGC TTTGCCCAAG CGCCTTGGCA AACCGATTCT
GGGGCACAAA CCTTCTGTGT CCCTATGGCT TCAGTGATAC ATGGTTTTTT CTATAATAAG
AAAATTTTCA ATGAGTTAGG GTTGAGTGTG CCGCAAACCC GTGAACAGTT TTTTACCGTT
CTCGATAAAG TGAAAGCGGA TGGACGTTAT ATCCCCTTAT CGATGAGTGG TTCTGAAAGC
TGGGTCGCAT CTGAATTGGG CTATCAAAAT ATCGGCCCCA ATTATTGGAA AGGGGAAGAT
GGTCGTTTGG CTTTGATTAA TGGCCAAGAG CATTTGGACG ATTCACAGTA CGTGAAGGTG
TTTGAAGAGC TAGCGCGTTG GCGAGCCTAT TTAGGTGAGG ACGGTGAGCT GAGGGATTAT
GGCACCAGTA ATGAGCTCTT TACCTCGGGT AAAGCCGCGC TTTATGTGGC GGGTTCGTGG
GAAATTGCAC CATTTACCGA TAAAGTCGAT TTTGGCGTTA TGCGCCCTCC TGTCGCAAAG
CAAGGGGATG GCTGTTTCTT TACTGACCAC ACTGACATCG GTATGGGGAT GAACCCGGCC
AGCAAAAATC CGCAAGCAGC GATGGCCTTT TTACAATGGT TAACCACACC AGAGTTTGCT
GAGTTGTATA CCAATTCGCT ACCGGGGTTT TTCTCGTTGT CTAACCACTT CTTTGACGTC
ACCAATCCTG CGGCGCGTGA AATGATGGAG TGGCGCGATC AATGTGACTC AACCATTCGG
GTCGCAACGC AAATTCTTTC GCGTGGTCAA CCTAAGTTAG GTGATGAGCT TGCCGAAGTG
AGCCAAGCGG TATTGGTTGG CAAAATGACA CCAACCGCTG CGGCAGAGCG CCTAGAGCAA
GGTTTAAAAC GCTGGTACGC TCCCCATCAA ACTCGTAAAG CGAAAGAACA AGAGTGTCAA
TGTGTTGAGC CCATATCTCC GACGACGCGC CTAAATACAC TTCCTGTTGT TGATATTGCT
CCTGTCGTTG CAAGCGATCC TGTACCCCCA ACGGAGGCCA CAATATTGTC GGAATAA
 
Protein sequence
MTNKWIGALG LLLSGQLMAS ELVIESWRAD DKALWEQKII PAFEAANPGI KVKFNPVPNV 
NYTPTLWENL KAGKAGDLIT CRPFDDSLAL FKAGHLAEIT EMAGMENFPS FAQAPWQTDS
GAQTFCVPMA SVIHGFFYNK KIFNELGLSV PQTREQFFTV LDKVKADGRY IPLSMSGSES
WVASELGYQN IGPNYWKGED GRLALINGQE HLDDSQYVKV FEELARWRAY LGEDGELRDY
GTSNELFTSG KAALYVAGSW EIAPFTDKVD FGVMRPPVAK QGDGCFFTDH TDIGMGMNPA
SKNPQAAMAF LQWLTTPEFA ELYTNSLPGF FSLSNHFFDV TNPAAREMME WRDQCDSTIR
VATQILSRGQ PKLGDELAEV SQAVLVGKMT PTAAAERLEQ GLKRWYAPHQ TRKAKEQECQ
CVEPISPTTR LNTLPVVDIA PVVASDPVPP TEATILSE