Gene VC0395_A0892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0892 
Symbol 
ID5137429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp905994 
End bp907199 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content49% 
IMG OID640532350 
Productextracellular solute-binding protein 
Protein accessionYP_001216838 
Protein GI147675043 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000149436 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTACTTG GCTGCGTGGT CTCGCCGATT TCTTGTCTTA CGAATATCGA CCACATTGTT 
GACGAGCCTG TTGCAGGGTG CGCAGGTATG GATAACCAAG GACGTCAAAC CATGAGTCTG
ATTCATCAAT CTGTATCACG AGTTATGAAG CGAAGCGTGC TGCTTATTGC AGCGGGTGTT
GCGCTGTTCA CAACTTCCGC TTTTGCAGAA GAGAAAGTTT ACCGTTTAAC ACTGGCTGAG
ACATGGGGAC CTAACTTCCC AATCTTCGGC GACACCACAA AGAATATGGC GGCAATGGCG
GAGAAAATGT CCAATGGCCG TTTACAAATT CGCATTGATT CTGCCAACAA ACATAAAGCG
CCACTGGGTG TGTTTGATAT GGTGAAATCG GGCCAATACG ACATGGGACA CTCCGCGTCT
TATTACTGGA AAGGCAAAGT TCCGAACACT TTGTATTTTA CCTCTATGCC TTTTGGTATG
ACGACCGGCG AGCAATACGC ATGGTTCTAC CACGGTGGTG GTATGGAACT GATGGAGAAG
GTTTACTCTC CACACAATAT GCTCTCATTC CCCGGTGGTA ACACCGACGT GCAGATGGGC
GGCTGGTTCC AAAAAGAGAT CAACAGCGTT GAAGATCTGC AAGGGCTGAA AATGCGCATT
CCCGGTTTTG CCGGAGAAGT GTTAGCGGAG CTCGGTGCCA AACCCACCAA CATTGCTCCG
GGAGAGCTAT ACACTTCGCT CGAGCGTCGC ACGATTGATG CGCTAGAGTG GGTAGGGCCG
TCACTTGATC TACGTATGGG TTTCCACAAG ATTGCGCCTT ATTACTACAC CGGTTGGCAT
GAGCCAGCAA CTGAGCTGCA ATTCTTAGTC AACCAAAAGA CGTGGGATAA ATTGCCTGAA
GATCTGCGTG AAATCTTACG TGTTGCGATG CGCACTGCGG CTTACGACAT GTATGTTCAA
TCAGTTCATG AAAGTGGCAA AAACTGGGTT TCGATTACCC AAGAGTTCCC AGATGTGAAA
GTGAAGACCT TCCCAGCGCC TGTGATTAAA GCCCTGCGTG AGGTTAACGA CCGACTACTG
GCGAAACACG CAGCAGAAGA TCCACTGGCT AAAGAGATTC AAGAATCGCA AGCCAACTAT
CTAAAACAAA CACGTAGTTG GACAGATATT TCACTGCGAG CCTACCTCAA TAGCGAATCA
CAATGA
 
Protein sequence
MLLGCVVSPI SCLTNIDHIV DEPVAGCAGM DNQGRQTMSL IHQSVSRVMK RSVLLIAAGV 
ALFTTSAFAE EKVYRLTLAE TWGPNFPIFG DTTKNMAAMA EKMSNGRLQI RIDSANKHKA
PLGVFDMVKS GQYDMGHSAS YYWKGKVPNT LYFTSMPFGM TTGEQYAWFY HGGGMELMEK
VYSPHNMLSF PGGNTDVQMG GWFQKEINSV EDLQGLKMRI PGFAGEVLAE LGAKPTNIAP
GELYTSLERR TIDALEWVGP SLDLRMGFHK IAPYYYTGWH EPATELQFLV NQKTWDKLPE
DLREILRVAM RTAAYDMYVQ SVHESGKNWV SITQEFPDVK VKTFPAPVIK ALREVNDRLL
AKHAAEDPLA KEIQESQANY LKQTRSWTDI SLRAYLNSES Q