Gene VC0395_A1196 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A1196 
SymbolgalM 
ID5136975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp1259574 
End bp1260626 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content51% 
IMG OID640532654 
Productaldose 1-epimerase 
Protein accessionYP_001217142 
Protein GI147675555 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2017] Galactose mutarotase and related enzymes 
TIGRFAM ID[TIGR02636] galactose mutarotase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCGT TATTCACCAG CATGACAGCA CAGGTCGCCT ATGATGGTCA GCCTGCCAAG 
CTTATTGAGC TCACTAACCG CCGCGGTATG CGTGTGGTAG TGATGGACAT CGGTGCCACT
TGGCTCAGTT GCACTCTACC GATGGGCGAT GAATCAAGAG AAGTGCTACT TGGCGTAAGC
AGCATGGATG ATTTTGTGCG CCAAGGCAGT TATTTAGGCG CAACGGTGGG GCGTTATGCC
AATCGGATTG CGCGTGGCGA ACTCAAGATA GGGACACAAA CGTATGCTTT GTCGGTCAAT
CAAGCTGGCA ATACGTTACA CGGTGGCGTT GTAGGGTTTG ATCGTCGTCG CTGGCAAATC
ACGCAGCAAA GCGCACAGCA TGTGACCTTT CAACTGCTTT CTGCTGACGG AGAACAAGGC
TTTCCGGGCA ACCTCCACGT TGCAGTGACC TACCGGTTGG ATGAGCAAGG TGGGGTGAAT
ATCGACTACC AAGCCACCAC CGATCGTGCG ACCGCCGTGA ATCTAACGAA CCACGCCTAC
TTTAATTTGA ATGGCGCTGA GCAAGGTAGT GATTGCCTCA ATCATCAGCT CTGGATTGAT
GCAAAGCAGT TCTTACCAAC GGATGCCTCG GGTATCCCGC TCGGGGAGTT GCAATCGGTA
CTGGGTAGCG GTTTTGATTT CACTCAACCG AAAAGGGTTG GGGAGGATTT GCTTCAAGAT
AAACAGCAAA TCCGTGCGAA AGGCTATGAC CACAGTTATT TCTTTGCGCC AGAGCGAGAT
ATGCACACGC CTATCGCTAA GGTGTGGTCT GCCGATGAGA AAGTGCAACT GCTCGTCAGT
ACGGATAAAC CTGCTATGCA GCTTTATACC GGTAATTGGT TGGCGGGAAC ACCCAATCGC
CTTGGTTCGC ACTACAAGGA TTACGCTGGC CTCGCTTTAG AAACGCAGTT TTTACCCGAT
TCCCCTCATC ATCCAGAATG GCTGCAACCG AGCTGCATCC TGCAACCCGG AGAAGTCTAT
CGCTATCAAA CGCGCTATCA GTTTGTTTTT TAA
 
Protein sequence
MNALFTSMTA QVAYDGQPAK LIELTNRRGM RVVVMDIGAT WLSCTLPMGD ESREVLLGVS 
SMDDFVRQGS YLGATVGRYA NRIARGELKI GTQTYALSVN QAGNTLHGGV VGFDRRRWQI
TQQSAQHVTF QLLSADGEQG FPGNLHVAVT YRLDEQGGVN IDYQATTDRA TAVNLTNHAY
FNLNGAEQGS DCLNHQLWID AKQFLPTDAS GIPLGELQSV LGSGFDFTQP KRVGEDLLQD
KQQIRAKGYD HSYFFAPERD MHTPIAKVWS ADEKVQLLVS TDKPAMQLYT GNWLAGTPNR
LGSHYKDYAG LALETQFLPD SPHHPEWLQP SCILQPGEVY RYQTRYQFVF