Gene VC0395_A0266 details

Gene Information

Locus tag: VC0395_A0266
Symbol: aceB
ID: 5135210
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 279659
End bp: 281329
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 49%
IMG OID: 640531724
Product: malate synthase
Protein accession: YP_001216222
Protein GI: 147675709
COG category: [C] Energy production and conversion
COG ID: [COG2225] Malate synthase
TIGRFAM ID: [TIGR01344] malate synthase A
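
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 279659
end_bp = 281329
gene_length_bp = 1671
protein_length_aa = 556

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not counted as a residue.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # prints: 1671 557 556
```

Under those assumptions the span matches the reported gene length of 1671 bp, and 557 codons minus one stop codon gives the reported 556 aa.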


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.00405334
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCTTGG TGAGTGTTTC AAGGAAGGAA TTTATGTCAA TCAATACTGC TGAATTAACC 
CAACACACCA CGGAAACCAA AATCACCCAA GGCAAGCTTG CTGTGGTGGG TACCCTTTCA
CCCGAACATC AAGCGATTTT CCCTGTCGAA GCCCAAACTT TTTTACAGCA ACTCTGTGAG
CGCTTTGCGC CGCGAGTTGA TGAATTGCTG AACTTGCGTG AACAAAAACA AACGCTTATC
GACCAAGGCC AGCTACCTGA TTTTCTTGCC GAAACCAAAG ATATCCGTGA AGGAAGCTGG
AAGATTCTCG GCATTCCAGA CGATCTGCAA GATCGTCGCG TAGAGATCAC GGGGCCAATC
GATCGCAAAA TGGTGATTAA TGCACTGAAC GCGAATGTGA AAGTCTTTAT GGCTGACTTT
GAAGATTCGA TGTCGCCCGC GTGGGACAAG GTGTTGGATG GGCAGATCAA CCTGCGTGAT
GCGATTCTGG GCTGCATAAG CTATACCAAC CCAGATAATG GCAAGCGTTA TGAGCTCAAT
GCCAACCCAG CGGTGCTTAT CTGCCGCGTG CGCGGTCTGC ATCTCAAAGA AAAACATGTC
ACTTACAACG GCTTGGCGAT TCCCAGTTCA CTGTTCGATT TTGCGCTCTA TTTTTACAAC
AACCATCGCG CACTGTTGAA AAAAGGCAGT GGCCCTTATT TCTACCTACC TAAGCTGCAA
GCTTATCAAG AGGCGCAGTG GTGGAGTGAC GTGTTCCATT TCACGGAAGA TTATTTTGGT
TTAGACACAG GCACCATCAA AGCAACAGTG TTGATTGAAA CTCTGCCTGC GGTATTTCAG
ATGGATGAGA TCCTCTTCTC ACTCAAAGAG CACATTGTCG GTCTGAACTG TGGACGCTGG
GATTATATTT TCAGCTACAT CAAAACCTTG AAAAAGCATC CCGACCGCGT ATTGCCGGAT
CGCCAAGTGG TGACCATGGA TAAGCCTTTC CTCAATGCGT ATTCCCGTTT GTTGATTTAC
ACCTGTCATA AACGCGGCGC GTTTGCGATG GGCGGGATGG CGGCGTTTAT TCCAGCCAAA
GATCCTGCGC TCAATGAGCA AGTGCTACAA AAAATTCGCG GCGACAAAAT GCTTGAAGCG
ACCAATGGCC ACGATGGCAC TTGGGTTGCT CACCCCGGTT TAGCGGATAC CGCGATGGCG
GTGTTTAACG AAGTGCTTGG TGAGCGCAAA AACCAACTCA ATGTAAGCCG AATGGCCGAT
GCTCCGATAA CCGCGGCCGA TCTATTGGCA CCTTGCGATG GGCCACGCTC TGAGCATGGC
ATGCGCCACA ACATTCGCGT TGCGTTGCAA TACATCGAAG CATGGATTTC TGGTAACGGT
TGTGTGCCGA TCTACGGCTT GATGGAAGAT GCGGCGACAG CGGAGATTTC ACGCGCTTCG
ATTTGGCAAT GGATCCAACA CGGCAAAACG TTAGATAACG GGCAAGTGGT CACTAACGAA
CTGTTTCGCG ACTACCTCAA ACAAGAGATT GAGGTGGTGA AAAGTGAAGT GGGTGAGTCC
CGTTTTGCAC AAGGGCGTTT TACTGAAGCG GCCGAATTGA TGGAGCGCTT AACGACCAGT
CAAGAGCTAC CGAATTTCTT AACCATACCG GGCTACGACT ACTTGCCGTA G
 
Protein sequence
MFLVSVSRKE FMSINTAELT QHTTETKITQ GKLAVVGTLS PEHQAIFPVE AQTFLQQLCE 
RFAPRVDELL NLREQKQTLI DQGQLPDFLA ETKDIREGSW KILGIPDDLQ DRRVEITGPI
DRKMVINALN ANVKVFMADF EDSMSPAWDK VLDGQINLRD AILGCISYTN PDNGKRYELN
ANPAVLICRV RGLHLKEKHV TYNGLAIPSS LFDFALYFYN NHRALLKKGS GPYFYLPKLQ
AYQEAQWWSD VFHFTEDYFG LDTGTIKATV LIETLPAVFQ MDEILFSLKE HIVGLNCGRW
DYIFSYIKTL KKHPDRVLPD RQVVTMDKPF LNAYSRLLIY TCHKRGAFAM GGMAAFIPAK
DPALNEQVLQ KIRGDKMLEA TNGHDGTWVA HPGLADTAMA VFNEVLGERK NQLNVSRMAD
APITAADLLA PCDGPRSEHG MRHNIRVALQ YIEAWISGNG CVPIYGLMED AATAEISRAS
IWQWIQHGKT LDNGQVVTNE LFRDYLKQEI EVVKSEVGES RFAQGRFTEA AELMERLTTS
QELPNFLTIP GYDYLP
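
As a rough consistency check between the two sequences above, the gene sequence can be translated and compared with the listed protein. The sketch below is a minimal example, not the method used to produce this record. It assumes the sequence is given 5' to 3' on the coding strand and in frame from the first base, and it uses the amino acid assignments of the standard genetic code, which translation table 11 shares. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first 60 bases are shown; the full sequence can be pasted in the same way.

```python
from itertools import product

# Codon table in TCAG order; translation table 11 uses these same
# amino acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)

def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence from this record.
first_60 = "ATGTTCTTGG TGAGTGTTTC AAGGAAGGAA TTTATGTCAA TCAATACTGC TGAATTAACC"

print(translate(first_60))    # prints: MFLVSVSRKEFMSINTAELT
print(gc_fraction(first_60))  # prints: 0.35
```

The translated fragment matches the first 20 residues of the protein sequence. The 0.35 figure applies only to these 60 bases, not to the whole gene.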